Groups
Sign in
Groups
beautifulsoup
Conversations
About
Send feedback
Help
beautifulsoup
Contact owners and managers
1–30 of 1584
Mark all as read
Report group
0 selected
Heck Lennon
Jul 15
Shorter way to create and add new tag + children?
Hello, This code works, but I was wondering if there's a shorter way to create a new element that
unread,
Shorter way to create and add new tag + children?
Hello, This code works, but I was wondering if there's a shorter way to create a new element that
Jul 15
Peter Constable
Jul 11
RE: matching text pattern in spite of intervening markup
I'd like to gather information from a set of HTML pages created over many years. The information
unread,
RE: matching text pattern in spite of intervening markup
I'd like to gather information from a set of HTML pages created over many years. The information
Jul 11
Heck Lennon
,
Chris Papademetrious
3
Jul 9
Remove parent without removing children?
Right on! Thank you. ============ from bs4 import BeautifulSoup as BS from pathlib import Path
unread,
Remove parent without removing children?
Right on! Thank you. ============ from bs4 import BeautifulSoup as BS from pathlib import Path
Jul 9
Simran
,
Chris Papademetrious
2
Jul 4
how to scrape info inside hyperlinks
Hi Simran, I suggest getting started by going through the excellent Beautiful Soup tutorial and
unread,
how to scrape info inside hyperlinks
Hi Simran, I suggest getting started by going through the excellent Beautiful Soup tutorial and
Jul 4
Steve Clarke
,
Chris Papademetrious
2
Jul 4
remove trailing / from col tag
Hi Stephen, I am not sure how to do this in Beautiful Soup, but here is how you could do it with post
unread,
remove trailing / from col tag
Hi Stephen, I am not sure how to do this in Beautiful Soup, but here is how you could do it with post
Jul 4
Will Abbott
, …
Isaac Muse
13
Jun 11
Cannot detect boolean attribute
Ok, thanks for clearing that up for me, it closes a path but opens some potential new ones. On
unread,
Cannot detect boolean attribute
Ok, thanks for clearing that up for me, it closes a path but opens some potential new ones. On
Jun 11
Chris Papademetrious
,
leonardr
3
May 28
with "lxml", can I parse an HTML fragment without normalizing it to a full HTML document?
Thanks Leonard! I just wanted to make sure I wasn't missing something obvious. I have helper
unread,
with "lxml", can I parse an HTML fragment without normalizing it to a full HTML document?
Thanks Leonard! I just wanted to make sure I wasn't missing something obvious. I have helper
May 28
Heck Lennon
, …
leonardr
18
May 27
Find grand-child with double colon + "name" in name?
Looking over the code, it seems we use `f"'{pseudo}' pseudo-class is not implemented at
unread,
Find grand-child with double colon + "name" in name?
Looking over the code, it seems we use `f"'{pseudo}' pseudo-class is not implemented at
May 27
leonardr
,
Chris Papademetrious
5
May 23
Beautiful Soup 4.13.0 beta 2
Hi Leonard, The 4.13-more-specific-than-pageelement branch resolves all the relevant unknown-method
unread,
Beautiful Soup 4.13.0 beta 2
Hi Leonard, The 4.13-more-specific-than-pageelement branch resolves all the relevant unknown-method
May 23
leonardr
, …
Chris Papademetrious
3
May 21
Beautiful Soup at PyCon US 2024
Hi Leonard, Sumana, It was great to meet you both at PyCon 2024! Thanks for throwing a wonderful (and
unread,
Beautiful Soup at PyCon US 2024
Hi Leonard, Sumana, It was great to meet you both at PyCon 2024! Thanks for throwing a wonderful (and
May 21
Chris Papademetrious
, …
Carlos
10
May 21
copy.copy(soup) takes longer than expected
For reference, here is the issue I filed: #2065904: Improve copy.copy() runtime - Chris On Thursday,
unread,
copy.copy(soup) takes longer than expected
For reference, here is the issue I filed: #2065904: Improve copy.copy() runtime - Chris On Thursday,
May 21
Heck Lennon
,
Chris Papademetrious
7
May 15
Read XML tree into treectrl?
Thx! On Wednesday, May 15, 2024 at 9:24:52 PM UTC+2 chris...@gmail.com wrote: I don't know the UI
unread,
Read XML tree into treectrl?
Thx! On Wednesday, May 15, 2024 at 9:24:52 PM UTC+2 chris...@gmail.com wrote: I don't know the UI
May 15
Jonn Doe
,
Chris Papademetrious
3
May 15
Parsing special characters to standard a-z
Thanks I was just wondering if there was a pre cooked routine lol. On Wed, 15 May 2024, 18:30 Chris
unread,
Parsing special characters to standard a-z
Thanks I was just wondering if there was a pre cooked routine lol. On Wed, 15 May 2024, 18:30 Chris
May 15
Elnatan Michael
,
Chris Papademetrious
2
Apr 29
Unable to get bs4 from Beautiful soup
Hi Elnatan, What program are you trying to run? Did you install BeautifulSoup as described on the
unread,
Unable to get bs4 from Beautiful soup
Hi Elnatan, What program are you trying to run? Did you install BeautifulSoup as described on the
Apr 29
Chris Papademetrious
,
leonardr
4
Apr 28
how do I extend BeautifulSoup to add my own convenience methods?
Okay, I think I hit my first wrinkle. There are various methods and properties that work universally
unread,
how do I extend BeautifulSoup to add my own convenience methods?
Okay, I think I hit my first wrinkle. There are various methods and properties that work universally
Apr 28
Heck Lennon
,
Chris Papademetrious
3
Apr 28
Right way to remove duplicates in head?
I'm glad you figured it out! In the future, if you need to perform case-insensitive string
unread,
Right way to remove duplicates in head?
I'm glad you figured it out! In the future, if you need to perform case-insensitive string
Apr 28
Heck Lennon
,
leonardr
5
Apr 24
Why doesn't BS add charset meta in header?
Thanks! On Wednesday, April 24, 2024 at 3:44:40 PM UTC+2 leonardr wrote: I think I can clear this up.
unread,
Why doesn't BS add charset meta in header?
Thanks! On Wednesday, April 24, 2024 at 3:44:40 PM UTC+2 leonardr wrote: I think I can clear this up.
Apr 24
Per Göttlicher
, …
leonardr
5
Apr 17
lxml and html.parser output differs
Thanks for filing the bug report! I was unsure if this was actually a bug or just a weird quirk of
unread,
lxml and html.parser output differs
Thanks for filing the bug report! I was unsure if this was actually a bug or just a weird quirk of
Apr 17
Michael Brown
, …
leonardr
4
Apr 11
JSP file parsing - extend html comments ??
Mike, The way to handle the JSP syntax would be to write a TreeBuilder implementation that can handle
unread,
JSP file parsing - extend html comments ??
Mike, The way to handle the JSP syntax would be to write a TreeBuilder implementation that can handle
Apr 11
Estefanìa Chávez
,
Carlos
2
Apr 8
Webscraping of assets in Morningstar
Hello, the error FeatureNotFound in that case means that the parser you passed to the BeautifulSoup
unread,
Webscraping of assets in Morningstar
Hello, the error FeatureNotFound in that case means that the parser you passed to the BeautifulSoup
Apr 8
Mansour Moufid
,
Chris Papademetrious
4
Mar 29
Adding abbr tags to a document
Hi Mansour, Another way to do this is by building a regex pattern that matches any abbreviation in
unread,
Adding abbr tags to a document
Hi Mansour, Another way to do this is by building a regex pattern that matches any abbreviation in
Mar 29
Chris Papademetrious
,
leonardr
3
Mar 24
is there a way to create a new_tag() without a soup object handy?
Hi everyone, I ended up going with an approach where I construct the desired new content as HTML:
unread,
is there a way to create a new_tag() without a soup object handy?
Hi everyone, I ended up going with an approach where I construct the desired new content as HTML:
Mar 24
Chris M
, …
Carlos
4
Mar 19
Beautiful Soup Cheat Sheet posted
Hello Chris and Phoenix. Thank you for comments on the cheat sheet. I've never used Jupyter
unread,
Beautiful Soup Cheat Sheet posted
Hello Chris and Phoenix. Thank you for comments on the cheat sheet. I've never used Jupyter
Mar 19
Frattos Dj (Dj Frattos)
,
Chris Papademetrious
2
Mar 17
How working correctly with Beautifulsoup to not generate Type Checking alerts in VSCode?
Try adding a type annotation to let VSCode know that table_stats_body contains Tag objects:
unread,
How working correctly with Beautifulsoup to not generate Type Checking alerts in VSCode?
Try adding a type annotation to let VSCode know that table_stats_body contains Tag objects:
Mar 17
Jošt Prevc
Mar 4
Copy of BeautifulSoup object does not preserve element_classes
The code below shows the problem where making a copy does not preserve the element_classes value in
unread,
Copy of BeautifulSoup object does not preserve element_classes
The code below shows the problem where making a copy does not preserve the element_classes value in
Mar 4
Twinkal Paralkar
,
leonardr
2
Feb 29
code is not working for long script
Twinkal, It's difficult to say what's going on without knowing the URL or markup that is
unread,
code is not working for long script
Twinkal, It's difficult to say what's going on without knowing the URL or markup that is
Feb 29
Heck Lennon
,
leonardr
5
Feb 28
Data in head doesn't match what it says in the file
Fantastic! Thank you. On Wednesday, February 28, 2024 at 6:48:43 PM UTC+1 leonardr wrote: Yes,
unread,
Data in head doesn't match what it says in the file
Fantastic! Thank you. On Wednesday, February 28, 2024 at 6:48:43 PM UTC+1 leonardr wrote: Yes,
Feb 28
Krisztián Pintér
,
leonardr
2
Feb 28
parsing invalid href
It looks like this is a strategy for parsing ambiguous HTML specific to Python's built-in HTML
unread,
parsing invalid href
It looks like this is a strategy for parsing ambiguous HTML specific to Python's built-in HTML
Feb 28
Chris Papademetrious
Jan 20
some initial findings with the 4.13 branch
Hi Leonard, I tried the 4.13 branch out on one of our content processing pipelines at my day job. It
unread,
some initial findings with the 4.13 branch
Hi Leonard, I tried the 4.13 branch out on one of our content processing pipelines at my day job. It
Jan 20
leonardr
,
Tara Matheney
3
Jan 18
Beautiful Soup 4.12.3
Thanks for letting me know; I've fixed the problem. Leonard On Thursday, January 18, 2024 at 8:47
unread,
Beautiful Soup 4.12.3
Thanks for letting me know; I've fixed the problem. Leonard On Thursday, January 18, 2024 at 8:47
Jan 18