beautifulsoup
1–30 of 1591
Yudhvir “Yash” Narwal, leonardr (3)
Sep 8
Beautiful soup maintainers
Hi, Thanks for the reply. I really appreciate it. I agree with you that there are many open source
Chris Papademetrious (2)
Sep 5
Wrapping sequences of non-block elements in <p>
I have an improved implementation that moves leading/trailing whitespace within the new <p> to
Heck Lennon, … Isaac Muse (7)
Sep 5
Why does BS ignore paragraphs with italics sub-element?
It should be noted that Soup Sieve offers: :-soup-contains() and :-soup-contains-own(). -soup was
赵正中(Sam Snowman), Isaac Muse (2)
Aug 26
What’s the story of the intro pic?
Alice in Wonderland https://aliceinwonderland.fandom.com/wiki/Turtle_Soup. The mock turtle sings a
fsn7...@gmail.com, … Chris Papademetrious (5)
Aug 22
Proper API reference required.
I use the Beautiful Soup documentation page that phoenix mentioned, along with the autocompletions
Evan Erwee
Aug 14
Can't use BeautifulSoup with lxml and cchardet in AWS Lambda.
I have tried several approaches but BeautifulSoup does not seem to work with AWS Lambda Layer with
Nathaniel Mills (2)
Aug 12
Porting from Java org.jsoup and need Class equivalents
I've decided to just port the logic to BeautifulSoup and use its native classes, etc. as there
Shiv Katira, Chris Papademetrious (2)
Aug 8
Unable to find element
Hi Shiv, Beautiful Soup does not use XPath. Do you have a question about Beautiful Soup? - Chris On
Heck Lennon, Chris Papademetrious (2)
Jul 28
Shorter way to create and add new tag + children?
Hi frdt, When I need to insert a complex HTML fragment like this, I use helper functions like this:
Peter Constable, Chris Papademetrious (2)
Jul 28
RE: matching text pattern in spite of intervening markup
Hi Peter, I cannot think of a generalized solution to this problem that doesn't rely on recursion
Heck Lennon, Chris Papademetrious (3)
Jul 9
Remove parent without removing children?
Right on! Thank you. ============ from bs4 import BeautifulSoup as BS from pathlib import Path
Simran, Chris Papademetrious (2)
Jul 4
how to scrape info inside hyperlinks
Hi Simran, I suggest getting started by going through the excellent Beautiful Soup tutorial and
Steve Clarke, Chris Papademetrious (2)
Jul 4
remove trailing / from col tag
Hi Stephen, I am not sure how to do this in Beautiful Soup, but here is how you could do it with post
Will Abbott, … Isaac Muse (13)
Jun 11
Cannot detect boolean attribute
Ok, thanks for clearing that up for me, it closes a path but opens some potential new ones. On
Chris Papademetrious, leonardr (3)
May 28
with "lxml", can I parse an HTML fragment without normalizing it to a full HTML document?
Thanks Leonard! I just wanted to make sure I wasn't missing something obvious. I have helper
Heck Lennon, … leonardr (18)
May 27
Find grand-child with double colon + "name" in name?
Looking over the code, it seems we use `f"'{pseudo}' pseudo-class is not implemented at
leonardr, Chris Papademetrious (5)
May 23
Beautiful Soup 4.13.0 beta 2
Hi Leonard, The 4.13-more-specific-than-pageelement branch resolves all the relevant unknown-method
leonardr, … Chris Papademetrious (3)
May 21
Beautiful Soup at PyCon US 2024
Hi Leonard, Sumana, It was great to meet you both at PyCon 2024! Thanks for throwing a wonderful (and
Chris Papademetrious, … Carlos (10)
May 21
copy.copy(soup) takes longer than expected
For reference, here is the issue I filed: #2065904: Improve copy.copy() runtime - Chris On Thursday,
Heck Lennon, Chris Papademetrious (7)
May 15
Read XML tree into treectrl?
Thx! On Wednesday, May 15, 2024 at 9:24:52 PM UTC+2 chris...@gmail.com wrote: I don't know the UI
Jonn Doe, Chris Papademetrious (3)
May 15
Parsing special characters to standard a-z
Thanks I was just wondering if there was a pre cooked routine lol. On Wed, 15 May 2024, 18:30 Chris
Elnatan Michael, Chris Papademetrious (2)
Apr 29
Unable to get bs4 from Beautiful soup
Hi Elnatan, What program are you trying to run? Did you install BeautifulSoup as described on the
Chris Papademetrious, leonardr (4)
Apr 28
how do I extend BeautifulSoup to add my own convenience methods?
Okay, I think I hit my first wrinkle. There are various methods and properties that work universally
Heck Lennon, Chris Papademetrious (3)
Apr 28
Right way to remove duplicates in head?
I'm glad you figured it out! In the future, if you need to perform case-insensitive string
Heck Lennon, leonardr (5)
Apr 24
Why doesn't BS add charset meta in header?
Thanks! On Wednesday, April 24, 2024 at 3:44:40 PM UTC+2 leonardr wrote: I think I can clear this up.
Per Göttlicher, … leonardr (5)
Apr 17
lxml and html.parser output differs
Thanks for filing the bug report! I was unsure if this was actually a bug or just a weird quirk of
Michael Brown, … leonardr (4)
Apr 11
JSP file parsing - extend html comments ??
Mike, The way to handle the JSP syntax would be to write a TreeBuilder implementation that can handle
Estefanìa Chávez, Carlos (2)
Apr 8
Webscraping of assets in Morningstar
Hello, the error FeatureNotFound in that case means that the parser you passed to the BeautifulSoup
Mansour Moufid, Chris Papademetrious (4)
Mar 29
Adding abbr tags to a document
Hi Mansour, Another way to do this is by building a regex pattern that matches any abbreviation in
Chris Papademetrious, leonardr (3)
Mar 24
is there a way to create a new_tag() without a soup object handy?
Hi everyone, I ended up going with an approach where I construct the desired new content as HTML: