Ryhmät
Kirjaudu
Ryhmät
beautifulsoup
Keskustelut
Tietoa
Lähetä palautetta
Ohje
beautifulsoup
Ota yhteyttä omistajiin ja ylläpitäjiin
1–30/1579
Merkitse kaikki luetuiksi
Tee ilmoitus ryhmästä
0 valittu
Will Abbott
, …
Isaac Muse
13
11. kesäk.
Cannot detect boolean attribute
Ok, thanks for clearing that up for me, it closes a path but opens some potential new ones. On
lukematon,
Cannot detect boolean attribute
Ok, thanks for clearing that up for me, it closes a path but opens some potential new ones. On
11. kesäk.
Chris Papademetrious
,
leonardr
3
28. toukok.
with "lxml", can I parse an HTML fragment without normalizing it to a full HTML document?
Thanks Leonard! I just wanted to make sure I wasn't missing something obvious. I have helper
lukematon,
with "lxml", can I parse an HTML fragment without normalizing it to a full HTML document?
Thanks Leonard! I just wanted to make sure I wasn't missing something obvious. I have helper
28. toukok.
Heck Lennon
, …
leonardr
18
27. toukok.
Find grand-child with double colon + "name" in name?
Looking over the code, it seems we use `f"'{pseudo}' pseudo-class is not implemented at
lukematon,
Find grand-child with double colon + "name" in name?
Looking over the code, it seems we use `f"'{pseudo}' pseudo-class is not implemented at
27. toukok.
leonardr
,
Chris Papademetrious
5
23. toukok.
Beautiful Soup 4.13.0 beta 2
Hi Leonard, The 4.13-more-specific-than-pageelement branch resolves all the relevant unknown-method
lukematon,
Beautiful Soup 4.13.0 beta 2
Hi Leonard, The 4.13-more-specific-than-pageelement branch resolves all the relevant unknown-method
23. toukok.
leonardr
, …
Chris Papademetrious
3
21. toukok.
Beautiful Soup at PyCon US 2024
Hi Leonard, Sumana, It was great to meet you both at PyCon 2024! Thanks for throwing a wonderful (and
lukematon,
Beautiful Soup at PyCon US 2024
Hi Leonard, Sumana, It was great to meet you both at PyCon 2024! Thanks for throwing a wonderful (and
21. toukok.
Chris Papademetrious
, …
Carlos
10
21. toukok.
copy.copy(soup) takes longer than expected
For reference, here is the issue I filed: #2065904: Improve copy.copy() runtime - Chris On Thursday,
lukematon,
copy.copy(soup) takes longer than expected
For reference, here is the issue I filed: #2065904: Improve copy.copy() runtime - Chris On Thursday,
21. toukok.
Heck Lennon
,
Chris Papademetrious
7
15. toukok.
Read XML tree into treectrl?
Thx! On Wednesday, May 15, 2024 at 9:24:52 PM UTC+2 chris...@gmail.com wrote: I don't know the UI
lukematon,
Read XML tree into treectrl?
Thx! On Wednesday, May 15, 2024 at 9:24:52 PM UTC+2 chris...@gmail.com wrote: I don't know the UI
15. toukok.
Jonn Doe
,
Chris Papademetrious
3
15. toukok.
Parsing special characters to standard a-z
Thanks I was just wondering if there was a pre cooked routine lol. On Wed, 15 May 2024, 18:30 Chris
lukematon,
Parsing special characters to standard a-z
Thanks I was just wondering if there was a pre cooked routine lol. On Wed, 15 May 2024, 18:30 Chris
15. toukok.
Elnatan Michael
,
Chris Papademetrious
2
29. huhtik.
Unable to get bs4 from Beautiful soup
Hi Elnatan, What program are you trying to run? Did you install BeautifulSoup as described on the
lukematon,
Unable to get bs4 from Beautiful soup
Hi Elnatan, What program are you trying to run? Did you install BeautifulSoup as described on the
29. huhtik.
Chris Papademetrious
,
leonardr
4
28. huhtik.
how do I extend BeautifulSoup to add my own convenience methods?
Okay, I think I hit my first wrinkle. There are various methods and properties that work universally
lukematon,
how do I extend BeautifulSoup to add my own convenience methods?
Okay, I think I hit my first wrinkle. There are various methods and properties that work universally
28. huhtik.
Heck Lennon
,
Chris Papademetrious
3
28. huhtik.
Right way to remove duplicates in head?
I'm glad you figured it out! In the future, if you need to perform case-insensitive string
lukematon,
Right way to remove duplicates in head?
I'm glad you figured it out! In the future, if you need to perform case-insensitive string
28. huhtik.
Heck Lennon
,
leonardr
5
24. huhtik.
Why doesn't BS add charset meta in header?
Thanks! On Wednesday, April 24, 2024 at 3:44:40 PM UTC+2 leonardr wrote: I think I can clear this up.
lukematon,
Why doesn't BS add charset meta in header?
Thanks! On Wednesday, April 24, 2024 at 3:44:40 PM UTC+2 leonardr wrote: I think I can clear this up.
24. huhtik.
Per Göttlicher
, …
leonardr
5
17. huhtik.
lxml and html.parser output differs
Thanks for filing the bug report! I was unsure if this was actually a bug or just a weird quirk of
lukematon,
lxml and html.parser output differs
Thanks for filing the bug report! I was unsure if this was actually a bug or just a weird quirk of
17. huhtik.
Michael Brown
, …
leonardr
4
11. huhtik.
JSP file parsing - extend html comments ??
Mike, The way to handle the JSP syntax would be to write a TreeBuilder implementation that can handle
lukematon,
JSP file parsing - extend html comments ??
Mike, The way to handle the JSP syntax would be to write a TreeBuilder implementation that can handle
11. huhtik.
Estefanìa Chávez
,
Carlos
2
8. huhtik.
Webscraping of assets in Morningstar
Hello, the error FeatureNotFound in that case means that the parser you passed to the BeautifulSoup
lukematon,
Webscraping of assets in Morningstar
Hello, the error FeatureNotFound in that case means that the parser you passed to the BeautifulSoup
8. huhtik.
Mansour Moufid
,
Chris Papademetrious
4
29. maalisk.
Adding abbr tags to a document
Hi Mansour, Another way to do this is by building a regex pattern that matches any abbreviation in
lukematon,
Adding abbr tags to a document
Hi Mansour, Another way to do this is by building a regex pattern that matches any abbreviation in
29. maalisk.
Chris Papademetrious
,
leonardr
3
24. maalisk.
is there a way to create a new_tag() without a soup object handy?
Hi everyone, I ended up going with an approach where I construct the desired new content as HTML:
lukematon,
is there a way to create a new_tag() without a soup object handy?
Hi everyone, I ended up going with an approach where I construct the desired new content as HTML:
24. maalisk.
Chris M
, …
Carlos
4
19. maalisk.
Beautiful Soup Cheat Sheet posted
Hello Chris and Phoenix. Thank you for comments on the cheat sheet. I've never used Jupyter
lukematon,
Beautiful Soup Cheat Sheet posted
Hello Chris and Phoenix. Thank you for comments on the cheat sheet. I've never used Jupyter
19. maalisk.
Frattos Dj (Dj Frattos)
,
Chris Papademetrious
2
17. maalisk.
How working correctly with Beautifulsoup to not generate Type Checking alerts in VSCode?
Try adding a type annotation to let VSCode know that table_stats_body contains Tag objects:
lukematon,
How working correctly with Beautifulsoup to not generate Type Checking alerts in VSCode?
Try adding a type annotation to let VSCode know that table_stats_body contains Tag objects:
17. maalisk.
Jošt Prevc
4. maalisk.
Copy of BeautifulSoup object does not preserve element_classes
The code below shows the problem where making a copy does not preserve the element_classes value in
lukematon,
Copy of BeautifulSoup object does not preserve element_classes
The code below shows the problem where making a copy does not preserve the element_classes value in
4. maalisk.
Twinkal Paralkar
,
leonardr
2
29. helmik.
code is not working for long script
Twinkal, It's difficult to say what's going on without knowing the URL or markup that is
lukematon,
code is not working for long script
Twinkal, It's difficult to say what's going on without knowing the URL or markup that is
29. helmik.
Heck Lennon
,
leonardr
5
28. helmik.
Data in head doesn't match what it says in the file
Fantastic! Thank you. On Wednesday, February 28, 2024 at 6:48:43 PM UTC+1 leonardr wrote: Yes,
lukematon,
Data in head doesn't match what it says in the file
Fantastic! Thank you. On Wednesday, February 28, 2024 at 6:48:43 PM UTC+1 leonardr wrote: Yes,
28. helmik.
Krisztián Pintér
,
leonardr
2
28. helmik.
parsing invalid href
It looks like this is a strategy for parsing ambiguous HTML specific to Python's built-in HTML
lukematon,
parsing invalid href
It looks like this is a strategy for parsing ambiguous HTML specific to Python's built-in HTML
28. helmik.
Chris Papademetrious
20. tammik.
some initial findings with the 4.13 branch
Hi Leonard, I tried the 4.13 branch out on one of our content processing pipelines at my day job. It
lukematon,
some initial findings with the 4.13 branch
Hi Leonard, I tried the 4.13 branch out on one of our content processing pipelines at my day job. It
20. tammik.
leonardr
,
Tara Matheney
3
18. tammik.
Beautiful Soup 4.12.3
Thanks for letting me know; I've fixed the problem. Leonard On Thursday, January 18, 2024 at 8:47
lukematon,
Beautiful Soup 4.12.3
Thanks for letting me know; I've fixed the problem. Leonard On Thursday, January 18, 2024 at 8:47
18. tammik.
محمدمهدی خدادوست
,
Carlos
2
15. tammik.
upload a photo and copy link
Hello, Marco. That page uses a JavaScript function to read the content of the selected QR images.
lukematon,
upload a photo and copy link
Hello, Marco. That page uses a JavaScript function to read the content of the selected QR images.
15. tammik.
Delong Wang
,
Carlos
2
15. tammik.
Update Chinese doc from 4.4 to 4.12
You can easily incorporate your changes to the official repo by cloning it on your Launchpad account,
lukematon,
Update Chinese doc from 4.4 to 4.12
You can easily incorporate your changes to the official repo by cloning it on your Launchpad account,
15. tammik.
Chris Papademetrious
10. tammik.
converting flat HTML to hierarchical HTML based on heading levels
Hi everyone, In various forums (Stack Overflow, etc.), I've seen many people ask how to convert
lukematon,
converting flat HTML to hierarchical HTML based on heading levels
Hi everyone, In various forums (Stack Overflow, etc.), I've seen many people ask how to convert
10. tammik.
Srikanta Raju
8. tammik.
IXBRL generator
hi, I am trying to a build ixbrl generator through word addin. I am able to add tags to word document
lukematon,
IXBRL generator
hi, I am trying to a build ixbrl generator through word addin. I am able to add tags to word document
8. tammik.
Chris Papademetrious
4
29.12.2023
skipping whitespace string tags when navigating the tree?
My understanding of PageElement was incorrect (I thought it was the parent type of Tag,
lukematon,
skipping whitespace string tags when navigating the tree?
My understanding of PageElement was incorrect (I thought it was the parent type of Tag,
29.12.2023