beautifulsoup
Re: Crash on bad doctype declaration
Leonard Richardson
Jul 13, 2012, 1:20:22 PM
to beauti...@googlegroups.com
> Has anyone seen this? Do you have a workaround?
>
> It looks like this is a bug in libxml2 though.
It is a bug in libxml2. I filed the bug back in March and the lxml
developer committed a fix to the development branch.
https://bugs.launchpad.net/lxml/+bug/984936
Apart from upgrading lxml, the best workaround would be to parse the
document with html5lib or html.parser.
http://www.crummy.com/software/BeautifulSoup/bs4/doc/#installing-a-parser
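
For anyone who wants to try the parser switch, here is a minimal sketch of what that could look like with bs4. The helper name make_soup and the sample markup are made up for illustration and are not from the original report. It assumes bs4 is installed, tries html5lib first, and falls back to Python's built-in html.parser if html5lib is missing.

```python
from bs4 import BeautifulSoup, FeatureNotFound


def make_soup(markup, parsers=("html5lib", "html.parser")):
    """Parse markup with the first available parser, skipping lxml entirely.

    make_soup is a hypothetical helper for illustration, not part of bs4.
    """
    for name in parsers:
        try:
            return BeautifulSoup(markup, name)
        except FeatureNotFound:
            # bs4 raises FeatureNotFound when the requested parser
            # library is not installed; try the next one.
            continue
    raise RuntimeError("none of the requested parsers is installed")


# Illustrative markup only; this is not the document that triggered the crash.
sample = "<!DOCTYPE html><html><body><p>Hello</p></body></html>"
soup = make_soup(sample)
print(soup.p.get_text())
```

Naming the parser explicitly matters here: if you leave the second argument out, bs4 picks the "best" installed parser, which is usually lxml when it is available.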
Leonard