DOCTYPE not parsing correctly

4 views
Skip to first unread message

ristretto.rb

unread,
Feb 8, 2009, 9:11:42 PM2/8/09
to beautifulsoup
Hello,

I'm getting the problem discussed here
http://groups.google.com/group/beautifulsoup/browse_thread/thread/69093cb0d3a3cf63

That bug states that "...if the DOCTYPE declaration doesn't parse
correctly,
BeautifulSoup puts the entire document into a single NavigableString."

Also, in that thread, is a fix in the BS source. I'm just not sure
how to apply the fix. The author provided a unit test, and I would
have thought the fix might have made it in to the latest BS release.
I just had a look at BS 3.1.0.1 and it doesn't look fixed. Perhaps it
is fixed somewhere else in the source, not sure. I don't really want
to upgrade if I don't have too.

And, I don't want to alter my copy of the BS code (fork it) and forget
about it when I upgrade someday. Does someone have a good idea how to
handle this?

thanks
gene

ristretto.rb

unread,
Feb 9, 2009, 12:10:47 AM2/9/09
to beautifulsoup
I'll just impl that unit test in my suite and fix in the BS source in
my tree.
Sorry for the interruption.

cheers
gene


On Feb 9, 3:11 pm, "ristretto.rb" <ristretto...@gmail.com> wrote:
> Hello,
>
> I'm getting the problem discussed here
>  http://groups.google.com/group/beautifulsoup/browse_thread/thread/690...
Reply all
Reply to author
Forward
0 new messages