How to find out what BS chokes on?

14 views
Skip to first unread message

LaundroMat

unread,
Feb 8, 2009, 1:33:05 PM2/8/09
to beautifulsoup
Hi, I'm trying to parse this page: http://www.1up.com/do/reviewPage?cId=3172670
and BS returns with HTMLParseError: malformed start tag, at line 990,
column 12 .

How can I find what element/tag in the HTML is causing this error?

Thanks in advance,

Mathieu

Jim Tittsler

unread,
Feb 8, 2009, 9:21:33 PM2/8/09
to beautifulsoup
2009/2/9 LaundroMat <Lau...@gmail.com>:

> Hi, I'm trying to parse this page: http://www.1up.com/do/reviewPage?cId=3172670
> and BS returns with HTMLParseError: malformed start tag, at line 990,
> column 12 .
>
> How can I find what element/tag in the HTML is causing this error?

The error message you quoted is telling you. Line 990, column 12
points to a broken width attribute of a td tag.

LaundroMat

unread,
Feb 9, 2009, 7:13:50 AM2/9/09
to beautifulsoup
On Feb 9, 3:21 am, Jim Tittsler <jtitts...@gmail.com> wrote:
> 2009/2/9 LaundroMat <Laun...@gmail.com>:
'Broken' in the sense that is a deprecated attribute? Or am I too
blind to see another issue with this tag?

Jim Tittsler

unread,
Feb 9, 2009, 1:56:52 PM2/9/09
to beauti...@googlegroups.com

Alec Schueler

unread,
Feb 9, 2009, 1:39:59 PM2/9/09
to beautifulsoup
There's a missing equals sign.

It reads '<td width"100%"...' where it should be '<td width="100%"...'
Reply all
Reply to author
Forward
0 new messages