Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

HTML::TableExtract w. perl 5.10

25 views
Skip to first unread message

Marko Riedel

unread,
Sep 27, 2012, 7:38:41 PM9/27/12
to

Greetings to all.

the following issue does not occur with perl 5.12, unfortunately I have
to work with 5.10 at my installation and I don't have the administration
rights just to upgrade my perl.

The version is:
"This is perl, v5.10.0 built for x86_64-linux-gnu-thread-multi".

I am trying to use HTML::TableExtract on an ISO-8859-1 encoded file. The
extraction works, the data are precisely what I want, but I always get a
warning, namely that "Parsing of undecoded UTF-8 will give garbage when
decoding entities".

Is there anything other than turning warnings off locally that I can do
to supress this warning? Or does this module not work with
latin1-encoded data? I also tried invoking utf8_mode(0), to no avail.

My version of HTML::Parser is 3.69 and of HTML::TableExtract 2.10.

Best regards,

Marko Riedel

Ben Morrow

unread,
Sep 27, 2012, 11:18:27 PM9/27/12
to

Quoth Marko Riedel <markor...@yahoo.de>:
>
> I am trying to use HTML::TableExtract on an ISO-8859-1 encoded file. The
> extraction works, the data are precisely what I want, but I always get a
> warning, namely that "Parsing of undecoded UTF-8 will give garbage when
> decoding entities".
>
> Is there anything other than turning warnings off locally that I can do
> to supress this warning? Or does this module not work with
> latin1-encoded data? I also tried invoking utf8_mode(0), to no avail.

Please post a *minimal* example of a program which exhibits this
behaviour.

Ben

Marko Riedel

unread,
Sep 28, 2012, 5:03:37 PM9/28/12
to
Greetings.

I will work on that, it's not that easy as the program is complex. In
the meantime, does anyone know how to get HTML::Parser to output the
position and the value of the offending byte sequence? I installed it in
my home directory so I can modify the source if necessary. The warning
is easy to find.

This looks tricky. Like I mentioned the code works fine with Perl 5.12.

Regards,

Marko
0 new messages