On Dec 12, 12:58 pm, "Željko Filipin" <zeljko.fili...@gmail.com> wrote:
> Hi Christian,
>
> Comments are inline.
>
> On Dec 12, 2007 12:45 PM, Christian Fraenkel <semiha...@gmail.com> wrote:
> > C:/ruby/lib/ruby/1.8/rexml/parsers/treeparser.rb:90:in `parse':
> > Last 80 unconsumed characters:
> > <a title="Beim nächsten Start öffnen"
>
> I had problems with parsing xml when there was non-English characters. For
> more information see http://zeljkofilipin.com/2006/03/15/utf-8-and-ruby/
I ran into the problem you describe yesterday and had already found the
fix you posted for it (read: my code currently contains those lines and
still throws the error).
> > how are bug reports handled here, should I have tried to post this
> > straight on the JIRA tracker or ...?
>
> Since this looks like REXML bug, there is no point in creating Watir ticket,
> right?
It's not a REXML bug. REXML has every right to throw an error here, because
Watir is not feeding it correct XML.
In my case Watir is generating XML that contains ';' and '?' as part of an
attribute name. This is incorrect XML and has to be fixed on the Watir side.
Watir uses /^(\w|_|:)(.*)$/ to accept attribute names. This is too "open";
it needs to be much more restrictive, perhaps along the lines of
/^([a-zA-Z_:])([\w.:-]*)$/ (with the hyphen last in the character class, so
that it is a literal and not the start of a range). This is not perfect,
since it probably excludes lots of Unicode characters that are legal in
attribute names, but it is correct insofar as it does not allow any
forbidden characters (so REXML should never throw an error about that
anymore).
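
To make the difference concrete, here is a rough sketch comparing the permissive pattern with a stricter one. The constant and method names are made up for illustration and are not Watir's actual code. Two details are my own choices: the hyphen sits last in the character class (written as `.-_` it would form a range from '.' to '_', which includes ';' and '?'), and the anchors are \A and \z instead of ^ and $, because in Ruby ^ and $ match at line boundaries.

```ruby
# Hypothetical names for illustration; not taken from the Watir source.
PERMISSIVE_NAME = /^(\w|_|:)(.*)$/           # the pattern quoted above
STRICT_NAME     = /\A[a-zA-Z_:][\w.:-]*\z/   # hyphen last, so it is literal

# True only if the name is built from characters XML allows in a name
# (ASCII subset only; legal non-ASCII name characters are rejected too).
def valid_attribute_name?(name)
  !!(name =~ STRICT_NAME)
end

samples = ['title', 'xml:lang', 'data-id', 'foo;bar', 'href?x', '1abc']
samples.each do |name|
  permissive = !!(name =~ PERMISSIVE_NAME)
  strict     = valid_attribute_name?(name)
  puts "#{name.ljust(10)} permissive=#{permissive} strict=#{strict}"
end
```

With a check like this, names such as 'foo;bar' or 'href?x' could be skipped (or reported) before the XML is built, so the parser never sees them.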