I found an encoding error in the trial data. The documentation
explicitly says all data files will be delivered in UTF-8 format, but at
least bank.data is iso-8859-1(5). It might help if the XML headers of
the data files could explicitly mention their encoding, to prevent any
confusion.
Regards,
--
Maarten van Gompel (Proycon), ILK, Universiteit Tilburg
pro...@anaproy.nl
pro...@unilang.org
--------------------------------------------------------------------------
Personal Homepage: http://proycon.anaproy.nl
My Language Technology Site: http://proylt.anaproy.nl
UniLang Language Community: http://www.unilang.org
--------------------------------------------------------------------------
JABBER: maar...@luon.net, AIM: proycon, YAHOO: proycon
MSN: pro...@anaproy.nl
--------------------------------------------------------------------------