UnicodeDecodeError: 'utf8' codec can't decode byte 0xf3

6 views
Skip to first unread message

Amir Pouya Agha Sadeghi

unread,
Sep 12, 2015, 12:56:12 PM9/12/15
to Yalign
Hi,
I recently found your yalign tools from google. It seems very interesting and simple to use so I decided to give it a try.
Unfortunately after installing it and trying the example in the git hub( the Wikipedia pages) it doesn't print anything. And when I try it with WMT es-en test corpus it gives me the fallowing error. I used apt-get to install it on kubuntu 15.04. Any idea? 

  File "/usr/local/bin/yalign-align", line 63, in <module> 
   document_a = read_document(args['<document_a>'], lang_a) 
 File "/usr/local/bin/yalign-align", line 41, in read_document 
   text = codecs.open(filename, encoding="utf-8").read() 
 File "/usr/lib/python2.7/codecs.py", line 674, in read 
   return self.reader.read(size) 
 File "/usr/lib/python2.7/codecs.py", line 480, in read 
   newchars, decodedbytes = self.decode(data, self.errors) 
UnicodeDecodeError: 'utf8' codec can't decode byte 0xf3 in position 3841: invalid continuation byte 
Reply all
Reply to author
Forward
0 new messages