Can't encode transcription error with Arabic diacritics Fine Tuning process

30 views
Skip to first unread message

Fahad Al-Saidi

unread,
Dec 9, 2017, 2:20:07 AM12/9/17
to tesseract-ocr
Hi,

I am trying to fine tuning process for arabic diacritics. I just add one char shaddad ّ  to arbitrary word. I got first this error:

Normalization failed for string 'َّس'

then
Can't encode transcription: 'روص :ليجستلا هذه ،ةطساوب مالَّسلا ميوقتلا ال« ىلوألا' in language ''

In the wiki page, I read
Encoding of string failed! results when the text string for a training image cannot be encoded using the given unicharset.

but How I can fix it?


Thanks in advance,
Fahad
Reply all
Reply to author
Forward
0 new messages