Train 2 languages together

Zohreh Khosrobeygi

Jul 1, 2018, 12:36:06 PM
to tesseract-ocr
Hi,
I have been training on the following text:

272-135031-0000 BECAUSE YOU WERE SLEEPING INSTEAD OWHILE POOR SHAGGY SITS THERE A COOING DOVE
فیلم و و , منابع سال آگهی آخرين آخرین بود. ساخت و کنی

That is, the text contains both Persian and English. But when the TIFF file was created, all of the English text was removed. The TIFF file contains only this:

272-135031-0000
فیلم و و , منابع سال آگهی آخرين آخرین بود. ساخت و کنی

For Persian, though, we need to train both languages together.
How can I solve this problem? How can I train 2 languages together?
Thanks a lot.

Shree Devi Kumar

Jul 1, 2018, 1:32:55 PM
to tesser...@googlegroups.com
The font being used does not support English.
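
One way to confirm this before rendering is to compare the characters in the training text against the set of code points the font covers. The following is only a minimal sketch using the Python standard library: `script_of`, `uncovered_chars`, and the `persian_only_font` coverage set are hypothetical names and made-up data for illustration, not part of Tesseract. In practice the coverage set would have to come from the font itself, for example from its character map as read by a font library such as fontTools.

```python
import unicodedata


def script_of(ch):
    """Rough script label derived from the Unicode character name."""
    name = unicodedata.name(ch, "")
    if name.startswith("LATIN"):
        return "Latin"
    if name.startswith("ARABIC"):
        return "Arabic"
    return "Other"


def uncovered_chars(text, supported):
    """Sorted non-whitespace characters of `text` whose code points are not in `supported`."""
    return sorted({ch for ch in text if not ch.isspace() and ord(ch) not in supported})


# Hypothetical coverage of a Persian-only font (an assumption for this sketch):
# the Arabic block U+0600..U+06FF plus ASCII digits and a little punctuation.
persian_only_font = set(range(0x0600, 0x0700)) | {ord(c) for c in "0123456789-,."}

sample = "272-135031-0000 BECAUSE فیلم و و , منابع"
missing = uncovered_chars(sample, persian_only_font)
print("Characters the font cannot render:", missing)
print("Scripts affected:", {script_of(ch) for ch in missing})
```

If the Latin letters show up as missing, a font that covers both scripts is needed for the mixed training text.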


Zohreh Khosrobeygi

Jul 2, 2018, 8:09:38 AM
to tesseract-ocr
Thanks, you're right.