Urdu language left to right output and no space recognize

28 views
Skip to first unread message

moen....@gmail.com

unread,
Aug 10, 2018, 8:45:47 AM8/10/18
to tesseract-ocr

i have trained my own model for urdu language using jtessboxeditor to create tiff/box file and then used Serak tesseract trainer for creating trainedata file, my model is recognizing urdu language but there are 2 issues mainly other than accuracy(accuracy will be test after solving following 2 issues).

  1. model is not recognizing the spaces b/w the words.
  2. model is showing the text in LTR form (Urdu is RTL language, similar to arabic)

thanx in advance.

baman.ms...@seecs.edu.pk

unread,
Sep 25, 2018, 6:42:05 AM9/25/18
to tesseract-ocr
Can you help me in getting urdu OCR output? i have been trying to do but it is giving false results. 
Reply all
Reply to author
Forward
0 new messages