missing a line in OCR persian

98 views
Skip to first unread message

reza

unread,
May 22, 2018, 1:15:05 AM5/22/18
to tesseract-ocr
i used tesseract 4 beta for OCR. but the results had some missing words (line 2 have missed).
i attached the PNG and results.

 

می‌شود آسانتر است از زبانهایی مثل فارسی و عربی که حروف یک کلمه به یکدیگر می‌چسبند. این موضوع به
باشند. البته در سالهای اخیر تلاش‌های قابل تقدیری از سوی برخی شرکتهای فعال در زمینه پردازش تصویر انجام

شده که برخی از آنها منجر به محصولات قابل قبولی شده‌است

0002.png

ShreeDevi Kumar

unread,
May 22, 2018, 1:57:19 AM5/22/18
to tesser...@googlegroups.com
Entire lines of text missing. Different missing when psm = 3, 6, 11

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/eb8581d9-d277-462a-bf4b-a9a4146e211a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

reza

unread,
May 22, 2018, 2:06:47 AM5/22/18
to tesseract-ocr
thanks for your replu shree

there isn't any solution for this missing yet ?


On Tuesday, May 22, 2018 at 10:27:19 AM UTC+4:30, shree wrote:
Entire lines of text missing. Different missing when psm = 3, 6, 11

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

2018-05-22 10:45 GMT+05:30 reza <reza...@gmail.com>:
i used tesseract 4 beta for OCR. but the results had some missing words (line 2 have missed).
i attached the PNG and results.

 

می‌شود آسانتر است از زبانهایی مثل فارسی و عربی که حروف یک کلمه به یکدیگر می‌چسبند. این موضوع به
باشند. البته در سالهای اخیر تلاش‌های قابل تقدیری از سوی برخی شرکتهای فعال در زمینه پردازش تصویر انجام

شده که برخی از آنها منجر به محصولات قابل قبولی شده‌است

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages