--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ae0aa097-93ba-4424-baf5-b4ed93ca574a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
You could try doing your own layout analysis instead of relying o tesseract's auto mode?Have you tried gimagereader and vietocr as gui interface for tesseract for Nepali?
ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
On Wed, Aug 23, 2017 at 10:03 AM, Nirajan Pant <nira...@gmail.com> wrote:
I am working on GUI for tesseract OCR 4.0.0 (Nepali Language). When I started analysis of the recognition results I found some missing words or sentences. To find the reason behind this I just draw the boxes detected by tesseract (using hocr) recognition result. The detection was shown here-This is a part of document with paragraph detection error. Red line is the boundary of detected paragraph (second column of original image given below).The original image is:Help me to deal with this issue.
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/8e726246-a186-47f7-9850-f49441e75191%40googlegroups.com.
I am working on GUI for tesseract OCR 4.0.0 (Nepali Language). When I started analysis of the recognition results I found some missing words or sentences. To find the reason behind this I just draw the boxes detected by tesseract (using hocr) recognition result. The detection was shown here-