Textline finding fails


Pndaza

Apr 13, 2020, 11:37:40 PM
to tesseract-ocr
Tesseract fails at textline finding for Myanmar script.

At first I thought this was a PSM error, and I posted about it in the forum.

But this is not a PSM error. It is a textline finding error, like the one for Arabic script (ssues/657).

Tesseract separates the upper vowel from the baseline and marks it as a separate line.
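
For anyone who wants to check their own output for this symptom, here is a rough diagnostic sketch. It is not how Tesseract finds textlines and it does not fix anything; it only flags detected lines that look like marks split off the line below. The function name `flag_split_lines`, the two thresholds, and the sample boxes are all made up for illustration. The boxes are assumed to be (left, top, width, height) tuples, one per detected line, with y growing downward.

```python
from statistics import median


def flag_split_lines(boxes, height_ratio=0.5, max_gap_ratio=0.5):
    """Return indices of line boxes that look like marks split off the line below.

    boxes: list of (left, top, width, height) tuples, one per detected line.
    height_ratio and max_gap_ratio are illustrative thresholds, not tuned values.
    """
    if not boxes:
        return []

    # Use the median line height as the "typical" height of a real text line.
    typical = median(h for _, _, _, h in boxes)
    flagged = []

    for i, (left, top, width, height) in enumerate(boxes):
        if height >= height_ratio * typical:
            continue  # tall enough to be a real text line

        bottom = top + height
        right = left + width

        for j, (l2, t2, w2, h2) in enumerate(boxes):
            if j == i or h2 < height_ratio * typical:
                continue  # skip itself and other short fragments

            gap = t2 - bottom  # vertical distance down to the candidate parent line
            overlaps_horizontally = left < l2 + w2 and l2 < right

            # A short box sitting directly above (or slightly overlapping)
            # a normal line is probably a detached upper mark.
            if overlaps_horizontally and -height <= gap <= max_gap_ratio * typical:
                flagged.append(i)
                break

    return flagged


# Hypothetical boxes: a thin strip just above the first of two normal lines.
sample = [(10, 5, 200, 8), (10, 15, 200, 30), (10, 60, 200, 30)]
print(flag_split_lines(sample))
```

The median is only a reasonable reference if most detected lines are real lines, so on a page where nearly every line is split the thresholds would need adjusting.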


textline_fail.jpg
textline_fial_result.txt

Pndaza

Apr 14, 2020, 12:12:26 AM
to tesseract-ocr
Textline finding fails when base consonants and their upper vowel or asat are separate.
When base consonants and their upper vowel or asat are joined, it works fine.
textline.png

Shree Devi Kumar

Apr 14, 2020, 9:32:28 PM
to tesseract-ocr
I have also noticed the same for Javanese and Balinese scripts.
