Low Quality image but little no Noise

77 views
Skip to first unread message

Shester Msouobu

unread,
Sep 15, 2022, 5:43:21 AM9/15/22
to tesseract-ocr
Hey ! I have a set of lot quality images tesseract can't well read. Though there is literally no noise on there. Any help ? 

Example

images1871.png

Tesseract output 
"Cerra)" 

Zdenko Podobny

unread,
Sep 15, 2022, 5:44:14 AM9/15/22
to tesser...@googlegroups.com
Did you try documentation?

Zdenko


št 15. 9. 2022 o 11:43 Shester Msouobu <msouobu...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/d923d4e2-6eb0-42e9-bc80-76a6698406a9n%40googlegroups.com.

vc Jayan

unread,
Sep 15, 2022, 5:50:28 AM9/15/22
to tesser...@googlegroups.com
Hi
I think its due to the inverse binary issue. Black textbooks in white background is needed to detect and read. 

--

Helmut Wollmersdorfer

unread,
Sep 25, 2022, 3:02:34 AM9/25/22
to tesseract-ocr
With tesseract version 5.0.0 it works fine:

$ tesseract images1871.png images1871 --tessdata-dir /usr/local/share/tessdata hocr txt; cat images1871.txt

Estimating resolution as 106

Genesis 12:4

Seems you need to upgrade your Tesseract version.
Reply all
Reply to author
Forward
0 new messages