Low accuracy in bold font


afrizal firdaus

Mar 27, 2017, 3:48:20 AM
to tesseract-ocr
Hello guys. I am trying to OCR pictures that contain text in a meme font, but I always get bad results for this font.


result : PlEflSE

result : TIEllMIE MUHE

Can anyone help me please?

Thank you


ShreeDevi Kumar

Mar 27, 2017, 5:17:57 AM
to tesser...@googlegroups.com
Try the latest version of tesseract, built from master, and use --psm 7 --oem 1.

I get the correct result for both images.

tesseract unnamed1.png unnamed1 --psm 7 --oem 1

Tesseract Open Source OCR Engine v4.00.00alpha-347-g60c8b12 with Leptonica
Warning. Invalid resolution 0 dpi. Using 70 instead.
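
If you want to run the same command from a script, here is a minimal Python sketch. It only wraps the command line shown above; the helper names `build_tesseract_command` and `ocr_single_line` are made up for illustration, and it assumes a `tesseract` binary with LSTM support is on your PATH. `--psm 7` treats the image as a single text line and `--oem 1` selects the LSTM engine.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, output_base, psm=7, oem=1):
    """Return the argument list for the CLI call shown in this thread.

    Hypothetical helper: it only assembles the arguments and does not
    run anything.
    """
    return [
        "tesseract", image_path, output_base,
        "--psm", str(psm),   # 7 = treat the image as a single text line
        "--oem", str(oem),   # 1 = LSTM engine only
    ]


def ocr_single_line(image_path, output_base):
    """Run tesseract and return the recognized text (assumes tesseract is installed)."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    subprocess.run(build_tesseract_command(image_path, output_base), check=True)
    # tesseract writes its plain-text result to <output_base>.txt
    with open(output_base + ".txt", encoding="utf-8") as fh:
        return fh.read().strip()
```

For example, `ocr_single_line("unnamed1.png", "unnamed1")` would run the command above and read back unnamed1.txt.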


ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com



--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/1f9c0d7a-b1aa-4fbf-85d9-93f227379791%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

unnamed1.txt
unnamed.txt
unnamed1.png
unnamed.png

afrizal firdaus

Mar 31, 2017, 12:04:54 AM
to tesseract-ocr
Thank you, Shree.

I just tried it and that solved my problem :)

But there is something odd about my tesseract.
When I type 'tesseract -v' I get the following output:

tesseract 362b68e
 leptonica-1.74.1
  libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8

 Found AVX
 Found SSE

Why is the version "tesseract 362b68e"? Is that normal?

ShreeDevi Kumar

Mar 31, 2017, 2:48:50 AM
to tesser...@googlegroups.com
Did you build it with the debug option?

That number is the git revision of the code, so it is easy to tell which source commit the build came from.

Look on GitHub for the commit that begins with that number.
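
As a rough sketch of that lookup, the snippet below pulls the revision out of the first line of the `tesseract -v` output and builds a GitHub commit URL from it. The function name `commit_url_from_version` is hypothetical, and it assumes the build came from the tesseract-ocr/tesseract repository and that the abbreviated hash is unique there.

```python
import re


def commit_url_from_version(version_output,
                            repo="https://github.com/tesseract-ocr/tesseract"):
    """Return a commit URL if the version line is a bare git hash, else None.

    Assumption: the first line looks like "tesseract 362b68e", as in the
    output quoted in this thread. Release-style versions return None.
    """
    lines = version_output.strip().splitlines()
    if not lines:
        return None
    # Accept only an abbreviated or full hexadecimal git hash after the name.
    match = re.fullmatch(r"tesseract\s+([0-9a-f]{7,40})", lines[0].strip())
    if match is None:
        return None
    return f"{repo}/commit/{match.group(1)}"


print(commit_url_from_version("tesseract 362b68e\n leptonica-1.74.1"))
```

Opening the printed URL in a browser should show the commit the binary was built from, provided the assumptions above hold.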

ShreeDevi