Improve Text reading on image

100 views
Skip to first unread message

Nicholas Chew

unread,
May 11, 2015, 5:38:48 AM5/11/15
to tesser...@googlegroups.com
Hi
I need help to process this image to read the text. I had tried the command below but Tesseract still cant read it. What else did i miss? 

I need to use this cmd to run my program. 

convert DSCN6431.jpg -density 288 -resize 2000x1500 -extent 2000x1500 -morphology Smooth  Octagon:3 -level 70x100% -fill white -fuzz 10% +opaque "#000000" -blur 5 -threshold 70% text.jpg


DSCN6431.jpg
text.jpg

Allistair

unread,
May 11, 2015, 5:55:56 AM5/11/15
to tesser...@googlegroups.com
OK - so what OCR have you tried so far?

I got (psm 6) ...

I-" ” " ’  I
I“ 1""?
1' '.
_ % %
92907120 W% A%'% %


--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ae75f3aa-c02f-46f3-a985-2fac0f59abf4%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Nicholas Chew

unread,
May 11, 2015, 9:06:59 AM5/11/15
to tesser...@googlegroups.com
Hi Allistair 
Thanks for your reply. I had used Tesseract only. I need to process many images and i need the numbers on the images to be captured.
I am using Imagemagick to process my image before running Tess. 

Also i am using -psm 4 where i set the tesseract to read the bottom part. 

Allistair

unread,
May 11, 2015, 9:14:53 AM5/11/15
to tesser...@googlegroups.com
I guess I am not seeing the problem. Your pipeline is

raw image -> image preprocess -> tesseract psm 6 -> postprocess to find likely numeric string

Cheers

Reply all
Reply to author
Forward
0 new messages