any chance to get this .tiff converted to text?


boris

Oct 27, 2014, 7:47:59 PM
to tesser...@googlegroups.com
Hi Community,

I started using Tesseract today because I need it for a small project. I have run Tesseract on the attached .tiff file, but I get very poor results. Am I doing something wrong, or is the quality/contrast of the .tiff just too bad?

The example file was taken from a .bmp screenshot and then converted to .tiff without compression.

I would be very grateful for any answers. Please keep in mind when answering that I am a novice with Tesseract.

Best Regards,

   Boris
example.tiff

ShreeDevi Kumar

Oct 28, 2014, 4:19:07 AM
to tesser...@googlegroups.com

but just OCRing the image without any changes in VietOCR (a GUI frontend for Tesseract) with the German traineddata gives a perfect result; see the image.

What version are you using, and on what platform?

I would suggest that you try VietOCR: download the German traineddata and give it a try.


Inline image 1

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ef80a595-4f6d-49f8-aa6f-4cae85122c6f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

boris

Oct 29, 2014, 11:38:40 AM
to tesser...@googlegroups.com
Hi Shree,

Many thanks for your help. I have installed the German traineddata and VietOCR, but I still get very poor results.

"Bme a\|fpn|fen mn" is what I get

I must be doing something wrong. 


My platform is Windows 8.1 


Best Regards,

   Boris
OCRresult.gif

ShreeDevi Kumar

Oct 29, 2014, 1:11:42 PM
to tesser...@googlegroups.com
Please choose German in the language dropdown on the right-hand side.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


boris

Oct 29, 2014, 2:10:49 PM
to tesser...@googlegroups.com
Hi Shree,

I have changed the language to German, but the results don't really improve.

Anyhow, I am thinking of programming my own OCR for my project, as I need 100% accuracy. It won't be a big deal because the font is always the same.

Regards,

   Boris

ShreeDevi Kumar

Oct 30, 2014, 9:50:39 AM
to tesser...@googlegroups.com
for pre-processing steps for your images to improve recognition regardless of the OCR you use.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


Quan Nguyen

Oct 30, 2014, 7:02:38 PM
to tesser...@googlegroups.com
Hi Boris,

Be sure to select Screenshot Mode. The image's resolution is too low.

Quan

boris

Oct 31, 2014, 2:05:20 AM
to tesser...@googlegroups.com
Hi Quan,

Thanks for your help.

Sorry, this might be a silly question, but as I said, I am a novice with Tesseract ;-)
How do I select Screenshot Mode?

Regards,

  Boris


ShreeDevi Kumar

Oct 31, 2014, 3:22:01 AM
to tesser...@googlegroups.com
In VietOCR's Image menu, check 'Screenshot Mode'.

Use the Filters submenu to experiment with other settings to improve your image.

Look under Properties for the DPI, and convert your input images to 300 dpi, as they are currently low resolution (72 dpi or so).

Experiment :-)

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


Boris Riter

Oct 31, 2014, 3:42:43 AM
to tesser...@googlegroups.com

OMG it works!!!!

 

Thank you so much!

 

I have one last question: I need the command-line syntax for this.

 

TGIF,

 

   Boris


Quan Nguyen

Oct 31, 2014, 8:17:40 PM
to tesser...@googlegroups.com
To see the command-line syntax, run either of:

java -jar vietocr.jar -?

vietocr -?

The command-line mode does not support rescaling of images though. Use ImageMagick's convert command to rescale or resize.

http://www.imagemagick.org/Usage/resize/
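
For illustration only, here is a rough sketch of how the rescale-then-OCR step might look from a POSIX shell. It is not taken from VietOCR; it assumes the source screenshot is about 72 dpi (as estimated earlier in the thread), that ImageMagick's `convert` and the `tesseract` binary are on the PATH, and that the German traineddata (`deu`) is installed. The output file names and the `scale_percent` helper are hypothetical.

```shell
#!/bin/sh
# Sketch only: file names and the 72 dpi source resolution are assumptions.

# Percentage by which to enlarge an image to go from one DPI to another,
# rounded up to a whole number (integer arithmetic only).
# $1 = current dpi, $2 = target dpi
scale_percent() {
    echo $(( ($2 * 100 + $1 - 1) / $1 ))
}

in=example.tiff            # the screenshot from the thread
out=example_300dpi.tiff    # hypothetical name for the rescaled copy
pct=$(scale_percent 72 300)

if [ -f "$in" ] && command -v convert >/dev/null 2>&1 \
        && command -v tesseract >/dev/null 2>&1; then
    # Enlarge the image and record 300 dpi in its metadata.
    convert "$in" -resize "${pct}%" -units PixelsPerInch -density 300 "$out"
    # Recognize the text with the German traineddata; writes result.txt.
    tesseract "$out" result -l deu
else
    # Input file or tools are missing: just show what would be run.
    echo "Would run: convert $in -resize ${pct}% -units PixelsPerInch -density 300 $out"
    echo "Would run: tesseract $out result -l deu"
fi
```

On Windows, a bare `convert` may resolve to the unrelated system disk utility, so the full path to ImageMagick's executable (or `magick` in ImageMagick 7) may be needed. The scale factor is only a starting point; other resize filters or percentages may recognize better.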