If I pass a binary (black-white) image into tesseract-ocr, will tesseract process the image on its own way?
It seems tesseract ocr uses Otsu global thresholding method (not looking into the code carefull yet, but saw the key word “otsu” in the source code)
If so, the Otsu method will not change the binary (black-white) image.
So in my understanding, tesseract-ocr will not change the input binary image.
Just ask to make sure...
Thanks.
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
Patrick,I ran into your answer and I think you might be able to help me given that your scanbizcards app is doing similar processing to what I need (don't worry, no competition :) ).I'm using tesseract-ocr on android, taking pictures of portions of receipts in grayscale mode (EFFECT_MONO, if available on device), and converting the bitmap to ARGB_888 for tesseract processing. I'm getting pretty good results, but am trying to improve accuracy.Do you know whether you do any other preprocessing of the bitmap before passing to tesseract, like perhaps converting to black/white (you mentioned in your above answer you're passing "black & white image"). I tried out your app and it seems to give better results than mine on similar types of images. Very nice app, BTW.Thanks,Tin
Thanks for your response. In the meantime, I read up on many of your answers in this group which will be very valuable to me. Thank you for sharing your expertise.I have in my incoming bitmap a single line of text (mostly numbers), so I don't need to worry about most of the details you do. I believe the variances in accuracy occur due to different lighting/brightness conditions under which the picture is taken so sometimes the incoming grayscale image is overall darker, etc. I've played around with contrast settings of the camera, and that helps as well, but am still trying to improve. Preprocessing to black and white was just one idea I had in mind. Needless to say, I'm fairly new to tesseract, so read up on possibilities of setting configuration parameters/variables which may help me.If you guys have nothing better to do, the "Show Processed Image" option would be nice to have in Android as well. :)Thanks again,Tin