Terrible accuracy even though "tessinput.tiff" looks fine.

73 views
Skip to first unread message

Mike Russell

unread,
Aug 11, 2019, 9:59:16 AM8/11/19
to tesseract-ocr
Hi

I installed tesseract v4.0.0 on macOS. I am trying to use it to recognize vehicle VIN code images (image is already cropped, single line 17 characters).

I have what I think is a clear image but the accuracy is terrible. I have set "tessedit_write_images" and checked the "tessinput.tiff" (attached) and it looks fine. 

For the attached image the result I get in the output file is:

"WDAPFICDOKFOES | 2"

Can anyone help me to understand what I am doing wrong?

Thanks

Michael
b34ca02dfe14ba1b2bb3538d60d6fc04x.jpg
tessinput.tif

Shree Devi Kumar

unread,
Aug 11, 2019, 10:21:12 AM8/11/19
to tesseract-ocr
Most tesseract 4.0 models have been trained on line images with 48 pixels height. Resize your image to 48 pixels height, 300 dpi and try .

my test results

ubuntu@tesseract-ocr:~/TEST$ tesseract VIN.png - --tessdata-dir ~/tessdata_fast -c tessedit_write_images=1
WD4PF 1ICDOKP075122
ubuntu@tesseract-ocr:~/TEST$ tesseract tessinput.tif - --tessdata-dir ~/tessdata_fast
Page 1
WD4PFICDOKP075122

Edited Files are attached.


--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ffdb60d8-c982-4bf3-bc8e-331d50aa0cc9%40googlegroups.com.


--

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
VIN.png
tessinput.tif
Reply all
Reply to author
Forward
0 new messages