need help: tesseract 4 faild to detect <<<<<<< symbols

60 views
Skip to first unread message

SACHIN CHAVAN

unread,
Oct 10, 2018, 7:05:54 AM10/10/18
to tesseract-ocr
I need help, I'm using tesseract 4, and it converting <<<<<<<<<< symbols into "k" "c" or sometimes "L", the result is like <<<kkkLLLccc<<<<<

Soumik Ranjan Dasgupta

unread,
Oct 11, 2018, 12:10:56 PM10/11/18
to tesser...@googlegroups.com
I would suggest changing the eng.training_txt and fine-tuning the eng.traineddata file.

On Wed, Oct 10, 2018 at 4:35 PM SACHIN CHAVAN <sach...@gmail.com> wrote:
I need help, I'm using tesseract 4, and it converting <<<<<<<<<< symbols into "k" "c" or sometimes "L", the result is like <<<kkkLLLccc<<<<<

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ae3c7a5e-7d1f-46e2-a4ba-f281d9ba0b01%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


--
Regards,
Soumik Ranjan Dasgupta

sachin chavan

unread,
Oct 12, 2018, 3:12:26 AM10/12/18
to tesser...@googlegroups.com
https://stackoverflow.com/users/4766168/dmitrii-z
Help me with that "I would recommend to remove that image, because it likely contains sensitive information. You can use an Utopia mrz image as an example or hide passport number from your image. Also I would recomment you to use custom font traineddata (OCR-B), you can find one for older version of tesseract somewhere in the internet, or train one for tesseract 4 by yourself."

and it works 

Soumik Ranjan Dasgupta

unread,
Oct 12, 2018, 3:18:58 AM10/12/18
to tesser...@googlegroups.com
Please be a bit more clear with your statements. Have you solved the problem or do you still  need suggestions?
Also, the link you provided leads to the account of a stackoverflow user, not sure what to do with that.


For more options, visit https://groups.google.com/d/optout.

sachin chavan

unread,
Oct 12, 2018, 6:54:25 AM10/12/18
to tesser...@googlegroups.com
For now, it is working, is used custom font traineddata (OCR-B), in tesseract4 

Reply all
Reply to author
Forward
0 new messages