Tesseract 4 training related issue

69 views
Skip to first unread message

pranaya mhatre

unread,
Jun 15, 2018, 3:12:36 AM6/15/18
to tesseract-ocr
Hi,

I trained tesseract 4 many times on images by fine tuning english model, but after training tesseract wont give space between two words. How should i resolve spacing problem ?

And how should i train tesseract for detecting text boxes appropriately for italic fonts ?

Thank you

ShreeDevi Kumar

unread,
Jun 15, 2018, 4:18:29 AM6/15/18
to tesser...@googlegroups.com
Are you using images and box files? Does your box file have boxes for spaces between words?

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/adcfff72-4bb2-4900-9332-300beb8b0c2b%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

pranaya mhatre

unread,
Jun 15, 2018, 4:35:36 AM6/15/18
to tesseract-ocr
Yes I am using images and box files. I did both box files with spaces and without spaces.
But when i trained tesseract using box files with space it is generating space in some images not in all test images and it also sometimes print digits between spaces
Reply all
Reply to author
Forward
0 new messages