Tolerance when recognizing spaces

238 views
Skip to first unread message

Lemvigh

unread,
Oct 28, 2008, 8:45:16 AM10/28/08
to tesseract-ocr
Is there a parameter controlling the with of a space character? Our
biggest problem regarding the OCR is concatenated words. There is a
lot of spaces that Tesseract doesn't find. Is there a way the
recognition of spaces can be tuned or a variable controlling the
minimum with of a space character?

Regards,
Lemvigh

Ray Smith

unread,
Oct 29, 2008, 12:26:08 AM10/29/08
to tesser...@googlegroups.com
Yes, but I don't know what will affect it the way you want in your particular circumstances.
I suggest looking at textord/tospace.cpp, where you will find lots of control parameters near the top. You could try adjusting some of these...
Ray.
Reply all
Reply to author
Forward
0 new messages