Does any parameter to control ocr region?

96 views
Skip to first unread message

Duck

unread,
Jun 6, 2017, 12:20:12 PM6/6/17
to tesseract-ocr
I need some help.

The following pic is my problem, it was always recognized as "I".

I trace for a while, find out that OCR engine segement again, it takes out the mid area of the "D".

but I tried a lot of parameter, can't disable the segement process.

Does anyone have any idea?  Or the only way is adjusting image?

And if in any condition that each result in same tesseract class instance in different times would not the same?

the same pic in my program, I click "start" twice and get the differnet result.

but if I new tesseract class every time, it would not happen...

please!

(the enginemode is tesseractonly, pagesegmode is sinlechar, I use Tesseract 3.04 in EmguCV)
PadResult=I.bmp

ShreeDevi Kumar

unread,
Jun 6, 2017, 1:47:30 PM6/6/17
to tesser...@googlegroups.com

I converted the bmp to png and tried with command line tesseract 4 and get correct result.

$ tesseract I.png stdout --oem 1 --psm 6
D


$ tesseract I.png stdout --oem 0 --psm 6
D

original .bmp also works.

$ tesseract I.bmp stdout --oem 0 --psm 6
D

Warning. Invalid resolution 0 dpi. Using 70 instead.


ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/87915489-7649-4da2-9408-1a23b5543194%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Duck

unread,
Jun 7, 2017, 1:46:33 AM6/7/17
to tesseract-ocr
Because of my company's project, I can't change version by myself.

But I changed pagesegmode to singleblock and it was fixed....

Why? I thought singlechar is better setting?

Reply all
Reply to author
Forward
0 new messages