dealing with image with text of separate columns

33 views
Skip to first unread message

Jingjing Lin

unread,
Jun 11, 2019, 12:09:28 PM6/11/19
to tesseract-ocr
I'm wondering, what are the parameters to tune to get better result for image with text of several columns, example as attached.

Basically I would like to have separate columns separate, instead of getting different columns sticking together. Like the middle part in the .txt file. 

I used '-c preserve_interword_spaces=1' and '--psm 6' and also whitelist

Thanks for your help.
bloodtest11.jpg
bloodtest11.txt

Jingjing Lin

unread,
Jun 12, 2019, 2:06:47 PM6/12/19
to tesseract-ocr
Further question, will training help for images like this?

在 2019年6月11日星期二 UTC-4下午12:09:28,Jingjing Lin写道:
Reply all
Reply to author
Forward
0 new messages