Trying to find a way to use config options with image_to_boxes

장기석

Jun 15, 2020, 6:51:14 AM
to tesseract-ocr
Environment:
Tesseract 5.0 alpha
Python 3.8
Windows 10
I need to recognize words in capital English letters, Korean, Japanese, Chinese, Thai, and Russian, but mostly English words.

Because of several recognition issues (0 > O, 1 > I, I > l, ...) I'm using pytesseract.image_to_boxes instead of image_to_string,
but I found out that I still need the config options (such as the whitelist and blacklist) that I used with image_to_string,
because Tesseract has a problem where it recognizes a capital I as a lowercase L.
Is there any way to do this?
I need some help. Thank you.
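
To make the question concrete, here is a minimal sketch of the kind of call in question. It relies on pytesseract.image_to_boxes taking the same `config` keyword as image_to_string. The helper names `build_config` and `boxes_with_config`, the file name, and the blacklist value are hypothetical, chosen only for illustration. Whether the whitelist and blacklist variables are honored may depend on the Tesseract version and OCR engine in use, so treat this as a sketch, not a confirmed fix.

```python
def build_config(whitelist=None, blacklist=None, psm=None):
    """Assemble a Tesseract config string (hypothetical helper).

    Characters are passed through unquoted, so this sketch assumes the
    whitelist/blacklist contain no spaces or quote characters.
    """
    parts = []
    if psm is not None:
        parts.append(f"--psm {psm}")
    if whitelist:
        parts.append(f"-c tessedit_char_whitelist={whitelist}")
    if blacklist:
        parts.append(f"-c tessedit_char_blacklist={blacklist}")
    return " ".join(parts)


def boxes_with_config(image_path, lang="eng", **options):
    """Run image_to_boxes with the assembled config (hypothetical helper)."""
    # Imported lazily so build_config works without the OCR stack installed.
    import pytesseract
    from PIL import Image

    config = build_config(**options)
    return pytesseract.image_to_boxes(
        Image.open(image_path), lang=lang, config=config
    )


if __name__ == "__main__":
    # Example: blacklist lowercase "l" so a capital I is less likely to be
    # reported as "l". Uncomment once an image is available; "sample.png"
    # is a placeholder name.
    print(build_config(psm=6, blacklist="l"))
    # print(boxes_with_config("sample.png", lang="eng", psm=6, blacklist="l"))
```

The config string is built separately from the OCR call so the same options can be reused for both image_to_string and image_to_boxes.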