EXTRACTING JUST THE DIGITS


Jisha J

Sep 6, 2020, 4:14:34 PM
to tesseract-ocr
I'm trying to extract just the numbers '1' and '2'. It works if I crop the image down to just the digits and feed it through my code, but if I include the heading "WEEK", the numbers aren't detected.
I tried all the page segmentation modes (0-10). Could someone please help?
The image I'm trying to extract the text from is attached below.
34.png

Ignacio Martín Ochoa

Sep 30, 2020, 6:39:09 AM
to tesseract-ocr
Have you tried whitelisting the characters you're looking for?


tesseract input_file output_file --oem 0 -c tessedit_char_whitelist=1234567890
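
If you're calling Tesseract from a script, here is a rough Python sketch of the same idea. It is only an illustration, not your actual code: the helper names (build_tesseract_cmd, extract_digits, ocr_digits) are made up, --psm 6 is just one mode to try, and it assumes the tesseract binary is on your PATH. As far as I know the whitelist is honored by the legacy engine (--oem 0, which needs legacy traineddata) and by the LSTM engine from 4.1 on, so the sketch also filters the output for digits in case the whitelist is ignored.

```python
import re
import subprocess


def build_tesseract_cmd(image_path, psm=6, whitelist="0123456789"):
    """Build the tesseract command line (hypothetical helper).

    Uses "stdout" as the output base so the recognized text is printed
    instead of being written to a .txt file.
    """
    return [
        "tesseract", image_path, "stdout",
        "--psm", str(psm),
        "-c", f"tessedit_char_whitelist={whitelist}",
    ]


def extract_digits(ocr_text):
    """Return every run of digits in the OCR output, in reading order."""
    return re.findall(r"\d+", ocr_text)


def ocr_digits(image_path):
    """Run tesseract on the image and keep only the digit runs."""
    result = subprocess.run(
        build_tesseract_cmd(image_path),
        capture_output=True,
        text=True,
        check=True,  # raise if tesseract exits with an error
    )
    return extract_digits(result.stdout)
```

Calling ocr_digits("34.png") would run the whole thing on your attachment. The regex step means that even if "WEEK" does come through in the output, only the digits are kept.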
