Recognize dot-matrix text that overlaps form elements

76 views
Skip to first unread message

Paul Gallagher

unread,
Mar 28, 2016, 4:04:18 PM3/28/16
to tesseract-ocr

My printer is old and needs to be adjusted regularly.  Instead of re-printing out all of our tickets to make it easy for me to digitize them, I'd like to be able to have tesseract ignore lines running through desired text.

This is the only text I need in the whole document, and is the only unique/identifying text. Here is an example:





I've tried photoshop to come up with a procedure to basically remove the blue line and leave the numbers, but I can't figure any way that works consistently.

akhil katpally

unread,
Mar 29, 2016, 3:55:53 PM3/29/16
to tesseract-ocr
Hi Paul

Please check the vertical line extraction in this link. http://docs.opencv.org/3.1.0/d1/dee/tutorial_moprh_lines_detection.html#gsc.tab=0

Thanks,
Akhil Katpally
Reply all
Reply to author
Forward
0 new messages