Hello Guys, I'm making an experiment with pytesseract and tesseocr to read some files receives in my company mailbox.
One problem i`m finding is with symbols. This particular file has some "borders" made with "*"
But the tesseract recognizes it only as a sequence of "r", "k" and"e" , like the one attached he translate as: "KRREKKKKKKK Shipping Instructions KREKKEKKKKKE".
Is there some configuration that I may insert informing that my text may have symbols in it?
Or at least ignore them instead of try to fit them into a character.