Enhance an image to perform OCR recognition
I need to make a Java application that must extract relevant information from an image. the problem is that when i use available libraries like Tesserarct, AspriseOcr ... the results are not accurate.
I tell myself that it may be due to the following factor:
1 - The image contains different fonts
2 - some are in bold and some are not
3 - The font used to print the numbers is really not very difficult to recognize by the APIs that I use (I guess). This
is a real problem because the goal is to retrieve information such as
the invoice amount, the fee, the expiration date, the subscriber number,
the subscriber name, the customer, the meter number , the period of consumption ...
4
- the last problem is that the system used for the printing of the
character does not do it at the same places for two different bills.
My
real question is if I can use Marvin with Java to craft the image
before OCR recognition or how can I otherwise achieve my results?
see attachments for more details.