Hi,
I have to load scanned (non-searchable) PDFs, mainly invoices. Most are scans of Excel-generated data in tabular format. I am able to read the data within these tables; however, in some cases the position or column of a particular value in the table is important to me, because it determines which attributes I need to set in my code.
Some of the scans are fairly complex, with certain columns left blank (for which I need to assume a 0 or blank value), but after OCR these minor yet significant details are lost. For confidentiality reasons I cannot share the complete images, but I can share parts of them. Any help with this is appreciated.
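To make the problem concrete, here is a rough Python sketch of the kind of position-based mapping I am after. It is only an illustration under assumptions: that the OCR step can return each word together with the centre of its bounding box, and that the column boundaries are known for a given invoice layout. The function name, the (text, x, y) tuple layout and all the numbers are made up for this example and do not come from any particular OCR library.

```python
from bisect import bisect_right


def rows_from_words(words, column_edges, row_tolerance=10):
    """Place OCR words into table cells by position (hypothetical helper).

    words: list of (text, x, y) tuples, where x and y are the centre of the
           word's bounding box in pixels (assumed OCR output format).
    column_edges: sorted x coordinates of the boundaries between columns;
                  n edges describe n + 1 columns.
    row_tolerance: words whose y is within this distance of the first word
                   of a row are treated as part of that row.
    """
    n_cols = len(column_edges) + 1

    # Step 1: group words into rows by their vertical position.
    grouped = []  # each entry is [anchor_y, [(x, text), ...]]
    for text, x, y in sorted(words, key=lambda w: w[2]):
        if not grouped or y - grouped[-1][0] > row_tolerance:
            grouped.append([y, []])
        grouped[-1][1].append((x, text))

    # Step 2: within each row, drop every word into the column whose
    # x range contains it. Columns that receive no word stay "".
    table = []
    for _, row_words in grouped:
        cells = [""] * n_cols
        for x, text in sorted(row_words):
            col = bisect_right(column_edges, x)
            cells[col] = (cells[col] + " " + text).strip()
        table.append(cells)
    return table


# Made-up data: four columns (item, quantity, discount, amount),
# with the discount column blank in both rows and quantity blank in the second.
words = [
    ("Widget", 40, 100), ("2", 210, 102), ("50.00", 420, 99),
    ("Gadget", 42, 140), ("75.00", 418, 141),
]
table = rows_from_words(words, column_edges=[150, 300, 380])
for row in table:
    print(row)
# ['Widget', '2', '', '50.00']
# ['Gadget', '', '', '75.00']
```

What I need is for the blank cells to survive as empty values in the right column, as in the output above, instead of the remaining values shifting left.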
Thanks,
Javed