Dear Friends,
Please find attached the PDF file "TextOnlyPDF_NoData.pdf".
This PDF file was created with Tesseract OCR v5.0.0 using the following command:
Command: tesseract Invoice.tiff TextOnlyPDF_NoData -l eng -c textonly_pdf=1 pdf
However, this PDF does not contain any data. It is empty.
Kindly let us know whether there is a bug or issue in the latest Tesseract OCR v5.0.0 source that causes the above output PDF to be generated empty when textonly_pdf=1 is set.
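In case it helps with reproducing this, here is a minimal Python sketch of how the same command could be run and the output checked for a text layer. It is only an illustration under stated assumptions: the function names (build_tesseract_command, extract_text_layer, run_and_check) are hypothetical helpers, and the check relies on the pdftotext tool from Poppler, which is a separate program and not part of Tesseract. The idea is that a page may look blank in a viewer while still carrying extractable text, so extracting the text is a more direct test than opening the file.

```python
import shutil
import subprocess


def build_tesseract_command(image, output_base, lang="eng", textonly=True):
    """Return the argument list for the tesseract call quoted in this message."""
    cmd = ["tesseract", image, output_base, "-l", lang]
    if textonly:
        # Same config variable as in the original command line.
        cmd += ["-c", "textonly_pdf=1"]
    cmd.append("pdf")
    return cmd


def extract_text_layer(pdf_path):
    """Return the text that pdftotext can extract from pdf_path.

    Assumption: Poppler's pdftotext is installed and on PATH.
    """
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext not found on PATH")
    result = subprocess.run(
        ["pdftotext", pdf_path, "-"],  # "-" sends the text to stdout
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout


def run_and_check(image="Invoice.tiff", output_base="TextOnlyPDF_NoData"):
    """Run tesseract, then report whether the resulting PDF has any text."""
    subprocess.run(build_tesseract_command(image, output_base), check=True)
    text = extract_text_layer(output_base + ".pdf")
    return bool(text.strip())
```

Calling run_and_check() in the folder that holds Invoice.tiff would return True if any text can be extracted from the generated PDF and False if the text layer is really empty.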
NOTE:
For your reference, we are attaching a text-only PDF file, "Invoice--Adobe-PDF-(ABBYY-OCR).pdf", generated by ABBYY OCR.
We are trying to generate a similar text-only PDF file using Tesseract OCR v5.0.0.
Kindly help us fix the above text-only PDF issue on the Tesseract OCR v5.0.0 side.
Thank you very much in advance.
Regards,
Subramanyam