issues with pdf

146 views
Skip to first unread message

Simon Eigeldinger

unread,
Oct 6, 2014, 2:03:50 PM10/6/14
to tesser...@googlegroups.com
hi all,

just tried the following with the provided eurotext.tif in the testing
dir of the source package.
used current git from this afternoon european time:

i get this:

$ tesseract eurotext.tif eurotext -l eng pdf

Tesseract Open Source OCR Engine v3.04.00 with Leptonica
Page 1
Error in fopenWriteStream: stream not opened
Error in pixWrite: stream not opened
Error in fopenReadStream: file not found
Error in extractG4DataFromFile: stream not opened to file
Error in l_generateG4Data: datacomp not extracted
Error in pixGenerateCIData: g4 data not made
Error in l_generateCIDataForPdf: file eurotext.tif format is 4; unreadable
Error during processing.


the text file is fine but the pdf is 4 kb and adobe reader doesn't like
the file either.


here are the files:
https://dl.dropboxusercontent.com/u/1598766/tesseract-error.7z


language data is from the tesseract git repository as well.

greetings,
simon

---
Diese E-Mail ist frei von Viren und Malware, denn der avast! Antivirus Schutz ist aktiv.
http://www.avast.com

Reply all
Reply to author
Forward
0 new messages