hi all,
just tried the following with the provided eurotext.tif in the testing
dir of the source package.
used current git from this afternoon european time:
i get this:
$ tesseract eurotext.tif eurotext -l eng pdf
Tesseract Open Source OCR Engine v3.04.00 with Leptonica
Page 1
Error in fopenWriteStream: stream not opened
Error in pixWrite: stream not opened
Error in fopenReadStream: file not found
Error in extractG4DataFromFile: stream not opened to file
Error in l_generateG4Data: datacomp not extracted
Error in pixGenerateCIData: g4 data not made
Error in l_generateCIDataForPdf: file eurotext.tif format is 4; unreadable
Error during processing.
the text file is fine but the pdf is 4 kb and adobe reader doesn't like
the file either.
here are the files:
https://dl.dropboxusercontent.com/u/1598766/tesseract-error.7z
language data is from the tesseract git repository as well.
greetings,
simon
---
Diese E-Mail ist frei von Viren und Malware, denn der avast! Antivirus Schutz ist aktiv.
http://www.avast.com