PDFs still broken?


Simon Eigeldinger

Oct 8, 2014, 1:20:45 PM
to tesser...@googlegroups.com
Hi all,

I just recompiled the newest tesseract from git.

It seems PDF output is still broken:

$ tesseract eurotext.tif eurotext -l eng+deu+fra+spa pdf

Tesseract Open Source OCR Engine v3.04.00 with Leptonica
Page 1
Error in fopenWriteStream: stream not opened
Error in pixWrite: stream not opened
Error in fopenReadStream: file not found
Error in extractG4DataFromFile: stream not opened to file
Error in l_generateG4Data: datacomp not extracted
Error in pixGenerateCIData: g4 data not made
Error in l_generateCIDataForPdf: file eurotext.tif format is 4; unreadable
Error in fopenWriteStream: stream not opened
Error in pixWrite: stream not opened
Error in fopenReadStream: file not found
Error in extractG4DataFromFile: stream not opened to file
Error in l_generateG4Data: datacomp not extracted
Error in pixGenerateCIData: g4 data not made
Error in l_generateCIDataForPdf: file eurotext.tif format is 4; unreadable
Error during processing.


Tested with the eurotext.tif file from the testing directory on a
Windows system.
Compiled with Cygwin.
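
The log above shows a write failing (fopenWriteStream, pixWrite) right before a read fails with "file not found", which would be consistent with an intermediate file that could not be created. That reading is only a guess from the messages, and where that file is written is not shown in the log. As a rough sketch for narrowing it down, the following Python script checks basic file access around the run. The `can_write` and `report` helpers are hypothetical names made up for this example, and treating the system temp directory as relevant is an assumption.

```python
import os
import tempfile


def can_write(directory):
    """Return True if a small file can be created and removed in `directory`."""
    try:
        fd, path = tempfile.mkstemp(dir=directory)
    except OSError:
        # Covers a missing directory as well as a permission problem.
        return False
    os.close(fd)
    os.remove(path)
    return True


def report(input_path, output_base):
    """Collect basic file-access facts for a tesseract run (hypothetical helper)."""
    out_dir = os.path.dirname(os.path.abspath(output_base))
    temp_dir = tempfile.gettempdir()  # assumption: the temp directory matters here
    return {
        "input_readable": os.path.isfile(input_path) and os.access(input_path, os.R_OK),
        "output_dir_writable": can_write(out_dir),
        "temp_dir": temp_dir,
        "temp_dir_writable": can_write(temp_dir),
    }


if __name__ == "__main__":
    # Same input file and output base name as in the command above.
    for key, value in report("eurotext.tif", "eurotext").items():
        print(f"{key}: {value}")
```

Run from the same Cygwin shell and directory as the tesseract command, it prints whether the input file is readable and whether the output and temp directories accept new files. If everything reports True, file access is probably not the cause and the problem lies elsewhere.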

https://dl.dropboxusercontent.com/u/1598766/tesseract-error.7z


greetings,
simon



--
Simon Eigeldinger
Follow me on Twitter: http://www.twitter.com/domasofan/
E-Mail: simon.ei...@vol.at
MSN: simon_ei...@hotmail.com
ICQ: 121823966
Jabber: doma...@andrelouis.com
