No searchable text in pdf output


Grady Congdon

Jul 24, 2014, 3:51:21 PM
to tesser...@googlegroups.com
I'm trying to get a searchable PDF as output from tesseract. The PDF is created, but it doesn't appear to contain any searchable text. If I run the same .tif for plain-text output, the resulting .txt file has the correct text. I'm stumped.

my command:  tesseract in.tif out pdf
the only line output from the above command: Tesseract Open Source OCR Engine v3.03 with Leptonica

tesseract -v output:

tesseract 3.03
 leptonica-1.71
  libjpeg 6b : libpng 1.2.49 : libtiff 3.9.4 : zlib 1.2.3


zdenko podobny

Jul 24, 2014, 4:20:26 PM
to tesser...@googlegroups.com
Can you provide more information about your system? 
How did you check if there is searchable text in out.pdf?
Did you compile tesseract yourself (and if so, from which revision), or where did the tesseract executable come from?

Zdenko
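
One possible way to answer the "how did you check" question without relying on a particular PDF viewer is to look inside the file for text-drawing operators. The following is only a rough sketch, not how tesseract or any viewer does it: the function name pdf_has_text_layer is hypothetical, it assumes page content streams are either uncompressed or Flate-compressed, and it ignores other filters, object streams, and encrypted files.

```python
import re
import zlib

# A text object starts with BT; Tj / TJ draw a string or an array of strings.
TEXT_OPERATOR = re.compile(rb"\bBT\b.*?[)>\]]\s*T[jJ]\b", re.DOTALL)
STREAM_BODY = re.compile(rb"stream\r?\n(.*?)endstream", re.DOTALL)


def _mostly_text(body: bytes) -> bool:
    """Content streams are plain operators; image data is mostly binary."""
    if not body:
        return False
    printable = sum(1 for b in body if 32 <= b < 127 or b in (9, 10, 13))
    return printable / len(body) >= 0.95


def pdf_has_text_layer(data: bytes) -> bool:
    """Heuristic: True if some stream looks like page content that draws text."""
    for match in STREAM_BODY.finditer(data):
        body = match.group(1)
        try:
            # decompressobj tolerates the newline that precedes "endstream".
            body = zlib.decompressobj().decompress(body)
        except zlib.error:
            pass  # not Flate data; fall through and inspect the raw bytes
        if _mostly_text(body) and TEXT_OPERATOR.search(body):
            return True
    return False
```

Calling pdf_has_text_layer(open("out.pdf", "rb").read()) on the file from the original command would then give a first indication; a False result only means this heuristic found no text operators, so it is worth confirming with a second tool.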

