I'm trying tesseract for the first time with a png of a multipage document I saved out of a pdf (which itself was just an image).When I run tesseract, I get an output of the first page, but that's all. I notice that there's a control-L (^L) at the end of the text file.How do I get the entire file output to txt?
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/4067da33-b1d1-4bbe-9909-9b5552c49549%40googlegroups.com.
--
tesseract Downloads/foundations-of-mathematics.tiff foundations-of-mathematics
tesseract Downloads/foundations-of-mathematics.tiff foundations-of-mathematics
Provide exact information what you did.Make sure you use the latest tesseract and leptonica.Zdenko
pi 9. 8. 2019 o 7:41 ilevy <textr...@gmail.com> napísal(a):
I'm trying tesseract for the first time with a png of a multipage document I saved out of a pdf (which itself was just an image).--When I run tesseract, I get an output of the first page, but that's all. I notice that there's a control-L (^L) at the end of the text file.How do I get the entire file output to txt?
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesser...@googlegroups.com.
Tesseract Open Source OCR Engine v4.1.0 with Leptonica
Page 1
Page 2
Page 3
Page 4
Page 5
Page 6
Page 7
Page 8
Page 9
Page 10
Page 11
Page 12
Page 13
Detected 14 diacritics
Page 14
Try creating a multipage tiff from your pdf and try.
On Fri, 9 Aug 2019, 11:11 ilevy, <textr...@gmail.com> wrote:
I'm trying tesseract for the first time with a png of a multipage document I saved out of a pdf (which itself was just an image).--When I run tesseract, I get an output of the first page, but that's all. I notice that there's a control-L (^L) at the end of the text file.How do I get the entire file output to txt?
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesser...@googlegroups.com.