--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/c5b4f9c7-67e5-41d8-8c24-b4e5e4c39ed3%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8xGPRx0ZriLS%2BH7kyNHEFaAFHweKJc5KhycfLKT87XG8A%40mail.gmail.com.
I have tried this, but it shows the default behaviour. I think the default output is a text overlay on the PDF instead of hOCR output.
On Mon, Sep 17, 2018 at 5:47 PM Monica <monicak...@gmail.com> wrote:
Thanks, Zdenko, for your response. Will "tesseract scannedFile.png scanned.pdf -l eng hocr pdf" overlay on the PDF file?
On Mon, Sep 17, 2018 at 5:44 PM Zdenko Podobny <zde...@gmail.com> wrote:
Something like this?
tesseract scannedFile.png scanned.pdf -l eng hocr pdf
Zdenko
On Mon, 17 Sep 2018 at 14:12, monica kumari <monicak...@gmail.com> wrote:
To OCR a scanned PDF, it is first converted to an image format and then OCRed, which gives a temporary file in PDF/text format that is overlaid on the original scanned PDF. I want the output format to be hOCR. For this, I ran the command "convert scannedFile.pdf scannedFile.png" and then "tesseract scannedFile.png scanned.pdf -l eng hocr". I got the hOCR format as output. Now I need help overlaying it on the scanned PDF file. Does anybody have any idea how to do this?
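The two steps described above can be sketched as a small shell function. This is only a sketch under stated assumptions: the names `run`, `ocr_to_hocr` and `DRY_RUN` are made up for illustration, the input is assumed to be a single-page PDF, and `convert` is assumed to be ImageMagick's. It relies on Tesseract treating its second argument as an output base name and appending the extension itself, so an output base of `scanned.pdf` yields `scanned.pdf.hocr`, while `scannedFile` yields `scannedFile.hocr`.

```shell
# Hypothetical helper: print the command when DRY_RUN=1, otherwise run it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Hypothetical helper: rasterize a scanned PDF, then OCR it to hOCR.
ocr_to_hocr() {
    pdf=$1            # e.g. scannedFile.pdf
    base=${pdf%.pdf}  # strip the extension -> scannedFile
    # Step 1: ImageMagick converts the PDF page to a PNG image.
    run convert "$pdf" "$base.png" || return 1
    # Step 2: Tesseract appends ".hocr" to the output base name itself.
    run tesseract "$base.png" "$base" -l eng hocr
}
```

Calling `DRY_RUN=1 ocr_to_hocr scannedFile.pdf` only prints the two commands, which makes it easy to check the file names before anything is actually run.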
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAPgEwRjWnOe%3DXwxbZp_F9ZUFFPVDtDztcTiq%3DRyychterctsVQ%40mail.gmail.com.
I think PDF creation adds only a text layer, and there isn't an option to add hOCR to it. @jbreiden can confirm.
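If PDF output really carries only a text layer, then placing existing hOCR on the scan would presumably need a separate step that maps hOCR boxes onto the page. A rough sketch of the coordinate part only, not of a complete overlay tool: it assumes hOCR `bbox x0 y0 x1 y1` values are pixels measured from the top-left corner, that PDF units are 1/72 inch measured from the bottom-left corner, and that each `ocrx_word` span sits on its own line in the file. The function name and its DPI and page-height arguments are hypothetical.

```shell
# Hypothetical helper: print each hOCR word box converted to PDF points.
# Usage: hocr_boxes_to_points FILE.hocr DPI PAGE_HEIGHT_PX
# Output per word: left bottom right top (in points, PDF orientation).
hocr_boxes_to_points() {
    grep "ocrx_word" "$1" |
    grep -o 'bbox [0-9]* [0-9]* [0-9]* [0-9]*' |
    awk -v dpi="$2" -v h="$3" '{
        s = 72 / dpi    # scale factor: pixels -> points
        # hOCR y grows downward, PDF y grows upward, so flip it.
        printf "%.2f %.2f %.2f %.2f\n", $2 * s, (h - $5) * s, $4 * s, (h - $3) * s
    }'
}
```

The DPI must match the resolution at which the PDF was rasterized, otherwise the boxes will not line up with the scanned page.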
On Mon, Sep 17, 2018 at 6:10 PM, Monica <monicak...@gmail.com> wrote:
I have tried this, but it shows the default behaviour. I think the default output is a text overlay on the PDF instead of hOCR output.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAHjiUbpuHSzzsC31fN6BqmzVPb6_TJxDmFiwBiTRPEM_wnTY2A%40mail.gmail.com.