How recognize footnotes

286 views
Skip to first unread message

Felipe Ghiardo

unread,
May 30, 2017, 10:10:47 AM5/30/17
to tesseract-ocr
Hi all. 
 
Using another ocr engines (abby, for ex.), the process recognize the footnotes and make the link. Also recognize header and footer. The answer is how can i do the same with tesseract, at least with the footnotes. IIts something that one can train? And how do you do it? Thanks for the help (and sorry for my english). 

ShreeDevi Kumar

unread,
May 30, 2017, 10:57:43 AM5/30/17
to tesser...@googlegroups.com
Try the `hocr` output and see if it provides some of what you need.

I don't think tesseract will link to footnotes though it may recognize the text.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Tue, May 30, 2017 at 7:20 PM, Felipe Ghiardo <paip...@gmail.com> wrote:
Hi all. 
 
Using another ocr engines (abby, for ex.), the process recognize the footnotes and make the link. Also recognize header and footer. The answer is how can i do the same with tesseract, at least with the footnotes. IIts something that one can train? And how do you do it? Thanks for the help (and sorry for my english). 

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/dfaec4b7-77a2-4f01-be40-cf2fe1809ddd%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

bohdan.mo...@gmail.com

unread,
Nov 26, 2018, 12:19:28 PM11/26/18
to tesseract-ocr
hocr doesn’t help
see also https://groups.google.com/forum/#!searchin/tesseract-ocr/footer%7Csort:date/tesseract-ocr/YY4jMNmSoTM/KAMTzkc5AQAJ

вівторок, 30 травня 2017 р. 17:57:43 UTC+3 користувач shree написав:
Try the `hocr` output and see if it provides some of what you need.

I don't think tesseract will link to footnotes though it may recognize the text.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Tue, May 30, 2017 at 7:20 PM, Felipe Ghiardo <paip...@gmail.com> wrote:
Hi all. 
 
Using another ocr engines (abby, for ex.), the process recognize the footnotes and make the link. Also recognize header and footer. The answer is how can i do the same with tesseract, at least with the footnotes. IIts something that one can train? And how do you do it? Thanks for the help (and sorry for my english). 

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages