ocr_line vs. ocrx_line

107 views

Skip to first unread message

Zdenko Podobný

unread,

May 30, 2012, 10:49:03 AM5/30/12

to hOCR

I need clarification of ocr_line vs. ocrx_line

hOCR spec define ocrx_line as:
* any kind of "line" returned by an OCR system that differs from the
standard ocr_line above
* might be some kind of "logical" line

hocr-tools provide this example of ocr_line[1]:

Alice was
beginning to get very tired of sitting by her sister on the bank,

And tesseract-ocr (r729) produce this hocr output:


Alice
was
...
bank,


Does tesseract-ocr ocr_line meets criteria of "standard ocr_line" or
should it use ocrx_line?

[1] http://code.google.com/p/hocr-tools/source/browse/sample.html#13

--
Zdenko

Reply all

Reply to author

Forward

0 new messages