Hi everyone!
I am trying to figure out what the two numbers after "baseline" in the hocr's ocr_line tags are, but so far I couldn't sort it out. Here is one of this tags as an example: <span xmlns="
http://www.w3.org/1999/xhtml" class="ocr_line" id="line_1_48" title="bbox 879 1300 1240 1335; baseline 0 -6">
So far, I've noticed some things:
If line is skewed upward (or anticlockwise) or not skewed, the second number is zero if no characters extend below the baseline (e.g. p's, q's, y's). If characters do extend below the baseline, then this value is negative.
If line is skewed downward (or clockwise), second number is negative.
Second number is always integer.
First number is zero if no skew, negative if line skewed anticlockwise, and positive if clockwise. This value is decimal. I thought it could be an angle expressed in some way, but I couldn't understand how.
Documentation says: baseline pn pn-1 … p0 - a polynomial describing the baseline of a line of text, the polynomial is in the coordinate system of the line, with the bottom left of the bounding box as the origin
But it is not clear to me.
Can anybody help me here? I would really appreciate it!