Can somebody please advice - is it possible to get the coordinates and
bounding boxes of words, recognized by tesseract? If so - can somebody
please point me to where I should learn more about it?
Ideally, the output (or API callback) should contain the word itself,
the [X,Y] of upper-left corner and [X,Y] of bottom-right one.
Thank you all in advance!
--
Eugene
A word of caution: the Tesseract space detection is so-so and is wrong
IMHO about 5-10% of the time.
Patrick
Hello, Patrick!
Thank you for the quick response. May be you can also tell me what are
the units
for x0,y0,x1,y1 used? Are those pixels or something else?
static int TesseractExtractResult(char** string,
int** lengths,
float** costs,
int** x0,
int** y0,
int** x1,
int** y1,
PAGE_RES* page_res);
Thank you!
--
Eugene