Am Freitag, 30. Oktober 2015 16:39:17 UTC+1 schrieb Zack Cohen:
Thanks!
You can get it just from the console API at nearly no additional runtime:
$ tesseract page_152.png page_152 -l deu-frak+deu makebox hocr
This will output three files: page_152.txt, page_152.hocr and page_152.box.
With the data in the box-file you can cut out the areas from the image.