Unable to get hocr output


oohi...@yahoo.com

Feb 24, 2016, 4:04:37 AM
to tesseract-ocr
What am I doing wrong?

Output of
$ tesseract -v
tesseract 3.04.00
 leptonica-1.72
  libgif 4.1.6(?) : libjpeg 8d (libjpeg-turbo 1.3.0) : libpng 1.2.51 : libtiff 4.0.3 : zlib 1.2.8 : libwebp 0.4.3 : libopenjp2 2.1.0

I use the command line:
tesseract 0001.png -l fra hocr

and I get an hocr.txt output file containing the OCR output in plain text format, not hOCR.

I see an hocr config file in
/usr/share/tesseract-ocr/tessdata/configs/hocr
with the contents:

tessedit_create_txt 0
tessedit_create_hocr 1
tessedit_pageseg_mode 1

oohi...@yahoo.com

Feb 24, 2016, 4:16:10 AM
to tesseract-ocr


On Wednesday, February 24, 2016 at 4:04:37 AM UTC-5, oohi...@yahoo.com wrote:
What am I doing wrong?

Well, that worked well.
Post a question and as soon as I hit send the answer comes to me.

With this command line, which gives the output base name (0001) before the options, I get 0001.hocr:
tesseract 0001.png 0001 -l fra hocr
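
For anyone hitting the same thing: tesseract takes the image as its first argument and the output base name as its second, with config files such as hocr coming last. In the first command, hocr sat in the output base position, which is why the result was hocr.txt. A minimal shell sketch of the working form follows. The variable names image and outputbase are only illustrative, and the guard around the call is an addition so the snippet does nothing if tesseract or the image is missing.

```shell
#!/bin/sh
# Illustrative names, not part of tesseract itself.
image="0001.png"

# Derive the output base by stripping the extension: 0001.png -> 0001
outputbase="${image%.*}"

# General form: tesseract IMAGE OUTPUTBASE [options] [configfile]
# Only run if tesseract is installed and the image exists.
if command -v tesseract >/dev/null 2>&1 && [ -f "$image" ]; then
    tesseract "$image" "$outputbase" -l fra hocr
fi

# With the hocr config, tesseract appends the extension to the base name.
echo "expected output file: ${outputbase}.hocr"
```

Deriving the base name from the image name keeps the two in step when looping over several pages.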
