Groups
Groups
Sign in
Groups
Groups
tesseract-ocr
Conversations
About
Send feedback
Help
Unable to get hocr output
65 views
Skip to first unread message
oohi...@yahoo.com
unread,
Feb 24, 2016, 4:04:37 AM
2/24/16
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to tesseract-ocr
What am I doing wrong?
Output of
$ tesseract -v
tesseract 3.04.00
leptonica-1.72
libgif 4.1.6(?) : libjpeg 8d (libjpeg-turbo 1.3.0) : libpng 1.2.51 : libtiff 4.0.3 : zlib 1.2.8 : libwebp 0.4.3 : libopenjp2 2.1.0
I use the command line:
tesseract 0001.png -l fra hocr
and I get an hocr.txt output file
the ocr output in text format
I see an hocr config file in
/usr/share/tesseract-ocr/tessdata/configs/hocr
with the contents:
tessedit_create_txt 0
tessedit_create_hocr 1
tessedit_pageseg_mode 1
oohi...@yahoo.com
unread,
Feb 24, 2016, 4:16:10 AM
2/24/16
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to tesseract-ocr
On Wednesday, February 24, 2016 at 4:04:37 AM UTC-5,
oohi...@yahoo.com
wrote:
What am I doing wrong?
Well, that worked well.
Post a question and as soon as I hit send the answer comes to me.
With this command line I get 0001.hocr
tesseract 0001.png 0001 -l fra hocr
Reply all
Reply to author
Forward
0 new messages