Unable to create Tesseract 4.0 compatible box files using the image data

46 views
Skip to first unread message

abhishek chopde

unread,
Jun 8, 2017, 10:18:31 AM6/8/17
to tesseract-ocr
I have the image data. I want to create the tiif/box file pairs using that data. I amusing Tesseract 4.0. I ran the following commands:

$ tesseract ../../image.jpg path/to/output batch.nochop makebox

I am getting the box files in the output but they are not in the format as required by  Tesseract 4.0. They are in the format of older versions of tesseract. I have checked the version which is being used for this process. It is Tesseract 4.00.00alpha.
I have tried creating tiff/box file pairs using tesstrain.sh command, where I provided the text file and obtained the required tiff/box file pairs compatible to Tesseract 4.0. But this options is only for text2image. I want to create the dataset (box files) using the images i have.

Can someone please help?

Reply all
Reply to author
Forward
0 new messages