Tesseract training

75 views

Skip to first unread message

karim abed el hadi

unread,

Feb 20, 2021, 12:07:55 PM2/20/21

to tesseract-ocr

Hello everyone,

My name is Karim Abed El Hadi, I am a telecommunications engineering student at the Holy Spirit University of Kaslik, Lebanon.

I am working on a Syriac OCR project.

For the first part of my project, I did a GUI that lets the user select an image or a PDF file from the device, and use it as an input to the OCR engine and it is working really well.

For the second part, I want to train the tesseract to improve the accuracy of the results. I am working with Python 3.8 and tesseract version 4.

1- What are the steps to train the tesseract v4?

2- Is there a way to train the tesseract automatically every time an input image is introduced to the tesseract (fine tuning)?