Handwritten malayalam

55 views

Skip to first unread message

Vrinda Gopinath

unread,

Mar 13, 2025, 2:08:50 AM3/13/25

to tesseract-ocr

Hi,

I need to extract hand written malayalam text. I think it's possible to fine-tune Tesseract 5for handwritten Malayalam.

There is no single document explicitly stating the data requirements for fine tune Tesseract 5 on handwritten Malayalam (at least, I couldn’t find one—though there may be some). According to ChatGPT, the estimated data requirement is 4 lakh text samples. From where we get the authenticity of this data requirement. Additionally, based on the documentation, I believe it runs only on a CPU. How much time is required for training, but I couldn’t find answers to these questions in the documentation. Where can we find information on aspects like training time, data requirements, etc.?

Reply all

Reply to author

Forward

0 new messages