Fine-Tune Arabic Model

Omar Samir

Apr 12, 2024, 5:25:48 PM
to tesseract-ocr
I have created a dataset of almost 200 million words. If each image contains 10 words, that gives about 20 million examples to train the model on. Is that enough to get better results?
For context, we previously fine-tuned a model on 20 thousand examples, and it did worse than the pre-trained model.
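
For reference, the example count above can be checked with a minimal Python sketch. The function names `estimate_examples` and `chunk_words` are hypothetical helpers for illustration, not part of Tesseract or its training tools, and the 10-words-per-image figure is the assumption stated above.

```python
from typing import Iterator, List


def estimate_examples(total_words: int, words_per_line: int = 10) -> int:
    """Number of line images needed to cover total_words (ceiling division)."""
    return -(-total_words // words_per_line)


def chunk_words(words: List[str], words_per_line: int = 10) -> Iterator[str]:
    """Yield ground-truth text lines with at most words_per_line words each."""
    for start in range(0, len(words), words_per_line):
        yield " ".join(words[start:start + words_per_line])


if __name__ == "__main__":
    # 200 million words at 10 words per image
    print(estimate_examples(200_000_000))
```

Each line produced by `chunk_words` would correspond to one rendered image and its ground-truth text.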