Dear all ,
I have been using Tamil Language test data downloaded from Google
Repo and also from Viet OCR
source forge . (Thanks to Shree ) .
Some times I use them as single entity , some times combined using a + . But I have been facing a problem .
While the data from Viet OCR has information like tam.font properties which helps in finding out which fonts have been used , tam.txt (0,1,2,3,4) which helps in giving idea of what text has been used for training .
Is there a way to find out these files ?
-Sibi