Hi,
The frequent word list is added to the traineddata using the following commands:
wordlist2dawg frequent_words_list LAN.freq-dawg LAN.unicharset
combine_tessdata LAN.
How can I tell whether an OCR result came from the frequent words list or not?
Could I modify the Tesseract source code to achieve this?
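As a fallback, I suppose I could check the recognized words against the list after the fact. Below is a minimal sketch in Python, assuming frequent_words_list is a plain text file with one word per line. The helper names load_wordlist and tag_words are made up for illustration. Note that this only tells me a word appears in the list, not that Tesseract actually chose it through the freq-dawg, which is what I really want to know.

```python
def load_wordlist(lines):
    """Build a set of words from an iterable of lines (one word per line)."""
    return {line.strip() for line in lines if line.strip()}


def tag_words(ocr_text, wordlist):
    """Return (word, in_list) pairs for each whitespace-separated OCR word."""
    return [(word, word in wordlist) for word in ocr_text.split()]


# In-memory example; with a real file (assumed UTF-8, one word per line):
#   with open("frequent_words_list", encoding="utf-8") as f:
#       words = load_wordlist(f)
words = load_wordlist(["hello\n", "world\n", "\n"])
result = tag_words("hello wrld world", words)
```

The comparison is exact and case-sensitive, so punctuation attached to a word in the OCR output would count as a miss.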
Please help,
Thanks very much ^^