Thanks,
Max
Keep up the good work!
sorry if i'm missing something obvious...
max
// Specify option -u to unpack all the components to the specified path:
combine_tessdata -u tessdata/ell.traineddata /home/$USER/temp/ell
combine_tessdata language_data_path_prefix (e.g. tessdata/eng.)
Combines all individual tessdata components (unicharset, DAWGs, classifier templates, ambiguities, language configs). The result will be a combined tessdata file lang_code.traineddata.
Hope it helps,
182:tess max$ combine_tessdata -u eng.traineddata eng
Extracting tessdata components from eng.traineddata
tesseract::TessdataManager::TessdataTypeFromFileName( filename, &type, &text_file):Error:Assert failed:in file tessdatamanager.cpp, line 241
Segmentation fault
this is tesseract 3.00. seems to have some problem with the traineddata suffix.
thanks,
max
the output path needs to end in a period (i.e. eng. rather than eng).
my bad.
max
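
For anyone hitting the same assert, here is a minimal sketch of the corrected unpack call. It assumes eng.traineddata sits in the current directory; ensure_prefix is a hypothetical helper written for this example, not part of Tesseract. All it does is append the trailing period when it is missing, and the real command is skipped if combine_tessdata or the file is not available.

```shell
#!/bin/sh
# ensure_prefix: hypothetical helper (not a Tesseract tool).
# Appends a trailing period to an output prefix unless one is already there.
ensure_prefix() {
  case "$1" in
    *.) printf '%s\n' "$1" ;;
    *)  printf '%s.\n' "$1" ;;
  esac
}

# "eng" is the prefix that triggered the assert above; "eng." is the fixed form.
out_prefix=$(ensure_prefix "eng")

# Only run the real tool if it is installed and the input file exists.
if command -v combine_tessdata >/dev/null 2>&1 && [ -f eng.traineddata ]; then
  combine_tessdata -u eng.traineddata "$out_prefix"
else
  echo "would run: combine_tessdata -u eng.traineddata $out_prefix"
fi
```

The same helper works for a full path, e.g. ensure_prefix /home/$USER/temp/ell gives a prefix ending in "ell.".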
Well, it was answered enough that I was able to make my own xxx.traineddata file. Unfortunately, even with that traineddata file, I'm running into the same problem that you are: I can't seem to get tesseract to use the freq-dawg that I included. I've been digging through the source code to find the right config but haven't succeeded yet. I'll let you and the group know when I do!
thanks,
max