I already know the pattern of the text I will be OCRing, e.g. \c\d\d\d, a character followed by 3 digits. I am using Tesseract API, not command line. I found the following two functions in the API
void initialize_patterns(UNICHARSET *unicharset);
bool read_pattern_list(const char *filename, const UNICHARSET &unicharset);
- I tried including <tesseract/dict/trie.h> where those functions are located but it didn't work. The tesseract_build_dir/include doesn't have those files. Did I miss something when I was compiling the tesseract src locally via cmake? As a matter of fact tire.h is in the tesseract source folder but it never made it to the build/inlcude folder.
- Is there an alternative way of setting the pattern instead of using a file? Is there something like set user-pattern or so?
Thanks,
Egamal