forcing a certain pattern via read_pattern_list


emran...@gmail.com

Nov 15, 2017, 2:40:19 AM
to tesseract-ocr
Hello everyone, 

I already know the pattern of the text I will be OCRing, e.g. \c\d\d\d: a character followed by 3 digits. I am using the Tesseract API, not the command line. I found the following two functions in the API:

void initialize_patterns(UNICHARSET *unicharset);
bool read_pattern_list(const char *filename, const UNICHARSET &unicharset);
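
To make the pattern concrete, here is a minimal standalone sketch of what I mean by \c\d\d\d. It is not Tesseract code and does not use the two functions above; `matches_pattern` is a hypothetical helper that only understands \c (one alphabetic character) and \d (one digit), which is my assumption about what those escapes should accept.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical helper, not part of Tesseract. Returns true if `text`
// matches `pattern`, where the pattern may contain only
//   \c  one alphabetic character
//   \d  one digit
// and any other character must match literally.
bool matches_pattern(const std::string &pattern, const std::string &text) {
  std::size_t t = 0;  // current position in text
  for (std::size_t p = 0; p < pattern.size(); ++p) {
    if (t >= text.size()) return false;  // text is shorter than the pattern
    const unsigned char ch = static_cast<unsigned char>(text[t]);
    if (pattern[p] == '\\' && p + 1 < pattern.size()) {
      const char cls = pattern[++p];  // character after the backslash
      if (cls == 'c') {
        if (!std::isalpha(ch)) return false;
      } else if (cls == 'd') {
        if (!std::isdigit(ch)) return false;
      } else if (text[t] != cls) {  // unknown escape: treat as a literal
        return false;
      }
    } else if (text[t] != pattern[p]) {
      return false;
    }
    ++t;
  }
  return t == text.size();  // reject trailing characters
}
```

With this helper, `matches_pattern("\\c\\d\\d\\d", "A123")` accepts the string, while "1234", "A12" and "A1234" are rejected. A check like this could only filter results after recognition; what I am really after is making Tesseract itself honor the pattern.
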

  1. I tried including <tesseract/dict/trie.h>, where those functions are located, but it didn't work: tesseract_build_dir/include doesn't have that file. Did I miss something when I was compiling the Tesseract source locally via CMake? As a matter of fact, trie.h is in the Tesseract source folder, but it never made it to the build/include folder.
  2. Is there an alternative way of setting the pattern instead of using a file? Is there something like a "set user pattern" call?
Thanks, 
Egamal

