Setting dictionaries to use at run time

53 views
Skip to first unread message

Praveen D.

unread,
Aug 5, 2015, 1:17:53 PM8/5/15
to tesseract-ocr
Hi,

We are trying to process a image where different regions would need different dictionaries --> for example the first 4 characters could have one list of dictionaries and the second 4 characters have another.

From what i understood from the documents user_words_suffix is used to load the custom dictionaries but it looks like that is an init parameter and cannot be overriden at runtime.

Is the only possible solution to create different instances of tesseract to process the different zones?

NOTE : This is not character whitelisting but word whitelisting.

Also is it possible to pick use words only from the custom dictionary and ignore the default dictionary for the language?

Thx.
Reply all
Reply to author
Forward
0 new messages