x_wconf values

28 views
Skip to first unread message

Donald Winston

unread,
May 1, 2014, 10:14:22 AM5/1/14
to tesser...@googlegroups.com
I added "user_words_suffix user-words" to "configs/hocr" where tessdata/eng.user-words contains some "extra" words to try and increase the x_wconf values that are reported.

This did not improve the x_wconf values. They remained the same. Can someone give me a short explanation as to why? (I'm assuming the user-words technique is working. Maybe not?)

These are some of the words:
Triethyleneglycol
Monolauryl
Sorbitari
Polysorbate
oxides (for some reason this had a relatively low conf value)
mixed (for some reason this had a relatively low conf value)
Polyethoxyethanol
octylphenoxy

Are these x_wconf values not sensitive to merely word matching?
Message has been deleted

Donald Winston

unread,
May 2, 2014, 4:16:14 PM5/2/14
to tesser...@googlegroups.com
I added language_model_penalty_non_dict_word 3.00 to my config file. No changes in x_wconf values.

Donald Winston

unread,
May 2, 2014, 4:18:25 PM5/2/14
to tesser...@googlegroups.com
language_model_penalty_non_dict_word 3.00 does not do anything either


On Thursday, May 1, 2014 10:14:22 AM UTC-4, Donald Winston wrote:
Reply all
Reply to author
Forward
0 new messages