Where to find documentation on config files and parameters?

177 views
Skip to first unread message

mzma

unread,
Jun 5, 2023, 9:36:28 AM6/5/23
to tesseract-ocr
I've been reading for 2 days through different pages of documentation and couldn't find the in depth info for the parameters and the configuration files, like `configs` and `tessconfigs`.
Looking in such a config file I can see 600+ parameters and I'd like to know a brief comment of what each does and possible values.
For instance I searched "tessedit_char_blacklist" on github and found that there would be references in `tesseractclass.cpp` and `tesseractclass.h`.
Following the former file gives "oops 500" error. Following the latter brings me to the file itself but there's no comment on said parameter, it's just
`STRING_VAR_H(tessedit_char_blacklist);`

Zdenko Podobny

unread,
Jun 5, 2023, 10:34:34 AM6/5/23
to tesser...@googlegroups.com
Funny, but when I open your link ( github )  I see there:

, STRING_MEMBER(tessedit_char_blacklist, "", "Blacklist of chars not to recognize",
                    this->params())

BTW: did you try to run 'tesseract --help-extra'?

Anyway you should also check source code if the parameter is used (there could be old unused parameters), how it is  used, for which tesseract engine it is valid etc... 

Zdenko


po 5. 6. 2023 o 15:36 mzma <mire...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/590ef9b1-059c-4318-8a35-b40c33109267n%40googlegroups.com.

mzma

unread,
Jun 9, 2023, 4:34:11 PM6/9/23
to tesseract-ocr
Thanks for the helpful suggestions. I'll do the due research.
About that link you and me posted it's a search for files containing "tessedit_char_blacklist". In that list of results a couple of files show up looking as if they contained the comment, i.e.
"STRING_MEMBER(tessedit_char_blacklist, "", "Blacklist of chars not to recognize",
                    this->params())"
but the actual files that they point to, are different.
Reply all
Reply to author
Forward
0 new messages