recognition languages sets? with hierarchy?

1 view
Skip to first unread message

tt

unread,
Aug 21, 2010, 5:12:11 AM8/21/10
to tesseract-ocr
Is it possible for Tesseract to make ocr with languages put in ordered
set? I have lots of text to ocr consisting primarily of lang1, with
small portions in lang2 and lang3 (quotes and refs). It would be ideal
for Tesseract to recognise "what it can" in lang1 (e.g., to 90%
match), then switch to the lang2 for the unmatched, then to lang3.

Jimmy O'Regan

unread,
Aug 21, 2010, 6:09:56 AM8/21/10
to tesser...@googlegroups.com

There's some code for doing that, but it's not finished yet.


--
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.

tt

unread,
Aug 21, 2010, 7:18:51 AM8/21/10
to tesseract-ocr
I'm not much of a programmer, but could you point me to the code doing
that?
Reply all
Reply to author
Forward
0 new messages