Extracting text from an unknown language


Avi Fatal

Dec 15, 2024, 7:32:14 AM
to tesseract-ocr
Hi,
I have a use case for extracting text via OCR from images whose language I don't know ahead of time.
It can be any language.
How can I run OCR in the original language without knowing it in advance?
Thanks

Nikola Smolenski

Dec 26, 2024, 6:00:05 AM
to tesser...@googlegroups.com
It should be possible to build a model that recognizes the language directly from an image, but I don't know of a freely available one.

You could try running OCR with every available language, then use spellchecking dictionaries to see which language best matches the resulting text.
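
A minimal sketch of the "OCR with every language, then check against dictionaries" idea, in Python. The helper names (dictionary_hit_rate, guess_language, tesseract_ocr) are hypothetical, and the word lists are assumed to be supplied by you, for example loaded from spellchecker word files. The scoring is a plain hit rate, not something Tesseract provides. The tokenizer is deliberately simple and only suits scripts that separate words with spaces.

```python
import re
from typing import Callable, Dict, Set, Tuple


def dictionary_hit_rate(text: str, words: Set[str]) -> float:
    """Fraction of alphabetic tokens in `text` found in the word list `words`."""
    # Runs of Unicode letters only; digits and punctuation split tokens.
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    if not tokens:
        return 0.0
    return sum(token in words for token in tokens) / len(tokens)


def guess_language(
    ocr: Callable[[str], str],
    dictionaries: Dict[str, Set[str]],
) -> Tuple[str, str, float]:
    """Run `ocr(lang)` for each candidate language and keep the best match.

    `dictionaries` maps a Tesseract language code (e.g. "eng") to a set of
    lowercase words for that language (assumed to be provided by the caller).
    Returns (language code, recognized text, hit rate).
    """
    best: Tuple[str, str, float] = ("", "", -1.0)
    for lang, words in dictionaries.items():
        text = ocr(lang)
        score = dictionary_hit_rate(text, words)
        if score > best[2]:
            best = (lang, text, score)
    return best


def tesseract_ocr(image_path: str) -> Callable[[str], str]:
    """Build an `ocr(lang)` function backed by pytesseract.

    Assumes pytesseract, Pillow, Tesseract itself, and the traineddata file
    for every candidate language are installed.
    """
    import pytesseract
    from PIL import Image

    image = Image.open(image_path)
    return lambda lang: pytesseract.image_to_string(image, lang=lang)
```

Usage would look roughly like guess_language(tesseract_ocr("scan.png"), {"eng": english_words, "deu": german_words}), where the file name and word sets are placeholders. Running every installed language is slow, so limiting the candidates to the languages you realistically expect is probably worthwhile.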
