Invalid Start of grapheme sequence

28 views
Skip to first unread message

Ankur Rana

unread,
Apr 24, 2023, 2:19:02 AM4/24/23
to tesser...@googlegroups.com
Hi,

We am trying to extended Devanagari OCR training data with the four more unicode Devanagari characters i.e.ॻ, ॼ, ॾ, ॿ. We are getting the error for other character combinations. Screenshot attached.

How can we train the tesseract for above consonant with other matras like following:

ॻ ॻा ॻि ॻी ॻु ॻू ॻे ॻै ॻो ॻौ ॻं ॻॉ
ॼ ॼा ॼि ॼी ॼु ॼू ॼे ॼै ॼो ॼौ ॼं ॼॉ
ॾ ॾा ॾि ॾी ॾु ॾू ॾे ॾै ॾो ॾौ ॾं ॾॉ
ॿ ॿा ॿि ॿी ॿु ॿू ॿे ॿै ॿो ॿौ ॿं ॿॉ

--
Regards
---------------------------------------------------------------------------------------
Dr. Ankur Rana
System Analyst
Research Centre for Technical Development of Punjabi Language, Literature and Culture
Punjabi University Patiala

Screenshot from 2023-04-20 16-34-08.png
Reply all
Reply to author
Forward
0 new messages