Is somebody using byt5-sanskrit or sanskrit-byt5-ocr-postcorrection ?

16 views
Skip to first unread message

Anunad Singh

unread,
Mar 4, 2026, 11:41:43 AM (6 days ago) Mar 4
to sanskrit-programmers
I tried to use both byt5-sanskrit and sanskrit-byt5-ocr-postcorrection for spell checking Sanskrit text in Devanagari. The response is far from being satisfactory.

Is there a better way to spell check Sanskrit text ?

-- anunAda

Rishabh Kumar

unread,
Mar 4, 2026, 9:00:15 PM (5 days ago) Mar 4
to sanskrit-p...@googlegroups.com
Hello,


You need to fine-tuned the model for your task (passed on your model outputs).

Rishabh Kumar
CSE Department
IIT Bombay


--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-program...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/sanskrit-programmers/CACa%2Bt%3DNMAQBDZAiTHoHLBR_PLDVkukEWOcG%2B68vB_mG9TNAsgw%40mail.gmail.com.

Anunad Singh

unread,
Mar 4, 2026, 11:01:10 PM (5 days ago) Mar 4
to sanskrit-p...@googlegroups.com
Rishabh, do you mean that this model is not already trained for spell-checking Sanskrit text? If so, what does the 'sanskrit' contained in its name indicate?

-- anunAda

Prasanna Venkatesh

unread,
Mar 5, 2026, 12:46:32 AM (5 days ago) Mar 5
to sanskrit-p...@googlegroups.com
Hi,

I am working on a Hunspell-based spellchecker of Sanskrit. Can you let me know what texts or kinda of texts you want spellcheck? I might be able to set something up for you.

Regards,
Prasanna Venkatesh

Anunad Singh

unread,
Mar 5, 2026, 2:33:52 AM (5 days ago) Mar 5
to sanskrit-p...@googlegroups.com
Prasanna Venkatesh mahodaya,

As a simple test, I started with very simple misspelled Sanskrit text, "धर्मक्षत्रे कुरुक्षत्रे समवता युयुत्सव" .

You say you are using "Hunspell-based spellchecker of Sanskrit" . But it is generally understood that Hunspell is NOT suitable for Sanskrit. Even AI says so. 

I have also read your paper "Spellchecker for Sanskrit: The Road Less Taken" .  I also read a paper on ByT5 (

ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks )

and then found the AI models on Huggingface and tried to test them. 

I understand that you are the most appropriate person to throw light on 'the best way to spell check Sanskrit text today' . Could you do so please?

-- anunAda

Reply all
Reply to author
Forward
0 new messages