Frequently used dhatus using AI


K.N.RAMESH

Mar 12, 2026, 10:39:18 AM
Courtesy: Rohan Pandey
@khoomeik

Sanskrit has >2000 verb roots (dhātus). But do you really need to learn them all?

I had Claude analyze 270 Sanskrit texts, and it found that with just the 192 most common dhātus, you can understand ~90% of verbs in literature.

Attached are those 192 dhātus, ordered by frequency: 
IMG-20260311-WA0101.jpg

Hrishikesh Terdalkar

Mar 12, 2026, 11:56:17 AM
to sams...@googlegroups.com
If this was performed on DCS, why do you need AI, in particular LLMs, for that analysis?
Isn't this a simple frequency counting problem?

Not saying the results themselves are invalid or contesting their utility. Just wondering what role Claude played, and more importantly, why?
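To make the question concrete, here is a minimal sketch of the counting, assuming each verb token is already tagged with its root. The token list is invented for illustration and is not DCS data, and `roots_needed` is a hypothetical helper name.

```python
from collections import Counter

# Hypothetical sample: (surface form, root) pairs, as a lemma-annotated
# corpus might provide them. These tokens are made up for illustration.
tokens = [
    ("bhavati", "bhū"), ("gacchati", "gam"), ("abhavat", "bhū"),
    ("karoti", "kṛ"), ("bhavanti", "bhū"), ("agacchat", "gam"),
    ("paśyati", "dṛś"), ("bhaviṣyati", "bhū"), ("kurvanti", "kṛ"),
    ("vadati", "vad"),
]

def roots_needed(tokens, target=0.9):
    """Return the most frequent roots that together reach `target` coverage."""
    counts = Counter(root for _, root in tokens)
    total = sum(counts.values())
    covered, chosen = 0, []
    for root, n in counts.most_common():
        if covered / total >= target:
            break  # coverage target already reached
        chosen.append(root)
        covered += n
    return chosen, covered / total

chosen, coverage = roots_needed(tokens, target=0.9)
print(chosen, round(coverage, 2))
```

With annotations in hand, this is a counter plus a running sum. The hard part is obtaining the (form, root) pairs in the first place.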

Regards,
Hrishikesh



Srikanth

Mar 12, 2026, 2:58:14 PM
to sams...@googlegroups.com
That's really helpful. Thanks for sharing. The image shared is heavily compressed. Could you please share a text/PDF version or a higher-quality (PNG, larger) image so that I can print it out?

Regards,
Srikanth

kenp

Mar 12, 2026, 10:15:40 PM
to samskrita

vishal jaiswal

Mar 12, 2026, 10:16:57 PM
to sams...@googlegroups.com
The process used, as well as the names of the texts, would be good to have.

Sudhir Pattar

Mar 12, 2026, 11:37:20 PM
to sams...@googlegroups.com
What does the color code mean?
Also, please share a picture of higher resolution.

Thanks,
Sudhir


Raunak Dhar

Mar 12, 2026, 11:37:26 PM
to sams...@googlegroups.com
Yes, please upload a higher quality image if you have it. 


Anunad Singh

5:46 AM
to sams...@googlegroups.com
Though this is an important fact to know, it is not new. It is not limited to dhatus; it holds for all words, and indeed for all languages, so word-frequency charts exist for most languages. Even before AI tools existed, people built and used tools that counted word frequencies by reading large quantities of text.

This frequency distribution loosely follows the so-called '80-20 rule': 20% of the words of a language account for 80% of usage (or, put the other way, the remaining 80% of words account for only 20% of usage).
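As a rough illustration of how one might check this on any frequency list, the sketch below computes the share of occurrences covered by the top 20% of items. The counts are synthetic (a simple 1/rank falloff), not measurements from any language, and `top_share` is a hypothetical helper name.

```python
def top_share(counts, fraction=0.2):
    """Share of all occurrences accounted for by the top `fraction` of items."""
    ordered = sorted(counts, reverse=True)
    k = max(1, round(len(ordered) * fraction))  # number of items in the top slice
    return sum(ordered[:k]) / sum(ordered)

# Made-up counts with a rough 1/rank (Zipf-like) falloff, not real data.
counts = [1000 // rank for rank in range(1, 101)]
print(f"top 20% of items cover {top_share(counts):.0%} of occurrences")
```

Running the same function on a real word or dhatu frequency list would show how loosely or closely the rule holds for that corpus.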

In fact, this phenomenon is so prominent that in 1930 the British linguist and philosopher Charles Kay Ogden introduced Basic English (British American Scientific International Commercial English), a simplified version of the language designed for international communication. He claimed the following in his 1937 work, Basic English and Grammatical Reform:

"It is a language of eight hundred and fifty English words which will say clearly and simply almost everything we normally say with fifteen or twenty thousand."

Thank you again for this frequency list of Sanskrit dhatus, which will be very useful for prioritising the higher-frequency dhatus when learning Sanskrit.

-- अनुनाद 



karthik holla

7:55 AM
to sams...@googlegroups.com
1000209998.jpg
1000209997.jpg

Sreenadh Jonnavithula

10:00 AM
to sams...@googlegroups.com, Anunad Singh

Thanks for this list, I think it will be very useful!

My guess as to why AI had to be used: verbs in Sanskrit have SO many different forms, with all the lakaras, pratyayas, etc. It is not a simple matter of counting the frequencies of word occurrences.
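A small sketch of the point, using a hand-written lookup from inflected form to root. The table `FORM_TO_ROOT` and the sample text are assumptions made up for illustration. A real pipeline would need a morphological analyzer or an annotated corpus, and would also have to deal with sandhi and compounds.

```python
from collections import Counter

# Hand-written lookup from inflected form to root, for illustration only.
FORM_TO_ROOT = {
    "gacchati": "gam", "agacchat": "gam", "gamiṣyati": "gam", "jagāma": "gam",
    "bhavati": "bhū", "abhavat": "bhū", "bhaviṣyati": "bhū", "babhūva": "bhū",
}

text = "gacchati abhavat jagāma bhavati gamiṣyati babhūva agacchat bhaviṣyati"
forms = text.split()

by_form = Counter(forms)                            # counts per surface form
by_root = Counter(FORM_TO_ROOT[f] for f in forms)   # counts per root

print(by_form.most_common(3))  # every surface form occurs only once
print(by_root)                 # each root accounts for four tokens
```

Counting surface forms spreads each root's occurrences over many distinct strings. Only after mapping forms back to roots does the frequency of a dhatu become visible.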

- Sreenadh
