Is there a corpus of Sanskrit->English translations?

34 views
Skip to first unread message

Shreevatsa R

unread,
Aug 15, 2019, 6:39:43 PM8/15/19
to sanskrit-programmers, vishvAs vAsuki
Am curious about the state of publicly available data: does there exist / has anyone compiled a set of parallel Sanskrit / English texts, or a corpus of sentences (say a few hundred thousand simple sentences) translated from one language into the other? The application would be for use in training ML (not by me).

If not, how large is the largest corpus you're aware of?

ken p

unread,
Aug 15, 2019, 10:50:37 PM8/15/19
to sanskrit-programmers
You may take a look at this site for Sanskrit to English translation.
http://www.sanskritbible.in/index.html

Shreevatsa R

unread,
Aug 16, 2019, 6:36:52 AM8/16/19
to sanskrit-programmers
Thanks, that's a start. I can see parallel English text at http://sanskritbible.in/readmode.html and
- there seem to be about 8508 sentences total,
- the text is CC-BY-SA licensed, based on the Bible "published by Calcutta Baptist Missionaries in 1851", 
- the language is not very idiomatic but also not entirely garbage either.

Now I just want a corpus that's 10x or 100x the size :-)

On Thu, 15 Aug 2019 at 19:50, ken p <drk...@gmail.com> wrote:
You may take a look at this site for Sanskrit to English translation.
http://www.sanskritbible.in/index.html

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-program...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/sanskrit-programmers/de34f9e4-12d6-4410-a155-272819c63a66%40googlegroups.com.

ken p

unread,
Aug 18, 2019, 8:41:14 PM8/18/19
to sanskrit-programmers
This parallel Sanskrit / English text can be read in all Indic languages via Read my language. We need to provide Sanskrit knowledge via translation and transliteration to all.

ken p

unread,
Aug 26, 2019, 9:26:26 PM8/26/19
to sanskrit-programmers
Reply all
Reply to author
Forward
0 new messages