tests, tests, tests


Michael Bykov

May 23, 2016, 8:43:17 AM
to sanskrit-p...@googlegroups.com
Namaste,

What I miss most in my work is a reliable, serious, complete, standard set of tests, covering every case, and first of all the morphology of Sanskrit.

For example (I prefer JSON, but anything goes):

{
  name: 'sandhi',
  type: 'dirgha',
  descr: 'simple vowel, followed by a similar vowel',
  // a/A + a/A = A; each test is [result, first, second]
  tests: [
    ['योगानन्द', 'योग', 'आनन्द'],
    ['महामृत', 'महा', 'अमृत'],
    . . .
    . . .
  ],
},

or something like that. (I believe transliteration is evil, but again, anything goes.)
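A minimal sketch of how such a test set could be consumed, in Python. Everything here is an assumption for illustration: the `tests` key, the `[result, first, second]` ordering read off the two examples above, and the names `join_dirgha_a` and `run_tests`. The joiner is a toy that covers only a/A + a/A = A in Devanagari, not a real sandhi engine.

```python
# Hypothetical test set in the shape proposed above: [expected, first, second].
DIRGHA_TESTS = {
    "name": "sandhi",
    "type": "dirgha",
    "tests": [
        ["योगानन्द", "योग", "आनन्द"],
        ["महामृत", "महा", "अमृत"],
    ],
}

AA_MATRA = "\u093e"  # Devanagari dependent vowel sign AA


def join_dirgha_a(first, second):
    """Toy joiner: handles only a/A + a/A = A; returns None for anything else."""
    # The second word must begin with independent A (U+0905) or AA (U+0906).
    if not first or not second or second[0] not in ("\u0905", "\u0906"):
        return None
    last = first[-1]
    if last == AA_MATRA:
        stem = first[:-1]          # first word ends in long A
    elif "\u0915" <= last <= "\u0939":
        stem = first               # bare consonant carries the inherent short a
    else:
        return None
    return stem + AA_MATRA + second[1:]


def run_tests(test_set, join):
    """Return (expected, actual, first, second) for every failing case."""
    failures = []
    for expected, first, second in test_set["tests"]:
        actual = join(first, second)
        if actual != expected:
            failures.append((expected, actual, first, second))
    return failures


if __name__ == "__main__":
    for failure in run_tests(DIRGHA_TESTS, join_dirgha_a):
        print("FAIL:", failure)
```

Because `run_tests` takes the joiner as an argument, the same data could be run against any implementation, which is the point of keeping the tests separate from the code.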

Maybe we can all create such a set of reliable tests through the efforts of the Sanskrit developer community? This could be a separate large project on GitHub, for example. It would be very useful and helpful for all of us.

dhaval patel

May 23, 2016, 10:05:12 AM
to sanskrit-p...@googlegroups.com

I am sure you have already explored http://sanskrit.uohyd.ac.in/Corpus/ but I mention it anyway for the record.

Shreevatsa R

May 23, 2016, 10:06:05 AM
to sanskrit-programmers
+1, great idea. We can come up with test data for all the common tasks we want to do: a set of tests for transliteration, another set of tests for doing sandhi, etc. We've often been writing code to do the same things in different languages, so we should pool our tests together.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-program...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

dhaval patel

May 23, 2016, 10:34:07 AM
to sanskrit-p...@googlegroups.com

Just to ensure that we all agree on nomenclature, I propose using the guidelines developed at UoHyd for tagging.
http://sanskrit.uohyd.ac.in/Corpus/guideline.html

I am in no way connected to UoHyd, but we should honour the work done earlier. (That way we can make use of the data already generated.)

Any change can be put to a vote and accepted.

Michael Bykov

May 25, 2016, 8:31:56 AM
to sanskrit-p...@googlegroups.com

Thank you, this is very useful. I had not seen it before.
