If you want to develop a open source program for identifying vruttas, I suggest that instead of manually building up a dictionary of vruttas, let us develop a program that builds up a data base of vruttas. Every time you input a verse, the program scans it, checks if it matches with its database and if it does not it provides you an option to add it to its database. You can give it a name which it keeps in its database.Murthy
BUt, the problem currently is not a paucity of vRRitta-s in the database - thanks to Ananda mishra, we don not have that problem.
Now that we have collected the vRtta definitions mechanically, some manual curation is required. We need to record:
- examples (esp from popular verses),
- lakShaNa-s,
- sUtra-s,
- yati-sthAna-s (caesura),
- Its jAti(s)
- link to recitation styles,
- classify based on the gati (druta or vilambita or mishra)
- identify whether it sticks to a regular tAla
- a measure of its popularity
- notes about the bhAva which goes naturally with it
- notes about relationship with other metres.
- ease of composition
- memorability
Chanda-s identifying machine can interact with this database which will increasingly grow richer for the benefit of all sahRdaya-s. For its basic function of identification, it would only concern itself with the definition.Such a rich database can easily be converted to stardict dictionaries for easy reference on phones and all sorts of devices, used to make flashcards etc.., besides being valuable by itself to the verse maker.
WIth this in mind I have seeded https://docs.google.com/spreadsheet/ccc?key=0Al_QBT-hoqqVdDhjNVRMTXdsdDVTZG9kcDIwVnhhN0E&pli=1#gid=7 with some vRtta-s. The sheet named छन्दस् had my collection prior to the addition of the ananda-mishra dump (सम, अर्धसम, विषम sheets) - and I hope to transfer information from there to the newly added sheets.
If you would like to contribute to the creation of this database, please let me know.Note:
1. Marking the yati-sthAna is the most immediate need. Vague or loose recommendations are put within paranthesis or brackets like: (४, ५).2. द दा द दा way of marking mAtra-s is preferred over ल गु ल गु or LGLG for the simple reason that that is how poets often remember verse structure.
I've been using a । to denote yati-sthANa in definitions in the छन्दस् spreadsheet, but I am not particular about it - I am ok with switching to -- as a yati-marker. But all that can be mechanically inserted if we just write down the yati-sthAna as a number.
3. One easy way of quickly gathering the yati-sthAna-s is by looking at आप्टेकोशः .
4. shrI-dhavala had shared digitized vRtta-ratnAkara commentary - one can copy paste examples and laxaNa-s thence.
On Saturday, 26 October 2013 20:54:17 UTC+4, विश्वासो वासुकिजः wrote:BUt, the problem currently is not a paucity of vRRitta-s in the database - thanks to Ananda mishra, we don not have that problem.Yes, he solved it.
Now that we have collected the vRtta definitions mechanically, some manual curation is required. We need to record:
- examples (esp from popular verses),
This should not be manual. We need to loop all GRETIL texts and get statistics and hundreds of samples for popular metres.
- classify based on the gati (druta or vilambita or mishra)
Never seen this classification, but interesting.
- notes about relationship with other metres.
There must be some smart books on this topic.
- ease of composition
Popularity is not the same?
- memorability
How do you measure it?
Chanda-s identifying machine can interact with this database which will increasingly grow richer for the benefit of all sahRdaya-s. For its basic function of identification, it would only concern itself with the definition.Such a rich database can easily be converted to stardict dictionaries for easy reference on phones and all sorts of devices, used to make flashcards etc.., besides being valuable by itself to the verse maker.Stardict does seems to be a dinosaur. Consider http://sandic.ru/en/downloads
WIth this in mind I have seeded https://docs.google.com/spreadsheet/ccc?key=0Al_QBT-hoqqVdDhjNVRMTXdsdDVTZG9kcDIwVnhhN0E&pli=1#gid=7 with some vRtta-s. The sheet named छन्दस् had my collection prior to the addition of the ananda-mishra dump (सम, अर्धसम, विषम sheets) - and I hope to transfer information from there to the newly added sheets.The file needs more comments. As it is it is a great beginning, but can hardly imagine how to use it in real life conditions.
I've been using a । to denote yati-sthANa in definitions in the छन्दस् spreadsheet, but I am not particular about it - I am ok with switching to -- as a yati-marker. But all that can be mechanically inserted if we just write down the yati-sthAna as a number.Vishvas, what kind of numbers are used?
3. One easy way of quickly gathering the yati-sthAna-s is by looking at आप्टेकोशः .http://dsal.uchicago.edu/dictionaries/apte/ ? Scraping can be done once again.
On Sat, Oct 26, 2013 at 11:35 AM, Mārcis Gasūns <gas...@gmail.com> wrote:
On Saturday, 26 October 2013 20:54:17 UTC+4, विश्वासो वासुकिजः wrote:BUt, the problem currently is not a paucity of vRRitta-s in the database - thanks to Ananda mishra, we don not have that problem.Yes, he solved it.To a large extant.
Now that we have collected the vRtta definitions mechanically, some manual curation is required. We need to record:
- examples (esp from popular verses),
This should not be manual. We need to loop all GRETIL texts and get statistics and hundreds of samples for popular metres.That is a fair idea. But ideally, we want to use popular and beautiful verses as examples -- not obscure and difficult ones.
- classify based on the gati (druta or vilambita or mishra)
Never seen this classification, but interesting.Educate yourself by listening to this - http://www.youtube.com/watch?v=bnsA9XWBXPI&list=PLABJEFgj0PWVXr2ERGu2xtoSXrNdBs5xS
- notes about relationship with other metres.
There must be some smart books on this topic.Smart books? There are excellent commentaries to वृत्तरत्नाकर if that is what you mean.
- ease of composition
Popularity is not the same?No. There is some correlation, but there are plenty of obscure metres which are easy to compose.
- memorability
How do you measure it?By distance to popular metres, similicity of the gaNa or mAtrA structure, for example.
Stardict does have drawbacks (lack of linking ability), sandic is more the dinosaur - or maybe a platypus. It won't run on a mobile phone without an internet connection, does it?
WIth this in mind I have seeded https://docs.google.com/spreadsheet/ccc?key=0Al_QBT-hoqqVdDhjNVRMTXdsdDVTZG9kcDIwVnhhN0E&pli=1#gid=7 with some vRtta-s. The sheet named छन्दस् had my collection prior to the addition of the ananda-mishra dump (सम, अर्धसम, विषम sheets) - and I hope to transfer information from there to the newly added sheets.The file needs more comments. As it is it is a great beginning, but can hardly imagine how to use it in real life conditions.The file as it stands is only useful to:a] programs [metre recognizer, dictionary maker]
I've been using a । to denote yati-sthANa in definitions in the छन्दस् spreadsheet, but I am not particular about it - I am ok with switching to -- as a yati-marker. But all that can be mechanically inserted if we just write down the yati-sthAna as a number.Vishvas, what kind of numbers are used?DevanAgarI. But I am fine with English numbers if it is more convenient for the machines.
3. One easy way of quickly gathering the yati-sthAna-s is by looking at आप्टेकोशः .http://dsal.uchicago.edu/dictionaries/apte/ ? Scraping can be done once again.Nope - that only contains the Sanskrit-English version, and probably does not include the appendices. That is known to be inferior to his sanskrit-hindI dictionary, anyway.
WIth this in mind I have seeded https://docs.google.com/spreadsheet/ccc?key=0Al_QBT-hoqqVdDhjNVRMTXdsdDVTZG9kcDIwVnhhN0E&pli=1#gid=7 with some vRtta-s. The sheet named छन्दस् had my collection prior to the addition of the ananda-mishra dump (सम, अर्धसम, विषम sheets) - and I hope to transfer information from there to the newly added sheets.The file needs more comments. As it is it is a great beginning, but can hardly imagine how to use it in real life conditions.The file as it stands is only useful to:a] programs [metre recognizer, dictionary maker]How can it help a dictionary maker?
I've been using a । to denote yati-sthANa in definitions in the छन्दस् spreadsheet, but I am not particular about it - I am ok with switching to -- as a yati-marker. But all that can be mechanically inserted if we just write down the yati-sthAna as a number.Vishvas, what kind of numbers are used?DevanAgarI. But I am fine with English numbers if it is more convenient for the machines.I mean how many of them are needed?
3. One easy way of quickly gathering the yati-sthAna-s is by looking at आप्टेकोशः .http://dsal.uchicago.edu/dictionaries/apte/ ? Scraping can be done once again.Nope - that only contains the Sanskrit-English version, and probably does not include the appendices. That is known to be inferior to his sanskrit-hindI dictionary, anyway.So you mean the Sanskrit-Hindi version appendices, right?
There must be some smart books on this topic.Smart books? There are excellent commentaries to वृत्तरत्नाकर if that is what you mean.Never seen them, but good to know the name.
- memorability
How do you measure it?By distance to popular metres, similicity of the gaNa or mAtrA structure, for example.What is the formula of matra simplicity?
Stardict does have drawbacks (lack of linking ability), sandic is more the dinosaur - or maybe a platypus. It won't run on a mobile phone without an internet connection, does it?No, no need for internet. It's a local SQLite database. Can run on Ubuntu, Android - it does not matters.
On Sat, Oct 26, 2013 at 1:16 PM, Mārcis Gasūns <gas...@gmail.com> wrote:How can it help a dictionary maker?You can take a csv file and convert it to a standard dictionary format various dictionaries recognize. Eg: using a program that comes with the stardict-tools package.
I mean how many of them are needed?One pAda can have many yati-sthAna-s (besides the often default one at the end of the pAda) - so one may write ४ ६ ७ to indicate yati-sthAna-s in mandAkrAntA. Sometimes, there is ambiguity as to where the yati lies - so one may write ४/५.
The skt-eng dictinonary is inferior to the skt-hindi one in many respects (eg: it does not show derivation of words).So you mean the Sanskrit-Hindi version appendices, right?
Not sure how the appendices compare..
On Sunday, 27 October 2013 00:33:00 UTC+4, विश्वासो वासुकिजः wrote:On Sat, Oct 26, 2013 at 1:16 PM, Mārcis Gasūns <gas...@gmail.com> wrote:How can it help a dictionary maker?You can take a csv file and convert it to a standard dictionary format various dictionaries recognize. Eg: using a program that comes with the stardict-tools package.Yes, a reference file one can get, but it's not a full dictionary, only part of one...
I mean how many of them are needed?One pAda can have many yati-sthAna-s (besides the often default one at the end of the pAda) - so one may write ४ ६ ७ to indicate yati-sthAna-s in mandAkrAntA. Sometimes, there is ambiguity as to where the yati lies - so one may write ४/५.So 4 different types?
The skt-eng dictinonary is inferior to the skt-hindi one in many respects (eg: it does not show derivation of words).So you mean the Sanskrit-Hindi version appendices, right?Did not knew about the derivation. I miss it in Apte, the one I have known. Is derivation for all words in Hindi Apte?
Is derivation for all words in Hindi Apte?yes.
--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-program...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.