it will be easy in the sense the cyclomatic complexity will be lesser (less if conditions etc).
For eg to convert this text to "unicode", all you have to do is check if the letter is followed by a dot and that's it. Essentially only one if condition. Then append the corresponding unicode from a collection/map.
compare to s/h/sh, R/u/Ru/RRu, ^ ~ etc. Also there is 1:1 uniform mapping to letters as opposed to multiples (eg in baraha, aa vs A, but not uu vs U) - which is (i agree) easier to write, but qtn is are multiple options really necessary? In native scripts, we don't substitue one letter for another, but why do so far transliteration? --- im glad we don't have "Transliteration Spelling Bees" :-0)
On Wed, Aug 24, 2011 at 1:02 PM, vishvAs vAsuki <vishvas...@gmail.com> wrote:
Transliteration scheme एतत् समीचीनं एव दृश्यते, परन्तु तदुपयोगात् parsing सौलभ्ये महत्व-पूर्ण-वर्धनं भवति इति स्पष्ठः नास्ति - finite automaton उपयोगः सुलभः एव, खलु। परन्तु parsing-दृष्ट्या transliteration scheme कल्पना (कल्पनात्मक-दृष्ट्या) आकर्शका अस्ति एव।
--
cavishvAs
Very nice to see lot of ideas and discussions being floated.
I think scala is a good choice. I know that Prof. Huet's ZEN toolkit
is based on Ocaml (and hence, it is all French school languages
winning :) Did someone have a look at his tagger? It is said to parse
baala-raamayaNa and do sandhi-vichcheda, as he told me long time back.
I did not have time to pursue.
On the subject of transliteration, I was about to as if writing
programs that do conversion from devanaagari to other indic languages
(and in the reverse direction) is in the roadmap, but it seems Shri
Shreevatsa's program already does that. Very nice to know. Is there
some plan to add it into the code base?
I am asking this because one of my friend (Shri Bhasar in CC) is
working on making a Telugu PDFs of the sanskrit docs files and had
needs some help in this aspect.
http://sanskritdocuments.org/telugu/
Regards
Ramakrishna
(See Changeset)..........This link does not open file.Please re post the link.Thanks.
We need to minimize use of Nukta,chandrabindu,anuswar as much as we can for easy Roman transliteration.If word has no other meaning and retains pronunciation then why not spell it easy way?
I like this keyboard for modified IAST...where ā , ē ,ī ,ō, ū =aa,ae,ii,au,uu where characters are uniform
--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-program...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
Like famous "brahma"- It is always pronounced as "bramha". (m--h interchange)Hindi native speakers say- "chinham" for "chihnaM" and "vanhi" for "vahni" (n--h interchange)