Thanks for your valuable thought/comment, Pan. See my reply in RED.[Pan, You seem to be suggesting even if we have build up ' lots of data to refer to, and past usage ie. digital data, to interpret and guess a best often use of words in some known context (base on occurrences and use of different words= statistics)', similar to what the Thai have done, we still won't get similar result to what Google Translate has done for Thai language. Really?
Simple, the Google system has no linguistic knowledge; it is a statistical translation machine, relying on a brute force method, with lots of data to refer to, and past usage ie. digital data, to interpret and guess a best often use of words in some known context (base on occurrences and use of different words= statistics). Google Lao does not work, simply; we have dictionaries i.e even if we pay someone to key in a selected EN-La dictionary today! Google -La will still not work. It will contribute but it will not still not be functional. Or it will be only a translation of word for word, with sometimes funny context meanings.
If that is the case we are wasting our time at this Google place, then.
Is there any other place we can spend our time and effort for advancing the Lao language?
I recently heard our Hmong brothers have score a success with Microsoft Translation platform. See previous post. I cannot assess its quality since I don't read Hmong. I wonder if anyone has investigate it for Lao language. ].
[Is there any development in this direction for Lao? any link?]
For a decent Lao Machine Translation (LMT) to work:
- Google-La will needs to built up a Lao words statistics.
- Develop a hint dictionary, with context relevant use of words.
- Hint, to include variations of Lao words and spellings use. A root words/radical dictionary.
[Pan, I search the net and find this, ...
Some may have seen my LMT working, which is an extension of Google Thai, with a small TH-La dictionary, and built in rules and exceptions to handle Th-la idioms and variations of words. It was originally conceived to facilitate the creation of a digital En-La corpus/dictionary. In fact it did help, where we quickly derive some 25,000 words and collate it with STEA En-La dictionary, to filter out a useful set. One application is NUOL Wordnet (http://en.wikipedia.org/wiki/WordNet) dictionary. By the way, the hope is also to derive other dictionary e.g JP-En-TH, Th-La => JP-La etc.
http://lo.asianwordnet.org/ . I think this is what you are referring to. Is that right?
I will have a good look at it.]
Thanks for your valuable thought/comment, Pan. See my reply in RED.[Pan, You seem to be suggesting even if we have build up ' lots of data to refer to, and past usage ie. digital data, to interpret and guess a best often use of words in some known context (base on occurrences and use of different words= statistics)', similar to what the Thai have done, we still won't get similar result to what Google Translate has done for Thai language. Really?
Simple, the Google system has no linguistic knowledge; it is a statistical translation machine, relying on a brute force method, with lots of data to refer to, and past usage ie. digital data, to interpret and guess a best often use of words in some known context (base on occurrences and use of different words= statistics). Google Lao does not work, simply; we have dictionaries i.e even if we pay someone to key in a selected EN-La dictionary today! Google -La will still not work. It will contribute but it will not still not be functional. Or it will be only a translation of word for word, with sometimes funny context meanings.
If that is the case we are wasting our time at this Google place, then.
Is there any other place we can spend our time and effort for advancing the Lao language?
I recently heard our Hmong brothers have score a success with Microsoft Translation platform. See previous post. I cannot assess its quality since I don't read Hmong. I wonder if anyone has investigate it for Lao language. ].
[Is there any development in this direction for Lao? any link?]
For a decent Lao Machine Translation (LMT) to work:
- Google-La will needs to built up a Lao words statistics.
- Develop a hint dictionary, with context relevant use of words.
- Hint, to include variations of Lao words and spellings use. A root words/radical dictionary.
[Pan, I search the net and find this, ...
Some may have seen my LMT working, which is an extension of Google Thai, with a small TH-La dictionary, and built in rules and exceptions to handle Th-la idioms and variations of words. It was originally conceived to facilitate the creation of a digital En-La corpus/dictionary. In fact it did help, where we quickly derive some 25,000 words and collate it with STEA En-La dictionary, to filter out a useful set. One application is NUOL Wordnet (http://en.wikipedia.org/wiki/WordNet) dictionary. By the way, the hope is also to derive other dictionary e.g JP-En-TH, Th-La => JP-La etc.
http://lo.asianwordnet.org/ . I think this is what you are referring to. Is that right?
I will have a good look at it.]
Hi Dara and all,
Sorry to take a while to reply to your query, Dara.
My computer hard disk crashed.Maybe I over use it.