Khmer language in Google translation.

2,353 views
Skip to first unread message

សួន សីហាពណ្ណរាយ

unread,
Dec 2, 2010, 11:29:15 PM12/2/10
to pyk...@googlegroups.com, barcampp...@googlegroups.com
Dear all

I want to ask anyone who knowledge in Google translation. how does it work and if i want to create translation from English to Khmer? how to do it? I write in English coz maybe someone from foreigner can help me some guide or idea to do. I just feeling jealous when I see other language can use this translate service can i dream to use Khmer language also. If anyone who have more time to research about this just spend a little time to do it. even it's the best one but i think at least we can build it and still  develop it to make it the best one.

Best regards

Sovichea SOU

unread,
Dec 2, 2010, 11:32:08 PM12/2/10
to pyk...@googlegroups.com, barcampp...@googlegroups.com
Great! Good point. We used to talk about it many years, but still no solution.



2010/12/3 សួន សីហាពណ្ណរាយ <puna...@gmail.com>

chanborn

unread,
Dec 3, 2010, 11:16:04 AM12/3/10
to barcampp...@googlegroups.com
ho, if some private firm use money from their pocket to conduct research? literature review on that?. This is the only way develop translate software from English to Khmer.

Cheer
chanborn

http://cambodiaairdefense.wordpress.com/

2010/12/3 សួន សីហាពណ្ណរាយ <puna...@gmail.com>
Dear all

I want to ask anyone who knowledge in Google translation. how does it work and if i want to create translation from English to Khmer? how to do it? I write in English coz maybe someone from foreigner can help me some guide or idea to do. I just feeling jealous when I see other language can use this translate service can i dream to use Khmer language also. If anyone who have more time to research about this just spend a little time to do it. even it's the best one but i think at least we can build it and still  develop it to make it the best one.

Best regards

--
You received this message because you are subscribed to the Google
Groups "BarCamp Phnom Penh" group.
To unsubscribe from this group, send email to
barcampphnompe...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/barcampphnompenh?hl=en?hl=en
Also visit: http://barcampphnompenh.org/

Socheat Yi

unread,
Dec 3, 2010, 1:54:32 PM12/3/10
to puna...@gmail.com, barcampp...@googlegroups.com
Hi,

To have en-kh translation it would be more complexity than en-kh dictionary:)...
we require to have both english and khmer dictionary systems for first step, and mapping both dictionary systems.
second step, you have to involve in a lot of process for translating mechanisms like morphological analysis, syntactic/semantic analysis, use sentence pattern base technique to match up the sentence structure since khmer and english have similar structure (SVO), and you have to do to use statistical analysis algorithm for most frequently used sentences, so to help the machine translation getting best match when it tries to translate. Anyhow, I checked En-Thai, it is not 90 or 100%, but it is just acceptable too.  If you have still want to do so, you may try to get some research paper on machine translation.

For technical skill, you should algorithm in mind, and if u used to learn compiler or use to write compiler, so you would understand quite easily on how to parse grammar rule, and do syntactic and semantic analysis, regular expression and so on.

I and some people are still working on the pre-processing steps too regarding this translation tool, and i think other group may be doing something on this too, I am not sure who are doing it since we don't have broad contact to those people.


That's just my limit research and opinion.

Hope it is useful for you.



--
You received this message because you are subscribed to the Google
Groups "BarCamp Phnom Penh" group.
To unsubscribe from this group, send email to
barcampphnompe...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/barcampphnompenh?hl=en?hl=en
Also visit: http://barcampphnompenh.org/



--
------------------------------------------------------------
Socheat Yi

MCPD: ASP.Net Developer 3.5
MCTS: .NET Framework 3.5, ASP.NET Applications/WCF
MCPs

Nal MEN

unread,
Mar 16, 2013, 10:33:32 AM3/16/13
to barcampp...@googlegroups.com, pyk...@googlegroups.com
Dear Friends
I am respected your highest Idea to create Khmer translate in google, but I would like to say that it very difficult, not because of the technical but because of your language. you know the same word but have different meaning. and the technical translate of google is cut string by string, word by word, and find that word in dictionary. If suggest you use same as chines translate in google, while you type that word, it will drop down to show you what that word really mean, so you can sure to translate that word correctly. but the arrangement is according to our grammar. I am sorry if I got any mistake, but hope that it can help.  

Brad Collins

unread,
Mar 16, 2013, 11:07:47 AM3/16/13
to barcampp...@googlegroups.com

Nal is correct. Google uses a brute force approach to
translation. It does it by indexing texts that translate
from one language to another and then attempts to use those
as references in translating whatever the user wants to
translate. In other words it's a big data approach using
lots and lots of data, rather than understanding the
languages it translates.

This approach works, but requires a very large corpus of
text to work from. The more translated texts they have to
index, the better the translation. I am sure they would be
happy to add Khmer translation, but they probabaly still
don't have enough translated texts to be able to offer the
service.

Hopefully this will change in the future. I would love to
have this feature for both Khmer and Lao as much as you do.

b/
--
Brad Collins <br...@chenla.la>
skype|twitter|wave: deerpig
http://chenla.la

Sitthykun LY

unread,
Mar 16, 2013, 1:56:24 PM3/16/13
to barcampp...@googlegroups.com, pyk...@googlegroups.com
Hello friends and Brad,

In fact you are thinking our language is the same as others but are more vowels and consonants ... each of language always has native style ... the problems just are not the grammar or word ... make sure those articles or sentences are correct, follow the rule ...
Ex: 
ពាក្យ(Word): សាលា 
ឃ្លា(Sentence): ប្រធាន(Subject) + កិរិយាស័ព្ទ​(Verb) + កម្មបទ(Object)
In English, they have to token those word by space then analyzes them, but in our language we use Hidden Space to separate the words. Every word have to use Hidden Space(Space in NiDA Keyboard), Khan(Stop), New Line, or .., I think it will enough.

Ex: គាត់​ទៅ​សាលា

Best,
Sitthykun





You received this message because you are subscribed to the Google Groups "BarCamp Phnom Penh" group.
To unsubscribe from this group and stop receiving emails from it, send an email to barcampphnompe...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.





--
Sitthykun LY
a little developer in the big world \o/
mobile: +85595 7788 39
skype: cityx9
twitter: sitthykun <http://twitter.com/sitthykun>
site: niyum.com

Cloud Strife

unread,
Mar 16, 2013, 4:24:31 PM3/16/13
to barcampp...@googlegroups.com

Thai language are almost similar style us but why they can do it?

If they can do it? Why not us? I think, ots a good idea to start our own now.

Cheers

--

Sothoeuth Ith

unread,
Mar 17, 2013, 12:51:42 AM3/17/13
to barcampp...@googlegroups.com
I agree. Translation on the internet or google is not always accurate,
but at least it helps us to get the idea or context of the article. I
think it's good idea to have one for Khmer and we can keep impoving
it. I'm sure our Khmer programmers can do it well. Khmer unicode is a
good example. They've done a very good job. :)
--

Sothoeuth Ith
(+66)87 5526 204

Pongsametrey SOK

unread,
Mar 16, 2013, 11:31:43 PM3/16/13
to barcampp...@googlegroups.com
Thai language they have a long research to separate grammar rules [42 rules, Theeramunkong and Usanavasin 2001]  but Khmer language currently does not have exactly grammar rule in Computer definition yet or it has but we don't find any publish paper about it.

Khmer words are composed of one or more consonantal clusters sometimes called "sylleme" [Sola 2004a]

I only found paper published by Sola 2004 (found in KhmerOS.info).
Some papers published by PAN.
And from Puthik Hok about Khmer Spelling Checker that could make us understand the process.

Google translator would be one good sign and we hope more people help to move it on.

Cheers,
Metrey
--

Mr. SOK Pongsametrey
http://www.linkedin.com/in/pongsametreysok
http://osify.com
It's not hard to give a spare time to community for free!

Brad Collins

unread,
Mar 16, 2013, 10:24:45 PM3/16/13
to barcampp...@googlegroups.com

You didn't understand what I said.

Google doesn't care about grammer, or sentence structure or
definitions. They look for texts that are translated.
Google looks for articles, books, emails, anything that is
given in both english and khmer. Then they map phrases and
words between the translations and come up with an index.
This is a terrible way to translate when you only have a few
texts, but when you have millions of texts it works very
well.

Again, Google doesn't use smart algorithms, it uses a
statistical approach using lots and lots of data. Their
approach is dumb, but works very well if you have enough
data. Cheap and dump will always win over smart and
expensive.

As I said in my last post, it is likely that the reason
they do not offer khmer in Google translate is that they
don't have enough texts to index to do a proper translation.

Sitthykun LY

unread,
Mar 17, 2013, 12:38:01 PM3/17/13
to barcampp...@googlegroups.com
@brad,
Haha, this is not bad algorithm, I have referred to grammar structure.
I've agreed with your speech that Google has matched our sentences in the database.
That's why now we cannot refer to Google translation ... cos some of those sentences are not correct.

Cheer!!

Cloud Strife

unread,
Mar 17, 2013, 5:33:43 PM3/17/13
to barcampp...@googlegroups.com

At least they did that and they have their translation online. They should be a team like we are, so I think if we could find out some help and it would be great if they help us.

Pongsametrey SOK

unread,
Mar 18, 2013, 3:00:37 AM3/18/13
to barcampp...@googlegroups.com
It's OK, we understand how it works. 
Since we have no any product like google to have such translation, now we will have it, it's enough to start something else. 

We thank to everyone who helps to make it for Google, they have worked very hard to have it for us or the world to know our language and understand.

No need to debate, we just hope we can have more idea to improve our language structures in computer literature.

If you guys found any research paper that could help, just help to share so that some of us might found something to do for our nation.

Cheers,
Metrey

Rohin Gupta

unread,
Sep 23, 2013, 7:02:13 AM9/23/13
to barcampp...@googlegroups.com, pyk...@googlegroups.com
Hello, Please can you help me for my letter - English language translate to Khmer language because my uncle is Cambodian.

Brak Sodom

unread,
Sep 23, 2013, 9:29:23 PM9/23/13
to barcampp...@googlegroups.com, pyk...@googlegroups.com
Dear all guides
 
In google translate it've has already in Khmer to other languages and in other languages to Khmer , so you all just select a language as you want  it'll be ok.Please try again
 
good luck
 
my best wish
 


--
Sweeten your digital life in Cambodia: http://sweetmemorystore.com/

---
You received this message because you are subscribed to the Google Groups "BarCamp Phnom Penh" group.
To unsubscribe from this group and stop receiving emails from it, send an email to barcampphnompe...@googlegroups.com.
To post to this group, send email to barcampp...@googlegroups.com.
Visit this group at http://groups.google.com/group/barcampphnompenh.

frogsl...@gmail.com

unread,
Jun 20, 2017, 10:06:50 PM6/20/17
to BarCamp Phnom Penh, pyk...@googlegroups.com
Ironically, the posts here are severely impaired, grammatically and spellingly pOstS. toetail unrespeck off tha inglish sandwich. Anne apsoloot undissgrazingfoolnessessityalism! 

Lors Sren

unread,
Jun 22, 2017, 8:09:22 AM6/22/17
to barcampp...@googlegroups.com
Dear 
I think it should be more updates bcos it still not right but it ok sometime.
Bests 

--
Rekindle your love of reading with Kindle: http://goo.gl/VVRMkg

---
You received this message because you are subscribed to the Google Groups "BarCamp Phnom Penh" group.
To unsubscribe from this group and stop receiving emails from it, send an email to barcampphnompenh+unsubscribe@googlegroups.com.
Visit this group at https://groups.google.com/group/barcampphnompenh.
For more options, visit https://groups.google.com/d/optout.
Reply all
Reply to author
Forward
0 new messages