Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Check string language

6 views
Skip to first unread message

Cylix

unread,
Jan 29, 2008, 5:24:17 AM1/29/08
to
Is there any existing method in VB.NET or any 3-third party function
can find out the language in a string?
Let say, isChinese? isFrench?

kimiraikkonen

unread,
Jan 29, 2008, 6:51:24 AM1/29/08
to

Don't know a direct library, but there may be an indirect method:
Create some of simple text files for some specific languages of the
equilavent of your string via Babelfish.altavista.com or Google
Translation, then put each language's strings into each text file.

Then under your project you can compare the any string with language
within these language-specific text files which you want to know.

Hope this helps.

Patrice

unread,
Jan 29, 2008, 6:50:31 AM1/29/08
to
Hi,

The language of ?
http://msdn2.microsoft.com/en-us/library/system.globalization.cultureinfo.currentculture(VS.80).aspx
could be what you are looking for...


"Cylix" <cyli...@gmail.com> a écrit dans le message de news:
4884639c-063b-4d12...@i29g2000prf.googlegroups.com...

Patrice

unread,
Jan 29, 2008, 7:36:23 AM1/29/08
to
Oups, looks like Kimiraikkonen understood much better (you don't want to
place the computer current language setting in a string but to check in
which language could be written a particular string of text ?)...

Don't know about 3rd party but :

- you could first quickly check based on letters i.e. if you see a
particular unicode charset it could give a first indication (for example if
you have chinese characters, cyrillic characters, latin characters it could
give a first clue).

- if you have then characters that are used in a fair number of languages
you could :
- either check based on basic words frequency (for example "this",
"that" ,"the", "it", etc... is more likely frequent in English)
- also letter frequency could be perhaps an indication

You may want to give some more details. For example on a web site, you could
also have lang attributes that are supposed to tell in which language is a
particular page/section (for use by screen readers for example). If you get
some text from there, you could then also check this info...

--
Patrice

"Patrice" <http://www.chez.com/scribe/> a écrit dans le message de news:
OIlqo0mY...@TK2MSFTNGP02.phx.gbl...

Patrice

unread,
Jan 29, 2008, 7:40:45 AM1/29/08
to

Cylix

unread,
Jan 30, 2008, 9:06:24 PM1/30/08
to
Thanks Patrice anyway.

Actually, the method mention above is a large project that I cannot
effort.
My problem is quite simple, I a string from the email subject.
It may include three types of charater, english letter, chinese
letter, others.
I would like to trim the string have english letters and chinese
letters only.

0 new messages