Corrupted Hindi Email

2 views
Skip to first unread message

Hariram

unread,
Oct 27, 2007, 7:19:05 AM10/27/07
to Hindi...@yahoogroups.com, hi...@googlegroups.com, indlinu...@lists.sourceforge.net, eka...@yahoogroups.com, Chit...@googlegroups.com, Poetry...@yahoogroups.com
Email msgs in Indic Unicode are frequently received as corrupted,
(specially from Yahoogroups when received as daily digest.)
 
Gmail provides a option to set as default "Sent all outgoing msgs as Unicode(UTF8), so msgs sent from Gmail are normally does not corrupt. But msgs recieved in Gmail from other sources are found corrupted.
 
Some msgs display in correct form in IE6 after changing the setting after opening of each new webpage as "View-->Encoding-->Unicode(UTF8)". But may times this setting also does not work.
 
We use the following tools to repair these
 

http://www.mandarintools.com/

http://lang.ojnk.net/hindi/unifix.html

 

But the output got is not 100% correct, i.e. having many errors.

 

And some texts which appear as '??????...' can't be repaired.

 

And some wrong encodings are not being repaired.

 

Would experts kindly guide :

 

1. Any other better tool?

2. Any tool in Open Source? So that we can try to improve it.

3. What is the reason of corruption?

4. What is the permanent solution to avoid such corruption?

5. Is any actions is being taken by authorities to implement any rules for Email-service providers to avoid this?

 

Hariram

अनुनाद

unread,
Oct 28, 2007, 7:05:05 AM10/28/07
to Chithakar
यदि भ्रष्ट मेल को ठीक करने वाले साफ़्टवेयर या भाषा पहचानने वाले
साफ़्टवेयर की सोर्स कोड मिल जाय तो कुछ पता चले कि भ्रष्ट होने का
मेकेनिज्म क्या है। और फिर इसका स्थायी हल निकाला जाय।

विकिपीडिया पर स्थित इस लेख से भी कुछ सहायता मिल सकती है:

Wikipedia:Language_recognition_chart
http://en.wikipedia.org/wiki/Wikipedia:Language_recognition_chart


¿Qué? for AUTOMATICALLY identifying language and character encoding in
software applications
http://www.alis.com/en/services_que.html

Reply all
Reply to author
Forward
0 new messages