把 Thunderbird 寄送郵件的預設編碼修改成 UTF-8?

281 views
Skip to first unread message

Peter Pin-Guang Chen

unread,
Jan 31, 2014, 11:53:24 AM1/31/14
to MozTW
大家好,

英文版的 Thunderbird 從昨天開始把寄件時的預設編碼從 windows-1252/ISO-8859-1 換成 UTF-8 了,我們是不是也可以考慮跟進?

幾個思考點:
  • 台灣人常用的 Mail Server 跟 Client 處理 UTF-8 應該已經沒有什麼大問題了?
    • 使用 Big5 且不能調整編碼設定的企業/學校內部應用應該比較少了?
  • 比較舊的 Client 收件時就算有問題,是不是可以不管它了?
    • 例如比 Outlook Express 跟 Outlook 2003 還舊的程式
  • 跟其他平台 (netnews/mailing lists) 的相容性?
    • Google Groups 大概沒問題了,會有問題的大概只剩下 PTT 一類與 netnews 直接對轉的BBS
其他的理由請見原信,如果沒有人反對的話我就找個時間來把預設編碼換成 UTF-8 囉。

Peter


---------- Forwarded message ----------
From: Henri Sivonen <hsiv...@hsivonen.fi>
Date: 2014-01-31
Subject: Please consider if your locale could default to UTF-8 for outgoing email
To: dev-...@lists.mozilla.org, dev-apps-t...@lists.mozilla.org


Yesterday, the default character encoding for new outgoing email in the
en-US localization of Thunderbird changed from ISO-8859-1 (which really
means windows-1252 labeled as ISO-8859-1) to UTF-8. (Thanks Magnus!)
Previously, many other localizations had already set the default
character encoding for new outgoing email to UTF-8. This localizations are:
Arabic
Belarusian
Greek
Farsi
Georgian
Macedonian
South Ndebele
Panjabi
Polish
Romanian
Russian
Serbian
Swati
Southern Sotho
Tsonga
Venda
Vietnamese
Xhosa
Traditonal Chinese
Zulu

Note that this isn't just a matter of African languages that have to use
UTF-8 using UTF-8. The above list includes languages that do have
Thunderbird-supported legacy encodings (Arabic, Belarusian, Greek,
Farsi, Macedonian, Polish, Romanian, Russian, Serbian, Vietnamese,
Traditional Chinese).

I encourage other localizations to consider whether they'd be ready to
change the default for outgoing email (mailnews.send_default_charset in
messenger.properties) to UTF-8.

The only reason not to would be regionally still-popular UTF-8-incapable
email clients. (Unlikely at this day and age.)

In particular, since the Belarusian, Macedonian, Russian and Serbian
localizations have already been able to make the switch to UTF-8, it is
very probable that the Bulgarian and Ukrainian localizations could do
so, too, at this time.

I'd also like to especially encourage the German, Italian and Breton
localizations to consider if they could make the switch to UTF-8. These
three locales currently default to ISO-8859-15 for outgoing email. This
doesn't make sense, because:

 1) ISO-8859-15 is a younger encoding than UTF-8. If you are worrying
about UTF-8 being too new to be supported, on newness grounds, you
should worry even more about ISO-8859-15!

 2) Compared to ISO-8859-1, ISO-8859-15 was motivated by three things:
less common French letters mysteriously omitted from ISO-8859-1,
transliterations in the Finnish context (totally missing the point of
transliterations!) and the euro sign. When the French and Finnish
localizations as well as many other localizations used in the Eurozone
don't need to default to ISO-8859-15, German, Italian and Breton almost
certainly don't need to, either. But instead of changing back to
ISO-8859-1, let's treat this as an opportunity to switch to UTF-8.

- -

Currently, Thunderbird already silently upgrades a message to UTF-8 even
when the default is something else if the message contains characters
that can't be expressed in the preferred legacy encoding, so it's not
like the defaults prevent people from writing stuff in email. So why
bother changing the default then?

The reason why Thunderbird doesn't always use UTF-8 for outgoing email
is that way back when UTF-8 support was initially added to email
clients, older email clients out there didn't all already supports
UTF-8, so it makes sense to use already-supported legacy encodings when
possible. But that was a long ago. UTF-8 was invented over 20 years ago
and at least for the past decade, the ability to receive UTF-8 has been
commonplace. We should stop pretending that UTF-8 avoidance still adds
value when pretty much everyone already uses email clients capable of
receiving UTF-8. The Gaia email client already simply uses UTF-8 for all
outgoing email.

The UTF-8 avoidance capabilities don't come for free. In addition to
having to maintain encoders for legacy encodings (which needs to happen
to support form submissions in Firefox anyway) and having code
complexity to deal with the silent upgrade to UTF-8 (which isn't that
much additional code complexity considering what Firefox form
submissions require anyway) it means having to maintain the capability
of the HTML serializers (for HTML mail) to output encodings other than
UTF-8 and it comes with the UI burden (both in terms of adding more
stuff that the user may end up thinking about and in terms of adding a
ball and chain legacy to code maintenance) since we seem to feel that if
there is the possibility to use encodings other than UTF-8, there should
be UI for it. (My primary interest in this is the inertia that
Thunderbird having UI for this imposes on making encoding-related
changes to Firefox.)

So the two previous paragraphs seem to argue for removing the
configurability and just always sending UTF-8. What's this about
localizations changing the default configuration then?

Changing the default configuration is a first step that makes reducing
configurability and the associated code complexity less scary in the
future. Making a change like this product-wide regardless of locale
tends to bring up concerns that maybe the email client environment for
some locale out there isn't ready yet. As more and more locales make the
call that they can make the switch and default outgoing email to UTF-8,
the concern of the form "Where *I* live, we don't have problems with
this but I *worry* that *they* over *there* do." becomes less of a
source of inertia.

--
Henri Sivonen
hsiv...@hsivonen.fi
_______________________________________________
dev-l10n mailing list
dev-...@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-l10n



--
Peter Pin-Guang Chen (petercpg)
Chinese (Traditional) [zh-TW] Localization Project Coordinator
Mozilla Taiwan Community (MozTW) http://moztw.org

Timothy Chien

unread,
Feb 1, 2014, 7:33:29 AM2/1/14
to moztw-...@googlegroups.com
> Previously, many other localizations had already set the default character encoding for new outgoing email to UTF-8. This localizations are:
> Traditional Chinese


我以為我們已經換了?
> --
> 您已訂閱「Google 網上論壇」的「moztw-general: 摩茲將軍」群組,因此我們特別傳送這封郵件通知您。
> 如要取消訂閱這個群組並停止接收來自這個群組的郵件,請傳送電子郵件到
> moztw-genera...@googlegroups.com
> 如要在此群組張貼留言,請傳送電子郵件至 moztw-...@googlegroups.com
> 如要在網路上查看這項討論,請造訪
> https://groups.google.com/d/msgid/moztw-general/CAPd3dBkg9RyNjYcQ9JgT-McO8BzQ5m964EgFvG_e8%3DAdxJjBaQ%40mail.gmail.com
> 如需更多選項,請前往:https://groups.google.com/groups/opt_out

Peter Chen

unread,
Feb 1, 2014, 10:35:47 AM2/1/14
to MozTW
對欸,我發現我們從古早以來就是這樣了,那就...case closed XD

目前的情況是寄信預設 UTF-8,讀信預設 Big5 (為了保持與舊軟體跟舊信件的相容性)


Peter

2014-02-01 Timothy Chien <timd...@gmail.com>:
Reply all
Reply to author
Forward
0 new messages