> From 0.0.2 it would be nice to support RFC 2616 (see #5). However
> there are a few methods to do it:
> - Choose the best language, choose the best charset. Convert string
> into charset changing other characters into HTTP Entities
> Pros: Very simple
> Cons: Chainise language in Japanise Charset will take 7*2=14 bytes
> (assuming 2 byte Japanise charset and 7 char per HTTP Entities
>
> - Choose the best language. Choose the best charset but only if it
> 'supports' the language.
> Pros: The most 'proper' behaviour
> Cons: We would need to store somewhere list of proper charsets and
> maintain them
>
From XMPP conversation with Alex Coles it seems that he is in favour
of this option. However what we should do with the user content data?
We would need to convert them each time on request:
- How will it affect the performance?
- Is it possible in merb?
> - Choose the best language. Send it in UTF-8
> Pros: Very simple
> Cons: UTF-8 may not be supported by older clients
>
I seem to be in favour of this option. It impose certain restriction
on user but:
- utf-8 should be supported by virtually anything (including IE 4)
- π ≈ 3.14159 and similar do not have to be represented in any other
encoding (and for sure not all of them)
> Any other ideas?