Hi Jisoo,
On Thu, Mar 27, 2014 at 3:28 AM, Jisoo Park <xxx...@gmail.com> wrote:
> Hi Johannes,
>
> Thanks for your detailed response.
>
> What I found 'natural' came from the previous discussion [1] about changing
> the default encoding. It's not exactly the same case, however:
>
> 1) Many browsers and libraries tend to set the filename field without a
> proper encoding parameter, regardless of the RFC.
Indeed, that's what I also noticed when running a quick test. The
browser I tested with seemed to try an ISO-8859-1 encoding and replace
every character outside that encoding with '?'. That's probably a
consequence of what I've described here:
https://github.com/spray/spray/issues/526
So, if you want to support the full range of UTF-8 characters, *the HTML
page containing the form* must be encoded as UTF-8. Have you tried
that? (I tried it quickly, but it doesn't decode correctly.)
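As a rough illustration of the '?' replacement, here is a small Python sketch (not spray code). It only simulates what a browser might do when it squeezes a filename into ISO-8859-1. The filename is made up, and the assumption that the browser substitutes exactly one '?' per unsupported character is mine, based on the quick test above.

```python
# Hypothetical filename mixing characters inside and outside ISO-8859-1.
filename = "résumé_한글.txt"

# errors="replace" substitutes '?' for every character the charset cannot
# represent. This mimics the behaviour observed in the browser test; it is
# an assumption that real browsers do exactly the same.
as_latin1 = filename.encode("iso-8859-1", errors="replace")

# UTF-8 can represent every character, so nothing is replaced.
as_utf8 = filename.encode("utf-8")


def survives(encoded: bytes, charset: str) -> bool:
    """Return True if decoding the bytes gives back the original filename."""
    return encoded.decode(charset) == filename


print(as_latin1)                          # the two Hangul characters become '?'
print(survives(as_latin1, "iso-8859-1"))  # information was lost on encoding
print(survives(as_utf8, "utf-8"))         # round-trips without loss
```

Once the '?' bytes are on the wire, no choice of charset on the server side can bring the original characters back, which is why the encoding of the page matters.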
> 2) UTF-8 is a superset of ISO-8859-1
Superset in which regard? UTF-8 may support a superset of ISO-8859-1's
characters, but it's not a superset in the sense that every valid
ISO-8859-1 byte sequence is also valid UTF-8. ISO-8859-1 uses all 8
bits of a byte to encode a single character, while in UTF-8 a byte with
the highest-order bit set always has a special meaning: it is part of a
multi-byte sequence.
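
To make the byte-level difference concrete, here is a minimal Python sketch (again not spray code). The helper name `is_valid_utf8` is just something I made up for the example.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes form a well-formed UTF-8 sequence."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Pure ASCII is encoded identically in both charsets.
ascii_latin1 = "abc".encode("iso-8859-1")

# 'é' is the single byte 0xE9 in ISO-8859-1 ...
e_latin1 = "é".encode("iso-8859-1")
# ... but the two-byte sequence 0xC3 0xA9 in UTF-8.
e_utf8 = "é".encode("utf-8")

print(is_valid_utf8(ascii_latin1))  # ASCII bytes are valid UTF-8
print(is_valid_utf8(e_latin1))      # a lone 0xE9 is not valid UTF-8
print(e_latin1.hex(), e_utf8.hex())
```

So the two encodings agree only on the ASCII range. Any ISO-8859-1 byte at 0x80 or above either fails to decode as UTF-8 or, in combination with neighbouring bytes, decodes to a different character.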