Comment #1 by srowen:
This is a tough one.
Really, you can't use UTF-8 in a QR Code. QR Code 'byte mode' assumes
ISO-8859-1 by
default. QR Code provides allows for defining character encoding by use of
ECI
segments (see the spec). The decoder does support these. But I do not know
of a
character set ECI that selects UTF-8.
The decoder does try to guess the encoding though, since in practice, many
QR Codes
just use Shift_JIS in byte mode (instead of Kanji mode) instead of
ISO-8859-1. The
decoder even tries to guess UTF-8.
The decoder will guess UTF-8 if the bytes start with a UTF-8 byte order
mark, but
this one doesn't.
The problem is that this short message encoded in UTF-8 is the valid
encoding of a
string in Shift_JIS, so that is what is guessed in this case.
So, I am saying this symbol is not correctly encoded. To address it:
1) add a UTF-8 byte order mark at the start (EF BB BF) or
2) specify UTF-8 with an ECI segment (and then let me know what it is so we
can
support it since i've not found this value yet!) or
3) use an alternate encoding for Chinese, one that is supported by ECI, and
define
the character set via ECI in the encoding
Issue attribute updates:
Status: WontFix
Owner: srowen