why golang split the katakana range into multirange , and don't include 0x30fb --0x30fc, it should include it..
in golang src/unicode/tables
var _Katakana = &RangeTable{ R16: []Range16{ {0x30a1, 0x30fa, 1}, {0x30fd, 0x30ff, 1}, {0x31f0, 0x31ff, 1}, {0x32d0, 0x32fe, 1}, {0x3300, 0x3357, 1}, {0xff66, 0xff6f, 1}, {0xff71, 0xff9d, 1}, }, R32: []Range32{ {0x1b000, 0x1b000, 1}, },}
% unicode -d -U 30fa 30fb 30fc
U+30FA 'ヺ' KATAKANA LETTER VO
category: Lo
canonical combining classes: 0
bidirectional category: L
character decomposition mapping: 30F2 3099
mirrored: N
U+30FB '・' KATAKANA MIDDLE DOT
category: Po
canonical combining classes: 0
bidirectional category: ON
mirrored: N
U+30FC 'ー' KATAKANA-HIRAGANA PROLONGED SOUND MARK
category: Lm
canonical combining classes: 0
bidirectional category: L
mirrored: N
%
I think the behavior might be a bug, but the character is peculiar, or at least punctuation rather than "letters". Leaving for Marcel, who is the curator of the Unicode packages these days.
-rob
--
You received this message because you are subscribed to the Google Groups "golang-nuts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to golang-nuts...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/golang-nuts/f524ca6d-c98e-43d5-a594-41c3c9481e18%40googlegroups.com.