Daniel Ehrenberg
unread,Jan 12, 2016, 2:31:29 AM1/12/16Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Sign in to report message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to v8-u...@googlegroups.com
Hi V8 users,
V8 has its own built-in, custom, minimal Unicode library called
Unibrow. I am looking into replacing Unibrow with the ICU library.
There would be a fallback for non-ICU builds which supports the UTF-8
and UTF-16 encodings for source code, but only gets operations like
case mapping right on ASCII. This has a few advantages:
- The long-term maintenance burden of our separate Unicode
implementation would be removed, freeing up effort to work on other
efforts, including better internationalization.
- Unibrow has a few long-standing bugs, and sometimes lags the ICU
version in the same Chrome build. Replacing it with ICU or ASCII
support, depending on build-time flags, would remove these bugs.
- The binary size would be slightly reduced, especially on builds
without internationalization enabled.
I'm wondering whether anyone who builds without ICU depends on the
following features, which I'm proposing to make dependent on ICU:
- Unicode-aware case mapping (e.g., String.prototype.toUpperCase())
- JavaScript code with non-ASCII identifiers
- Unicode-aware case-insensitive RegExp matching
Thanks,
Dan