[ANN] charset-detector

23 views
Skip to first unread message

Denis Bardadym

unread,
Oct 22, 2014, 8:01:01 AM10/22/14
to nod...@googlegroups.com
Hi group.

I did a port of ICU character detection to javascript. (the same charset detection used in chromium e.g)

NPM module: charset-detector

I did it for several reasons: to replace my binding for icu4c in one of my projects, to learn a bit about encodings and as i have some free time.

I know that exists already jschardet - which is a port of mozilla charset detector, but 
by license MIT (my) vs LGPL
by speed my ~30% faster (i did simple benchmarks of utf8 detection and several multibyte encodings)
by list of supported encodings: jschardet knows more =) (if the ICU would have it, I would have also ported =)


Reply all
Reply to author
Forward
0 new messages