PyYoshi,
I have integrated cchardet support into Beautiful Soup and it will be
in the next release. You can see my code in revision 246:
http://bazaar.launchpad.net/~leonardr/beautifulsoup/bs4/revision/246
Speaking as a user, I have some feedback about the installation
instructions. I've never installed a Cython extension before, and it
took me a while to discover that the installation process requires two
dependencies other than Cython.
I'm running Ubuntu, and in addition to the 'cython' package I had to
install the "python-dev" package and the "g++" package before 'pip
install -U cchardet' would run to completion. It would be helpful to
have this information alongside the Cython dependency.
My only feedback about the library itself is that it's unfortunate
that it doesn't have the same API as chardet. In particular,
chardet.detect() returns a dictionary and cchardet.detect() returns a
string.
I understand that the API is different because cchardet is based on
the libcharsetdetect library, and not a C implementation of the
chardet algorithm. But since the library is called 'cchardet' I
thought it would look more like chardet.
Thanks again for creating this outstanding library.
Leonard