Hi all,
I'm pleased to announce the release of a new version of gensim!
http://radimrehurek.com/gensim/
You can get 0.8.6 from the usual places (pip, PyPI, GitHub, ...).
Quoting from the CHANGELOG:
* added HashDictionary (by Homer Strong)
* support for adding target classes in SVMlight format (by Corrado Monti)
* fixed problems with global lemmatizer object when running in parallel on Windows
* parallelization of Wikipedia processing + added script version that lemmatizes the input documents
* added class method to initialize Dictionary from an existing corpus (by Marko Burjek)
The new HashDictionary by Homer is especially cool; check out
http://en.wikipedia.org/wiki/Hashing-Trick and gensim's API docs. It
lets you use new (previously unseen) words in your BoW model, without
re-creating/updating any Dictionary.
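To give a rough idea of how the hashing trick works, here is a minimal pure-Python sketch. It is not gensim's actual code: the names `token_id`, `doc2bow` and `ID_RANGE` are made up for illustration, and the choice of `zlib.adler32` as the hash function and 32000 as the number of buckets are assumptions. The point is only that a word's id is computed from the word itself, so no stored vocabulary has to be updated when a new word shows up.

```python
import zlib
from collections import Counter

# Hypothetical number of hash buckets (assumption for this sketch).
ID_RANGE = 32000


def token_id(token, id_range=ID_RANGE):
    """Map a token to an integer id by hashing it; no vocabulary lookup needed."""
    return zlib.adler32(token.encode("utf8")) % id_range


def doc2bow(tokens, id_range=ID_RANGE):
    """Convert a list of tokens into a sorted list of (token_id, count) pairs."""
    counts = Counter(token_id(t, id_range) for t in tokens)
    return sorted(counts.items())


# A document made of "known" words.
seen = doc2bow(["human", "computer", "human"])

# A word that never appeared before still gets a valid id straight away.
unseen = doc2bow(["zeitgeist"])

print(seen)
print(unseen)
```

The trade-off is that two different words can land in the same bucket (a collision), so ids are no longer guaranteed to be unique per word; a larger id range makes this less likely.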
Enjoy,
Radim