On Windows, Anaconda x64 3.3:
[py3k] C:\gensim-py3k>python setup.py test
running test
running egg_info
creating gensim.egg-info
writing requirements to gensim.egg-info\requires.txt
writing gensim.egg-info\PKG-INFO
writing dependency_links to gensim.egg-info\dependency_links.txt
writing top-level names to gensim.egg-info\top_level.txt
writing manifest file 'gensim.egg-info\SOURCES.txt'
reading manifest file 'gensim.egg-info\SOURCES.txt'
reading manifest template 'MANIFEST.in'
warning: no files found matching '*.sh' under directory '.'
writing manifest file 'gensim.egg-info\SOURCES.txt'
running build_ext
test_load (gensim.test.test_corpora.TestBleiCorpus) ... ok
test_save (gensim.test.test_corpora.TestBleiCorpus) ... ok
test_serialize (gensim.test.test_corpora.TestBleiCorpus) ... ok
test_serialize_compressed (gensim.test.test_corpora.TestBleiCorpus) ... ok
test_load (gensim.test.test_corpora.TestLowCorpus) ... ok
test_save (gensim.test.test_corpora.TestLowCorpus) ... ok
test_serialize (gensim.test.test_corpora.TestLowCorpus) ... ok
test_serialize_compressed (gensim.test.test_corpora.TestLowCorpus) ... ok
test_load (gensim.test.test_corpora.TestMalletCorpus) ... ok
test_load_with_metadata (gensim.test.test_corpora.TestMalletCorpus) ... ok
test_save (gensim.test.test_corpora.TestMalletCorpus) ... ok
test_serialize (gensim.test.test_corpora.TestMalletCorpus) ... ok
test_serialize_compressed (gensim.test.test_corpora.TestMalletCorpus) ... ok
test_load (gensim.test.test_corpora.TestMmCorpus) ... ok
test_save (gensim.test.test_corpora.TestMmCorpus) ... ok
test_serialize (gensim.test.test_corpora.TestMmCorpus) ... c:\gensim-py3k\gensim
\corpora\indexedcorpus.py:118: ResourceWarning: unclosed file <_io.TextIOWrapper
name='c:\\users\\svansc~1\\appdata\\local\\temp\\2\\gensim_corpus.tst' mode='r'
encoding='cp1252'>
return self.docbyoffset(self.index[docno])
ok
test_serialize_compressed (gensim.test.test_corpora.TestMmCorpus) ... ok
test_load (gensim.test.test_corpora.TestSvmLightCorpus) ... ok
test_save (gensim.test.test_corpora.TestSvmLightCorpus) ... ok
test_serialize (gensim.test.test_corpora.TestSvmLightCorpus) ... ok
test_serialize_compressed (gensim.test.test_corpora.TestSvmLightCorpus) ... ok
test_load (gensim.test.test_corpora.TestUciCorpus) ... ok
test_save (gensim.test.test_corpora.TestUciCorpus) ... ok
test_serialize (gensim.test.test_corpora.TestUciCorpus) ... ok
test_serialize_compressed (gensim.test.test_corpora.TestUciCorpus) ... ok
testBuild (gensim.test.test_corpora_dictionary.TestDictionary) ... ok
testDocFreqAndToken2IdForSeveralDocsWithOneWord (gensim.test.test_corpora_dictio
nary.TestDictionary) ... ok
testDocFreqForOneDocWithSeveralWord (gensim.test.test_corpora_dictionary.TestDic
tionary) ... ok
testDocFreqOneDoc (gensim.test.test_corpora_dictionary.TestDictionary) ... ok
testFilter (gensim.test.test_corpora_dictionary.TestDictionary) ... ok
test_dict_interface (gensim.test.test_corpora_dictionary.TestDictionary)
Test Python 2 dict-like interface in both Python 2 and 3. ... ok
test_doc2bow (gensim.test.test_corpora_dictionary.TestDictionary) ... ok
test_from_corpus (gensim.test.test_corpora_dictionary.TestDictionary)
build `Dictionary` from an existing corpus ... ok
test_saveAsText_and_loadFromText (gensim.test.test_corpora_dictionary.TestDictio
nary)
`Dictionary` can be saved as textfile and loaded again from textfile. ... ok
testBuild (gensim.test.test_corpora_hashdictionary.TestHashDictionary) ... ok
testDebugMode (gensim.test.test_corpora_hashdictionary.TestHashDictionary) ... o
k
testDocFreqAndToken2IdForSeveralDocsWithOneWord (gensim.test.test_corpora_hashdi
ctionary.TestHashDictionary) ... ok
testDocFreqForOneDocWithSeveralWord (gensim.test.test_corpora_hashdictionary.Tes
tHashDictionary) ... ok
testDocFreqOneDoc (gensim.test.test_corpora_hashdictionary.TestHashDictionary) .
.. ok
testFilter (gensim.test.test_corpora_hashdictionary.TestHashDictionary) ... ok
testRange (gensim.test.test_corpora_hashdictionary.TestHashDictionary) ... ok
test_saveAsText (gensim.test.test_corpora_hashdictionary.TestHashDictionary)
`HashDictionary` can be saved as textfile. ... ERROR
test_saveAsTextBz2 (gensim.test.test_corpora_hashdictionary.TestHashDictionary)
`HashDictionary` can be saved & loaded as compressed pickle. ... ok
test_corpus (gensim.test.test_lee.TestLeeTest)
availability and integrity of corpus ... ok
test_lee (gensim.test.test_lee.TestLeeTest)
correlation with human data > 0.6 ... ok
test_miislita_high_level (gensim.test.test_miislita.TestMiislita) ... ok
test_save_load_ability (gensim.test.test_miislita.TestMiislita) ... c:\gensim-py
3k\gensim\interfaces.py:60: UserWarning: corpus.save() stores only the (tiny) it
eration object; to serialize the actual corpus content, use e.g. MmCorpus.serial
ize(corpus)
warnings.warn("corpus.save() stores only the (tiny) iteration object; "
ok
test_textcorpus (gensim.test.test_miislita.TestMiislita)
Make sure TextCorpus can be serialized to disk. ... ok
testLargeMmap (gensim.test.test_models.TestLdaMallet) ... ok
testPersistence (gensim.test.test_models.TestLdaMallet) ... ok
testTransform (gensim.test.test_models.TestLdaMallet) ... ok
testLargeMmap (gensim.test.test_models.TestLdaModel) ... WARNING:gensim.models.l
damodel:no word id mapping provided; initializing from corpus, assuming identity
WARNING:gensim.models.ldamodel:too few updates, training might not converge; con
sider increasing the number of passes or iterations to improve accuracy
c:\gensim-py3k\gensim\models\ldamodel.py:636: DeprecationWarning: using a non-in
teger number instead of an integer will result in an error in the future
score += numpy.sum(cnt * logsumexp(Elogthetad + Elogbeta[:, id]) for id, cnt i
n doc)
ok
testPersistence (gensim.test.test_models.TestLdaModel) ... WARNING:gensim.models
.ldamodel:no word id mapping provided; initializing from corpus, assuming identi
ty
WARNING:gensim.models.ldamodel:too few updates, training might not converge; con
sider increasing the number of passes or iterations to improve accuracy
ok
testTransform (gensim.test.test_models.TestLdaModel) ... ok
testPersistence (gensim.test.test_models.TestLogEntropyModel) ... ok
testTransform (gensim.test.test_models.TestLogEntropyModel) ... ok
testCorpusTransform (gensim.test.test_models.TestLsiModel)
Test lsi[corpus] transformation. ... WARNING:gensim.models.lsimodel:no word id m
apping provided; initializing from corpus, assuming identity
ok
testLargeMmap (gensim.test.test_models.TestLsiModel) ... WARNING:gensim.models.l
simodel:no word id mapping provided; initializing from corpus, assuming identity
C:\Anaconda\envs\py3k\lib\site-packages\scipy\sparse\compressed.py:122: UserWarn
ing: indices array has non-integer dtype (float64)
ok
testOnlineTransform (gensim.test.test_models.TestLsiModel) ... WARNING:gensim.mo
dels.lsimodel:no word id mapping provided; initializing from corpus, assuming id
entity
ok
testPersistence (gensim.test.test_models.TestLsiModel) ... WARNING:gensim.models
.lsimodel:no word id mapping provided; initializing from corpus, assuming identi
ty
ok
testTransform (gensim.test.test_models.TestLsiModel)
Test lsi[vector] transformation. ... WARNING:gensim.models.lsimodel:no word id m
apping provided; initializing from corpus, assuming identity
ok
testPersistence (gensim.test.test_models.TestRpModel) ... ok
testTransform (gensim.test.test_models.TestRpModel) ... ok
testInit (gensim.test.test_models.TestTfidfModel) ... ok
testPersistence (gensim.test.test_models.TestTfidfModel) ... ok
testTransform (gensim.test.test_models.TestTfidfModel) ... ok
testSplitAlphanum (gensim.test.test_parsing.TestPreprocessing) ... ok
testStemText (gensim.test.test_parsing.TestPreprocessing) ... ok
testStripMultipleWhitespaces (gensim.test.test_parsing.TestPreprocessing) ... ok
testStripNonAlphanum (gensim.test.test_parsing.TestPreprocessing) ... ok
testStripNumeric (gensim.test.test_parsing.TestPreprocessing) ... ok
testStripShort (gensim.test.test_parsing.TestPreprocessing) ... ok
testStripStopwords (gensim.test.test_parsing.TestPreprocessing) ... ok
testStripTags (gensim.test.test_parsing.TestPreprocessing) ... ok
testChunking (gensim.test.test_similarities.TestMatrixSimilarity) ... ok
testFull (gensim.test.test_similarities.TestMatrixSimilarity) ... ok
testIter (gensim.test.test_similarities.TestMatrixSimilarity) ... ok
testLarge (gensim.test.test_similarities.TestMatrixSimilarity) ... ok
testMmap (gensim.test.test_similarities.TestMatrixSimilarity) ... ok
testNumBest (gensim.test.test_similarities.TestMatrixSimilarity) ... ok
testPersistency (gensim.test.test_similarities.TestMatrixSimilarity) ... ok
testChunking (gensim.test.test_similarities.TestSimilarity) ... ok
testFull (gensim.test.test_similarities.TestSimilarity) ... ok
testIter (gensim.test.test_similarities.TestSimilarity) ... ok
testLarge (gensim.test.test_similarities.TestSimilarity) ... ok
testMmap (gensim.test.test_similarities.TestSimilarity) ... ok
testNumBest (gensim.test.test_similarities.TestSimilarity) ... ok
testPersistency (gensim.test.test_similarities.TestSimilarity) ... ok
testReopen (gensim.test.test_similarities.TestSimilarity)
test re-opening partially full shards ... ok
testSharding (gensim.test.test_similarities.TestSimilarity) ... ok
testChunking (gensim.test.test_similarities.TestSparseMatrixSimilarity) ... ok
testFull (gensim.test.test_similarities.TestSparseMatrixSimilarity) ... ok
testIter (gensim.test.test_similarities.TestSparseMatrixSimilarity) ... ok
testLarge (gensim.test.test_similarities.TestSparseMatrixSimilarity) ... ok
testMmap (gensim.test.test_similarities.TestSparseMatrixSimilarity) ... ok
testNumBest (gensim.test.test_similarities.TestSparseMatrixSimilarity) ... ok
testPersistency (gensim.test.test_similarities.TestSparseMatrixSimilarity) ... o
k
test_None (gensim.test.test_utils.TestIsCorpus) ... ok
test_int_tuples (gensim.test.test_utils.TestIsCorpus) ... ok
test_invalid_formats (gensim.test.test_utils.TestIsCorpus) ... ok
test_simple_lists_of_tuples (gensim.test.test_utils.TestIsCorpus) ... ok
testLargeMmap (gensim.test.test_word2vec.TestWord2VecModel)
Test storing/loading the entire model. ... c:\gensim-py3k\gensim\models\word2vec
.py:303: UserWarning: Cython compilation failed, training will be slow. Do you h
ave Cython installed? `pip install cython`
warnings.warn("Cython compilation failed, training will be slow. Do you have C
ython installed? `pip install cython`")
ok
testParallel (gensim.test.test_word2vec.TestWord2VecModel)
Test word2vec parallel training. ... ok
testPersistence (gensim.test.test_word2vec.TestWord2VecModel)
Test storing/loading the entire model. ... ok
testPersistenceWord2VecFormat (gensim.test.test_word2vec.TestWord2VecModel)
Test storing/loading the entire model in word2vec format. ... ok
testPersistenceWord2VecFormatWithVocab (gensim.test.test_word2vec.TestWord2VecMo
del)
Test storing/loading the entire model and vocabulary in word2vec format. ... ok
testRNG (gensim.test.test_word2vec.TestWord2VecModel)
Test word2vec results identical with identical RNG seed. ... ok
testTraining (gensim.test.test_word2vec.TestWord2VecModel)
Test word2vec training. ... WARNING:gensim.models.word2vec:consider setting laye
r size to a multiple of 4 for greater performance
WARNING:gensim.models.word2vec:consider setting layer size to a multiple of 4 fo
r greater performance
ok
testTrainingCbow (gensim.test.test_word2vec.TestWord2VecModel)
Test CBOW word2vec training. ... WARNING:gensim.models.word2vec:consider setting
layer size to a multiple of 4 for greater performance
WARNING:gensim.models.word2vec:consider setting layer size to a multiple of 4 fo
r greater performance
ok
testVocab (gensim.test.test_word2vec.TestWord2VecModel)
Test word2vec vocabulary building. ... ok
testLineSentenceWorksWithCompressedFile (gensim.test.test_word2vec.TestWord2VecS
entenceIterators)
Does LineSentence work with a compressed file object argument? ... ok
testLineSentenceWorksWithFilename (gensim.test.test_word2vec.TestWord2VecSentenc
eIterators)
Does LineSentence work with a filename argument? ... ok
testLineSentenceWorksWithNormalFile (gensim.test.test_word2vec.TestWord2VecSente
nceIterators)
Does LineSentence work with a file object argument, rather than filename? ... ok
======================================================================
ERROR: test_saveAsText (gensim.test.test_corpora_hashdictionary.TestHashDictiona
ry)
`HashDictionary` can be saved as textfile.
----------------------------------------------------------------------
Traceback (most recent call last):
File "c:\gensim-py3k\gensim\test\test_corpora_hashdictionary.py", line 167, in
test_saveAsText
d.save_as_text(tmpf)
File "c:\gensim-py3k\gensim\corpora\hashdictionary.py", line 231, in save_as_t
ext
fout.write("%i\t%i\t%s\n" % (tokenid, self.dfs.get(tokenid, 0), '\t'.join(wo
rds_df)))
File "C:\Anaconda\envs\py3k\lib\encodings\cp1252.py", line 19, in encode
return codecs.charmap_encode(input,self.errors,encoding_table)[0]
UnicodeEncodeError: 'charmap' codec can't encode character '\u0165' in position
10: character maps to <undefined>
----------------------------------------------------------------------
Ran 113 tests in 15.680s
FAILED (errors=1)
[py3k] C:\gensim-py3k>