OK, so I double-checked that running in the virtual environment isn't what is causing the problem. When I run outside of it, I also get 26 processes allocated to a single processor (I have 24 processors). The output of ps looks like:
>> ps -F -A
UID PID PPID C SZ RSS PSR STIME TTY TIME CMD
[*snip*]
odemasi 61669 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61670 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61671 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61672 59981 0 2738764 9821696 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61673 59981 0 2738764 9821696 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61674 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61675 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61676 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61681 59981 0 2738764 9821680 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61682 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61683 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61684 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61685 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61686 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61687 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61688 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61689 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61694 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61698 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61699 59981 0 2738764 9821704 23 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61700 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61701 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61702 59981 0 2738764 9821696 14 03:42 pts/5 00:00:00 python RunLDA.py 2
odemasi 61703 59981 0 2738764 9821696 14 03:42 pts/5 00:00:00 python RunLDA.py 2
[*snip*]
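For what it's worth, here is a rough sketch of how the PSR column (the processor each process was last assigned to) could be tallied from a listing like the one above, rather than eyeballing it. The helper name `tally_psr` and the `sample` list are only for illustration, and it assumes the `ps -F` column layout shown in the header line, with PSR as the seventh field.

```python
def tally_psr(ps_lines):
    """Count processes per processor from `ps -F` style output lines.

    Assumes the column order UID PID PPID C SZ RSS PSR STIME TTY TIME CMD.
    """
    counts = {}
    for line in ps_lines:
        fields = line.split()
        # Skip the header, "[*snip*]" markers, and anything too short to parse.
        if len(fields) < 11 or not fields[1].isdigit() or not fields[6].isdigit():
            continue
        psr = int(fields[6])
        counts[psr] = counts.get(psr, 0) + 1
    return counts


# Three rows copied from the listing above, plus the header line.
sample = [
    "UID PID PPID C SZ RSS PSR STIME TTY TIME CMD",
    "odemasi 61698 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2",
    "odemasi 61699 59981 0 2738764 9821704 23 03:42 pts/5 00:00:00 python RunLDA.py 2",
    "odemasi 61700 59981 0 2738764 9821704 14 03:42 pts/5 00:00:00 python RunLDA.py 2",
]

psr_counts = tally_psr(sample)
print(sorted(psr_counts.items()))  # [(14, 2), (23, 1)]
```

On the full listing this would show nearly every worker sitting on processor 14, with a single one on processor 23.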
The standard output I'm getting is:
/home/odemasi/Packages/venv/lib/python2.6/site-packages/numpy/lib/utils.py:95: DeprecationWarning: `scipy.sparse.sparsetools` is deprecated!
scipy.sparse.sparsetools is a private module for scipy.sparse, and should not be used.
warnings.warn(depdoc, DeprecationWarning)
/home/odemasi/Packages/venv/lib/python2.6/site-packages/scipy/lib/_util.py:67: DeprecationWarning: Module scipy.linalg.blas.fblas is deprecated, use scipy.linalg.blas instead
DeprecationWarning)
2015-06-25 03:36:38,835 : INFO : adding document #0 to Dictionary(0 unique tokens: [])
2015-06-25 03:39:34,893 : INFO : built Dictionary(5060602 unique tokens: [u'loyalsubscribers', u'iftheyclosedchipotleiddie', u'\u666e\u6bb5\u306e\u53e3\u8abf\u3067\u4f55\u6ce3\u3044\u3066\u308b\u3093\u3067\u3059\u304b\u79c1\u306f\u3069\u3053\u306b\u3082\u884c\u304d\u307e\u305b\u3093\u304b\u3089\u5927\u4e08\u592b\u3067\u3059\u3092\u8a00\u3046', u'deargodmakeatrade', u'billycorgan']...) from 1 documents (total 5060602 corpus positions)
2015-06-25 03:39:36,283 : INFO : using symmetric alpha at 0.01
2015-06-25 03:39:36,283 : INFO : using serial LDA version on this node
2015-06-25 03:42:20,479 : WARNING : input corpus stream has no len(); counting documents
2015-06-25 03:42:25,018 : INFO : running online LDA training, 100 topics, 1 passes over the supplied corpus of 100000 documents, updating every 48000 documents, evaluating every ~100000 documents, iterating 50x with a convergence threshold of 0.001000
2015-06-25 03:42:25,018 : WARNING : too few updates, training might not converge; consider increasing the number of passes or iterations to improve accuracy
2015-06-25 03:42:25,023 : INFO : training LDA model using 24 processes
2015-06-25 03:42:27,407 : INFO : PROGRESS: pass 0, dispatched chunk #0 = documents up to #2000/100000, outstanding queue size 1