Hello Victor,
On Oct 10, 2:10 pm, Victor <vki...@mail.ru> wrote:
> Thank you for explanation. Still argument "passes" is not clear.
> In Hoffman, Blei, Bach: Online Learning for Latent Dirichlet Allocation,
> NIPS 2010 they update lambda (theme generating parameter) after each chunk,
> so only ChunkSize is informative...
`passes` is the number of training passes through the whole corpus,
while `chunksize` is the number of documents in each update. For
example, if the training corpus has 50,000 documents, `chunksize` is
10,000 and `passes` is 2, then online training is done in 10 updates:
#1 documents 0-9,999
#2 documents 10,000-19,999
#3 documents 20,000-29,999
#4 documents 30,000-39,999
#5 documents 40,000-49,999
#6 documents 0-9,999
#7 documents 10,000-19,999
#8 documents 20,000-29,999
#9 documents 30,000-39,999
#10 documents 40,000-49,999
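
If it helps, the schedule above can be reproduced with a few lines of plain Python. This is only a sketch to illustrate the arithmetic, not the library's actual code: `update_schedule` is a hypothetical helper made up for this example, and it assumes the model is updated once after every chunk.

```python
def update_schedule(num_docs, chunksize, passes):
    """Return a list of (update_no, first_doc, last_doc) tuples.

    Hypothetical helper for illustration only; assumes one model
    update per chunk and documents indexed from 0.
    """
    schedule = []
    update_no = 0
    for _ in range(passes):                        # one sweep over the corpus
        for start in range(0, num_docs, chunksize):
            update_no += 1
            # the last chunk may be shorter if chunksize does not divide num_docs
            end = min(start + chunksize, num_docs) - 1
            schedule.append((update_no, start, end))
    return schedule


for update_no, first, last in update_schedule(50_000, 10_000, 2):
    print(f"#{update_no} documents {first:,}-{last:,}")
```

With `passes` set to 1 the same call gives only the first five updates, so each document is seen once; every additional pass repeats the same chunks.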
HTH,
Radim