I wouldn't call these topics trash topics, though. I'm only training LDA with 25 topics, and all of them are relatively coherent. That is, within each topic the words tend to agree with each other, at least in my opinion. I am not returning topics where the words have no relationship whatsoever. The only issue I've seen is word chaining, where there appears to be two distinct groupings of words in a topic that are "attached" by a single word they share in common.
This is a problem, because models I train on the same data and the same parameters appear to chain topics in different ways. I'm using my LDA topics to categorize documents, so I'm trying to minimize this chaining issue as much as possible.
Running my LDA models with more training passes seems to be helping me out. Models trained with 500 passes are more consistent than those trained with 150 passes (using the code in my fork), though I need to study this further to be confident. However, even with 500 passes I'm noticing chaining, so I'm wondering whether models with a few hundred more passes would work better. This is the next thing I'm going to study. Is there any downside to training models this long? I assume that it depends on the corpus, but is there any danger of "overtraining" my models that I should know of?
Thanks so much for the help.