Alas, these are some early experiments - so far I am judging manually... I'll get some documents of varying length and feed them to the models.
The contents of the documents are known to me (for example, a software license agreement, a sports commentary, cooking recipes etc.), so I know what to more or less expect of the resultant topic distributions. For instance, as a representative example, below is an English outcome for GNU license text. This is nothing short of brilliant, and exactly what one might expect - some computer/software/technology topic, a law/legal topic and a business/financial topic.
The corresponding result for the German version of the document (this is what I am actually trying to do - compare topic distributions for same document translations across languages) is not exactly completely off - but most definitely not spot on like the English one either. This has always been the case in my tests thus far. I am aware that this is perfectly subjective view, but that's all I can offer at this point. Anyway, this has led me to believe that I might be doing something wrong (or not doing something necessary) in the preprocessing of the German text, or that I might need to tweak the parameters of the German LDA model creation. I'm currently re-running the LDA creation with 20 passes.
Correlation: 0.521205828707
INFO : [(u'data', 0.026586245172387498), (u'information', 0.015914390641238663), (u'software', 0.012704520236886358), (u'code', 0.010014952166849207), (u'systems', 0.0083467418435341917), (u'source', 0.0069796402967319607), (u'web', 0.0069714605916698967), (u'project', 0.0066037502568369912), (u'open', 0.0056400656073773451), (u'database', 0.005543017976261967)]
INFO : Correlation: 0.247048787896
INFO : [(u'court', 0.022021126302133591), (u'law', 0.020566839303454931), (u'police', 0.016155609527520199), (u'act', 0.012006214393929831), (u'case', 0.0095495376930901047), (u'legal', 0.0067887621798527198), (u'justice', 0.006500539900767625), (u'said', 0.0058811460940946664), (u'prison', 0.005536762183323129), (u'judge', 0.0055237228208750355)]
INFO : Correlation: 0.11054561579
INFO : [(u'financial', 0.0075286310032007114), (u'economic', 0.0065923151961858502), (u'act', 0.0063692287271636845), (u'million', 0.0062029969301934012), (u'bank', 0.0054508460950598945), (u'tax', 0.0050785230719152565), (u'management', 0.0048578108151419016), (u'policy', 0.0046335396392108525), (u'fund', 0.0039710016698683778), (u'market', 0.003874307507573083)]