Hi Terrence,
Is the test you are referring to Vuong's test, from the nonnest2 package?
In my analysis, I noticed that it pretty much always prefers whichever model has fewer manifest variables, so I thought it might not be appropriate to use.
For example, I would have a 1-factor model with 6 manifest variables (indicators) and would drop an indicator that had a low factor loading, high residual correlations, and high modification indices (MIs).
I would then compare the 6-item model to the 5-item model using Vuong's test. In every case, it indicated that the 5-item model was preferred over the 6-item model (even when I instead dropped, e.g., an indicator with a high factor loading and only minor local fit issues).
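
To make concrete what I understand the comparison to compute, here is a minimal Python sketch of the non-nested Vuong z statistic, built from case-wise log-likelihoods. It is not the nonnest2 code (which, as far as I know, also runs a separate variance test first). The function name `vuong_z` and the numbers are made up for illustration. The statistic uses case-by-case differences in log-likelihood, which seems to presuppose that both models describe the same observed variables, and that is exactly what I am unsure about when one model has 6 items and the other 5.

```python
import math
from statistics import NormalDist, mean, pstdev


def vuong_z(loglik_a, loglik_b):
    """Vuong z statistic from per-case log-likelihoods of two non-nested models.

    Positive z favours model A, negative z favours model B.
    Hypothetical helper for illustration; not part of nonnest2 or lavaan.
    """
    if len(loglik_a) != len(loglik_b):
        raise ValueError("both models need one log-likelihood per case")
    # Case-wise log-likelihood differences between the two models
    diffs = [a - b for a, b in zip(loglik_a, loglik_b)]
    n = len(diffs)
    # Population (1/n) standard deviation of the differences
    sd = pstdev(diffs)
    if sd == 0:
        raise ValueError("differences have zero variance; z is undefined")
    z = math.sqrt(n) * mean(diffs) / sd
    # Two-sided p-value from the standard normal reference distribution
    p_two_sided = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_two_sided


# Made-up case-wise log-likelihoods, only to show the call
ll_model_a = [-1.0, -1.2, -0.9, -1.1]
ll_model_b = [-1.3, -1.1, -1.4, -1.2]
z, p = vuong_z(ll_model_a, ll_model_b)
print(f"z = {z:.3f}, two-sided p = {p:.3f}")
```

If the 6-item model's case-wise log-likelihoods cover one more variable than the 5-item model's, the differences would not be on a common scale, which might explain why the smaller model always "wins" in my runs.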