Hi Nicola and others,
I am using LEfSe to identify taxonomic biomarkers for 3 different Classes, with no subclass information. Unfortunately, I have some issues with sample size, and I have read to take results from Kruskal-Wallis tests with small sample sizes (<5) with caution. It would be great if you can help me with some questions that I have.
Analysis Details: Class I, n=2, Class II, n=10, Class III, n=12. The result of computing LEfSe (K-W alpha = 0.05, LDA threshold = 3.5) on my dataset identifies: 12 biomarkers for Class I, 10 biomarkers for Class II, and 7 biomarkers for Class III.
How sensitive is LEfSe to small sample sizes and dissimilar sample sizes? Would computing the K-W test at an lower alpha level (say, 0.001) produce more reliable results? Additionally, are there any bootstrapping methods I can apply that would help the situation?
Thanks so much for you help,
Ryan Trexler