Hi GSEA Team,
I would like to do GSEA on a large single-cell data set (~600,000 cells) using some custom gene sets. I was looking for a way to do the analysis in a timely manner as the FindAllMarkers command from Seurat was taking a very long time to rank the genes. I came across the tutorial linked below that uses the wilcoxonauc command from the presto package to rank genes and the fgsea package to perform the actual enrichment analysis.
In the tutorial, the AUC statistic is used to rank the genes and as input to the GSEA. Is the AUC/Wilcoxon rank sum test a reasonable metric to use? The wilcoxauc command also provides a logFC variable. However, when I use the logFC to rank the genes, I get larger p-values. I would like to maximize sensitivity so would prefer to use AUC, but want to make sure this is a sound method. Any info you could provide would be greatly appreciated!
Link to tutorial: https://crazyhottommy.github.io/scRNA-seq-workshop-Fall-2019/scRNAseq_workshop_3.html
--
You received this message because you are subscribed to the Google Groups "gsea-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gsea-help+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/2ccbce25-f8e2-4786-93ba-bccee08e4d5en%40googlegroups.com.
Hi Anthony,
Thank you so much for your reply! Would it make sense to filter out genes with zero/no expression before running the analysis? If so, do you have any suggested criteria for filtering?
I have tried a few different methods of measuring activity of gene sets in single cell data including the AUCell package and Seurat’s AddModuleScore. The results were consistent with the GSEA in that the same cell types that had the greatest activity with these two methods were the same cell types with the greatest normalized enrichment scores. I just wanted to confirm that the Wilcoxon rank sum test/AUC makes sense as a metric to use for ranking.
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego