Hello,
The MSigDB webtools use a hypergeometric overlap statistic test. These form of tests are not particularly robust, and tend to give unreasonably permissive statistics when used with large gene lists. So that, combined with the computational limitations of our webserver being a shared resource have resulted in the submission limit you're hitting here.
You don't necessarily need the full expression data to run GSEA, GSEA supports a mode called "Preranked" where you can give it a scored list of genes. So if you have access to the full differential expression results (i.e. log2FC's for every gene without filtering for significant or not), I would strongly urge you to consider that approach instead, which is likely to give you similar results to what you would've gotten from the overlap test but with much more reasonable statistics, as well as the other benefits of a "full" enrichment analysis approach, such as quantitative metrics of the magnitude of enrichment.
If you don't have access to that data; does that 1200 gene list contain both positive and negative effects? You might be able to run the positive and negative sides separately though the webtool. There is a little bit of flexibility in the 500 gene cap so a split list might make it through.
Let me know if neither of those approaches will work for you and perhaps we can come up with an alternative.
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
--
You received this message because you are subscribed to the Google Groups "gsea-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
gsea-help+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/gsea-help/20c6d259-b38d-4cb8-8b7f-41562761d416n%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/a3f4c131-fa7d-4b65-a1bd-b1dfdba3ffb1n%40googlegroups.com.