Hi Sarah,
The default metric that GSEA uses when you provide it a complete dataset to rank is the signal to noise ratio; the difference of group means divided by the sum of the group standard deviations. This metric is effectively the change scaled by the coherence of the change for each gene. When preranking your data we don’t really give a specific recommendation for how to do so as we can’t reasonably test.
Signing the -log pValue with the direction of the fold change will give you an enrichment result that is computed with respect to how confident you are in the changes for the genes in the set (i.e. sets with higher confidence compounded changes will have larger enrichment scores), computing on the basis of log fold change will give you an enrichment result that is computed with respect to magnitude of change (i.e. sets made up of genes with larger expression shifts will have larger enrichment scores). Ideally what you would want is a metric where sets with genes having larger expression changes are more enriched, but you would want to down-weight the sets where the change is a result of a low-confidence effect.
Either way, you wouldn’t want to apply a threshold, you would want to include genes both with non-significant p-values and genes that aren’t above some log2FC threshold, you would just want them to contribute minimally to the score.
Some people have suggested using a metric like log2(FC)*-log10(pValue), to get that “best of both worlds” metric when it isn’t possible to use GSEA’s internal signal-to-noise ratio ranking, but this isn’t something we’ve rigorously tested.
Does that make sense?
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
--
You received this message because you are subscribed to the Google Groups "gsea-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
gsea-help+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/gsea-help/eecf5ce7-f7ec-4538-982e-2f736b0beae6n%40googlegroups.com.
You received this message because you are subscribed to a topic in the Google Groups "gsea-help" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/gsea-help/p8aKtdzbx1M/unsubscribe.
To unsubscribe from this group and all its topics, send an email to gsea-help+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/SJ0PR05MB7609A6ED6AA7E25EA8ADEAD4F7719%40SJ0PR05MB7609.namprd05.prod.outlook.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/CAPVLg_JPt4pdtj0ECF7Mi3H_w5eZ%3D%2BvHKt4M32hcVn0zyyG%2BCw%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/CAGCeyZyWcMCC4Zp%3DQw0xiXR-naVvhd7YhM0iib9vd7cz5GjU_w%40mail.gmail.com.