--
You received this message because you are subscribed to the Google Groups "gsea-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gsea-help+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/f01c94f5-26e4-42cb-91e9-bcb20de9285bn%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/3743bbe1-67ac-4d0e-936b-39dcecb56836n%40googlegroups.com.
Hello,
Yeah, this kind of filtering is normal, typically genes that had zero and/or some extremely low number of UMIs across all samples were removed, it shouldn’t affect the outcome. This same kind of filtering is typically performed with bulk RNA-seq data as well. We just don’t want the list to be restricted to something like only “significant” features.
As for interpreting the results, the na_pos will reflect the upregulated/positive side of your list, and the na_neg will reflect the downregulated/negative side of your ranked list. Those names are just placeholders since we don’t know what the phenotypes are that the up and down sides of the enrichment actually represent.
Let me know if you have any more questions,
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/a1710712-d8f5-4745-b68c-246c3ee82eefn%40googlegroups.com.
Hello,
In my previous message I wes describing the case for a differentially expressed gene ranked list. But this is, in fact, a general case. na_pos will always be the gene sets enriched in the positive side of the ranked list, and na_neg will always be the gene sets enriched in the negative side of the ranked list regardless of the underlying nature of the experiment that led to the ranking. Conservation between your groups doesn’t factor into it unless it is a factor in how you ranked your genes. If your ranking is differential expression between two groups, then positive enrichment will be from upregulated genes and negative enrichment will be from downregulated genes.
With regard to using pValue; generally for pValues, you probably want to use the -log10 of the pvalue, this would allow the more significant pvalues to be assigned more weight than less significant ones (since GSEA uses the ranking metric to weight each step of the enrichment score calculation). Some people also have good results using the -log10(pvalue)*sign of the fold change. This later option allows you to get both positive and negative scores while still using the pValue to assign the magnitude of the weighting factor.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/4dcbffe3-73c7-4147-a101-d143ccdfab18n%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/edef8ff9-316e-4d64-8822-fc2a6016f48bn%40googlegroups.com.
Hello,
You would not want to use the adjusted p-value because of the very issue you observed. I would recommend calculating the transformation internally in R (since Seurat is R based) and then only writing out the transformed list and not using excel at all.
Running the up and down lists separately is almost never a good option for GSEA for the previously mentioned reason. Additionally, splitting the ranked list would not address the repeated values issue as the ones present on one side of the list don’t affect the ones present on the other side of the list.
Some repeats are simply to be expected, genes that changed identically (within the limits of our ability to detect them) will have identical values, this is just something that happens. But it might be possible to mitigate this somewhat by using the value of the log2(fc) including the sign, instead of just the sign alone in the transformation formula (as well as not using Excel to perform the calculation due to the rounding behavior).
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/fec50677-400f-4c87-b781-7356eb31ff0en%40googlegroups.com.