Dear all GSEA users,
I have RNAseq data and I want to use them for GSEA (2 groups and each group contains 3 replicates). I am not sure now which values (read counts, fpkm, TPM etc.) I have to upload in “Expression dataset” means to use for GSEA. I have read that it is possible to do with fpkm values which are provided directly by the RNAseq. Is that right or do I need a normalization, other values...?
Thank you for your help in advance
Hi Bernhard,
We do not recommend using FPKM for GSEA. These values are not properly normalized for “between-sample” differential expression calculations which is effectively what GSEA has to do to rank the genes. We recommend running something DESeq2, and extracting the normalized counts table that is produced during that analysis. The DESeq2 modules on both GenePattern.org and UseGalaxy.org produce this output (GenePattern does it by default, Galaxy would need “Output normalized counts table” toggled to True first). You can also use other methods of between-sample normalization such as TMM but the DESeq2 method is probably the easiest.
An additional note, when running GSEA on small groups such as a 3 vs 3 experiment. You’ll need to set the permutation type to “gene_set” instead of the default “phenotype”.
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
--
You received this message because you are subscribed to the Google Groups "gsea-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
gsea-help+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/gsea-help/41ab735b-29ae-407d-af5e-410fdbcafda5n%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/d7aa351d-bc9b-4377-95bb-711255555928n%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/630d8e82-c1db-4609-9a03-4573d1089544n%40googlegroups.com.
Hi Bernhard,
The DESeq2 module also performs a standard differential expression analysis and provides those results, but I don’t think it actually formats it as a RNK file, does it?
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/8f8f571d-4b37-4266-bc3d-5da0fc50819cn%40googlegroups.com.