
Hi Barbora,
You should not replace “0” values with empty cells. Empty cells are “absent data” and a gene with absent data for a sample has that sample excluded from the gene’s differential expression calculation (black cells in the heatmap), a zero
for RNA-seq is a real value and that value will be included in the differential expression calculation.
Additionally, just to confirm, you did perform some sort of between-sample normalization on this data, correct? If not, the raw counts (including any quantified zeros) should be normalized using something like DESeq2 and then the matrix of normalized counts
used for GSEA. If you didn’t do this already and don’t know how to do this normalization manually, the DESeq2 module on GenePattern (https://cloud.genepattern.org/) or the DESeq2 function on Galaxy (https://usegalaxy.org/)
are both capable of generating this counts matrix output, although I’d personally recomeng GenePattern since the output is in GCT format that can be used as-is with GSEA.
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego

--
You received this message because you are subscribed to the Google Groups "gsea-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
gsea-help+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/gsea-help/ec4f71dc-20cc-494e-bbf6-c8bcabc59753n%40googlegroups.com.
You received this message because you are subscribed to a topic in the Google Groups "gsea-help" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/gsea-help/6qChbgDOB94/unsubscribe.
To unsubscribe from this group and all its topics, send an email to gsea-help+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/BYAPR05MB57828E0F3DDA15807CE4225CF7459%40BYAPR05MB5782.namprd05.prod.outlook.com.
Hi Barbora,
Commas are not an expected character in numeric strings in the data format. Is it possible your data is comma separated instead of tab separated? Both our .TXT and our .GCT specifications (https://software.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats#GCT:_Gene_Cluster_Text_file_format_.28.2A.gct.29)
expect tab delimited text.
If that isn’t the problem, please open your file in a plain text editor (notepad on a PC or TextEdit) and send me a screenshot of what it looks like. That should help us narrow down the problem.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/CAD0-5ibEVSJJ_d7xi3-hxWLvSS3ORYSQ%2Bt%3DRe3eD%3DZvf6KLavA%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/BYAPR05MB5782848EABF06552168D3A43F7459%40BYAPR05MB5782.namprd05.prod.outlook.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/CAFEo9XhNoXV-g%3D8DvG7tH__HUsLROe2q7Hctis4z59oHgvd%3Dww%40mail.gmail.com.
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/CAD0-5iZrgM_cdPUNRRaWwz7LdT0rp69Qs6T2JQehTX_LE0aM%2BQ%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/CAGCeyZyBD4Vq%3DWsN%3DVL4Tu%2BwH041tXBCCzDFB%3DyOFU9mwBxXVA%40mail.gmail.com.