Hi Hexin,
The sample of your file that you sent appears fine to me. GSEA however, seems to be finding an “NA” value somewhere in one of the sample’s count values, but it isn’t saying exactly where in the error message which is unusual. I would check carefully for any “NA” strings other than those in the description field.
If you still can’t identify any problems, you can send your full data file to gsea...@broadinstitute.org, we’ll keep it confidential and only use it for debugging.
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
--
You received this message because you are subscribed to the Google Groups "gsea-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
gsea-help+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/gsea-help/ec7ee2fb-45d8-4cfc-892f-5bd85bc600c8n%40googlegroups.com.
Hi Hexin,
For RNA-seq data, NA’s is rather surprising, you might want to look back at the raw data and see where they might’ve come from. If they’re zeros in the original data, you should replace with a zero, but if they weren’t expressed I’m not sure why they would’ve been coded as “NA”. More likely they represent outliers or something else that caused them to be rejected by some sort of internal logic in whatever preproccessing you did on the data, in that case you should delete the “NA” (leave a blank cell if, for example, looking at the data in excel). That will cause GSEA to not consider a value for that sample for that gene in the internal differential expression calculation.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/4dfc527a-7529-4caf-b163-01f8e98c85dbn%40googlegroups.com.