Hello,
For this, I would probably take the gene level TPM values for each of the bulk sequencing samples and format them into a GCT file. If you have multiple sorting runs resulting in multiple bulk samples you would add each one as it's own column to the GCT file.
I would take your signatures from the single cell clusters and format them into a GMX or GMT file.
You would then run ssGSEA on the GCT file using the signatures in the GMT/X file.
This would give you a score for each set for each sample (i.e. a Cluster 1 and a Cluster 2 score for each bulk sample)
The expectation would be that the Cluster 1 sorted cells score highly for the Cluster 1 signature and not the cluster 2 signature and vice versa.
If you have replicates you could then do something like a Wilcox test on the scores to get a pValue for each signature for the sorting.
The specifications for the file types I mentioned above are here: https://software.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats
Let me know if you have any other questions
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
--
You received this message because you are subscribed to the Google Groups "gsea-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
gsea-help+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/gsea-help/24f1dd82-8e0d-4c65-b0bf-b31993883b25n%40googlegroups.com.
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego