Hi Jason,
I'm not 100% sure I understand what you're asking here.
The ranked list is computed with respect to the phenotypes you selected in your GSEA setup.
In an A vs B comparison, the ranking computation will be performed as A with respect to B such that if a gene is more highly expressed in A than B then it's ranking metric will be positive, and if it is more highly expressed in B than in A it's metric will be negative.
The GSEA algorithm (weighted K-S) is then used to determine if, on balance, the genes in a given set are more positive or negative, and thus enriched in A (on balance positive), or in B (on balance negative).
I wouldn't recommend using the top features of the heatmap directly for any specific analysis, you would want to run a standard differential expression analysis pipeline to compute per-gene significance statistics, which GSEA does not do.
We mostly provide these heatmaps and correlation statistics for data quality control assessment.
Does this answer your question?
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego