I figured it out. That was painful. In this youtube video, the woman explains that you have to save it as a tab delimited txt file and just change the ending to .gct.
https://www.youtube.com/watch?v=_zpH-DgE33U
This is confusing when you are likely editing it in excel.
This was my process for future folks working with RNAseq data (and if anyone has helpful critiques). It is not clear on the myriad of forums, gene pattern website or GSEA manual:
- upload raw counts
- convert to DGElist
- normalize data using edgeR (CalcNormFactors)
- save counts per million into a new dataframe; for example using a dgelist called "e"
> e <- as.data.frame(cpm(e))
- export as an excel file or table
- manually insert the two rows needed at the top of the data (first row, #1.2; second row is number of genes followed by number of samples)
- save as tab delimited .txt file but ensure the ending of the file is ".gct"