Hello,
The error message you received, "col:X> matrix's fColCnt:X" is usually associated with there being more columns somewhere in the dataset than are defined by the dataset header. This can be caused by, for example, excel adding hidden empty columns. I'd recommend opening the dataset in a plain text editor and double checking for any extra tabs at the end of each row. I'd also suggest double checking the txt specification here: https://software.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats#TXT:_Text_file_format_for_expression_dataset_.28.2A.txt.29
If you're still having problems, you can share the dataset confidentially to gsea...@broadinstitute.org if you're willing to and we can take a look and see what might be going on here.
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
--
You received this message because you are subscribed to the Google Groups "gsea-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
gsea-help+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/gsea-help/a2c1a74c-800b-46ac-a337-1eef5950b52an%40googlegroups.com.
Hi Mohan,
If your dataset is using the identifiers in the first column of that probe-gene map file, then you'll need to set the "Collapse/Remap dataset" parameter to "Collapse" and select one of the MSigDB provided CHIP files from the "Chip platform" field. Based on the identifiers in the list, you should select the "Human_AFFY_HG_U133" chip.
Let me know if you have any further issues,
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
From:
'Mohan Liu' via gsea-help <gsea...@googlegroups.com>
Date: Friday, March 18, 2022 at 7:23 AM
To: gsea-help <gsea...@googlegroups.com>
Subject: Re: [gsea-help] Error: col:129 > matrix's fColCnt:129
Hi,
Thank you for your suggestion! I was able to fix this error.
Now I ran into another error: "None of the gene sets that you specified passed the size threshold."
Apparently the data contains all the genes listed in http://www.sciwhylab.org/gigeasa/human/probe-gene-map.txt.
And I couldn't find a geneset that contains all of them at http://www.gsea-msigdb.org/gsea/msigdb/search.jsp
Could you please provide any suggestion on this?
Thank you so much!
Best regards,
Mohan
On Thursday, March 17, 2022 at 3:15:47 PM UTC-4 Anthony Castanza wrote:
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/3e51b25d-615f-4c29-962e-eb64bdd63435n%40googlegroups.com.
Hi Mohan,
That is quite odd. How many genes are in your input data file? GSEA generally expects all expressed genes to be provided. If the dataset contains >10000 genes, then its perhaps there is some encoding issue with the file, like quoted columns or similar. Perhaps you could share a screenshot of the file opened in a text editor? Or we have the gsea...@broadinstitute.org address you could send the data file confidentially to.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/b7e21d76-8606-4797-b438-550ecf541dc4n%40googlegroups.com.
Hi Mohan,
Oh, I didn't realize you were using a custom gene set database file. If that database was retrieved from the same source as the probe-gene-map.txt file that you previously linked, you can use the file I've attached as the chip file instead, I've just taken the probe-gene-map.txt file and done some minor processing to make it comply with the CHIP format specification https://software.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats#CHIP:_Chip_file_format_.28.2A.chip.29
If the gene sets gmt you're using is not from the same source as the CHIP file, then what you'd need to do is manually reprocess the gene sets in the GMT to match the gene symbols used in either this chip file, which you'd then use for collapsing the dataset, or the MSigDB chip files and use the MSigDB chip file for collapsing the dataset.
I can't really judge from the screenshot what you'd need to do for the GMT because the formatting appears odd to me. Are you sure that is a GMT formatted gene sets file?
https://software.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats#GMT:_Gene_Matrix_Transposed_file_format_.28.2A.gmt.29
Hi Mohan,
This file isn't a gene set GMT, this is the Affymetrix probe mapping information table (similar to a chip file, but not with current gene annotations). A gene set gmt contains the collections of biological pathways or signatures that you want to evaluate using the GSEA methodology. MSigDB provides a number of these resources bundled with the GSEA application, but if you use one of them, we'd recommend using one of our provided CHIP files to do the gene mapping as well ( I believe I mentioned in my initial reply that your data matches the "Human_AFFY_HG_U133" chip).
That file does have gene ontology annotations associated with the probe to gene mappings though, if that is what you're interested in obtaining enrichment for, we do have the gene ontology database in the provided MSigDB gene sets database files, these start with "c4.go." in the file list, c5.go.v7.5.1.symbols.gmt contains the ontology annotation for GO Biological Process, GO Molecular Function, and GO Cellular Component all together, and then the c5.go.bp c5.go.mf, and c5.go.cc contain each of these subcomponents of GO separately if you were only interested in a specific part of GO.
Let me know if you have additional questions,
-Anthony
Anthony S. Castanza, PhD
Curator, Molecular Signatures Database
Mesirov Lab, Department of Medicine
University of California, San Diego
From:
'Mohan Liu' via gsea-help <gsea...@googlegroups.com>
Date: Monday, March 21, 2022 at 6:11 AM
To: gsea-help <gsea...@googlegroups.com>
Subject: Re: [gsea-help] Error: col:129 > matrix's fColCnt:129
--
You received this message because you are subscribed to the Google Groups "gsea-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
gsea-help+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gsea-help/9d87c3dd-0ae2-4aaa-b1be-510363a34c89n%40googlegroups.com.