Hello Nathan,
I’m messing a bit with the GO-Elite databases (more specific, the files: Ensembl.txt and Ensembl-Affymetrix.txt). My thought was to generate an annotation table that I could import into R to synchronize the Affymetrix/EnsEMBL annotation used in GO-Elite with my human data.
During this I found some oddities and I hope you can shed some light on them:
- For 207 Affymetrix IDs the GO-Elite tool is able to associate 170 of these unique IDs with an EnsEMBL ID coupled to 150 unique GO-processes. I’m running the latest GO-conversion tables, etc. Using my own table (derived from your databases), I can only associate 149 unique reporters with one or more EnsEMBL IDs. I’ve uploaded the files (input, denominator, mappfinder results, databases used) to http://ftp2.bigcat.unimaas.nl/~stan.gaj/goelite/bugs/102908/
- Regarding this, I thought there might have been an error in my script. During the debugging phase I noticed that the ‘Ensembl-Affymetrix”-table contained EnsEMBL IDs that were not present in the EnsEMBL.txt. I can’t seem to find a way to update the EnsEMBL database through the GO-Elite GUI, and will look in the documents if I can do it manually. A few examples of EnsEMBL IDs I can’t find:
ENSG00000197316 1555498_at
ENSG00000203964 1555547_at
ENSG00000165012 234686_at
If I look up those genes in the (current) EnsEMBL database I always found the following:
Ensembl
gene: ENSG00000165012
gene ENSG00000165012 is no longer in the Ensembl database
and it has not been mapped to any newer identifiers
Looking up the AffyIDs in EnsEMBL got me the following:
1555498_at ENSG00000196914
1555547_at No Hit
234686_at ENSG00000218045
For me, this clearly indicates that the EnsEMBL-Affymetrix file is somehow outdated. I’ve tried updating the Affymetrix databases through the GUI, but that part claims that there is no new annotation file available.
Any suggestions on how to proceed?
-- Stan
Hello Nathan,
I’m messing a bit with the GO-Elite databases (more specific, the files: Ensembl.txt and Ensembl-Affymetrix.txt). My thought was to generate an annotation table that I could import into R to synchronize the Affymetrix/EnsEMBL annotation used in GO-Elite with my human data.
During this I found some oddities and I hope you can shed some light on them:
- For 207 Affymetrix IDs the GO-Elite tool is able to associate 170 of these unique IDs with an EnsEMBL ID coupled to 150 unique GO-processes. I’m running the latest GO-conversion tables, etc. Using my own table (derived from your databases), I can only associate 149 unique reporters with one or more EnsEMBL IDs. I’ve uploaded the files (input, denominator, mappfinder results, databases used) to http://ftp2.bigcat.unimaas.nl/~stan.gaj/goelite/bugs/102908/
- Regarding this, I thought there might have been an error in my script. During the debugging phase I noticed that the ‘Ensembl-Affymetrix”-table contained EnsEMBL IDs that were not present in the EnsEMBL.txt. I can’t seem to find a way to update the EnsEMBL database through the GO-Elite GUI, and will look in the documents if I can do it manually. A few examples of EnsEMBL IDs I can’t find:
ENSG00000197316 1555498_at
ENSG00000203964 1555547_at
ENSG00000165012 234686_at
If I look up those genes in the (current) EnsEMBL database I always found the following:
Ensembl gene: ENSG00000165012 <http://www.ensembl.org/Homo_sapiens/idhistoryview?gene=ENSG00000165012>
gene ENSG00000165012 is no longer in the Ensembl database and it has not been mapped to any newer identifiers
Looking up the AffyIDs in EnsEMBL got me the following:
1555498_at ENSG00000196914 <http://www.ensembl.org/Homo_sapiens/geneview?gene=ENSG00000196914>
1555547_at No Hit
234686_at ENSG00000218045 <http://www.ensembl.org/Homo_sapiens/geneview?gene=ENSG00000218045>
Hi Nathan,
Yup. Your answer covers it all. I understood the cause of this, but I was not able to update the Ens-Affy table using the CSV Affymetrix updater. I didn’t know that you made GO-Elite also compatible with BioMart output (but come to think of it, I actually should have thought about that (-; ) and will give that approach a try.
Thanks for the quick reply!
-- Stan
Well, grabbing updates from WikiPathways works fine. But, if I select:
- Include Gene associations ... : YES
- Extract gene-GeneOntology information from Affy: YES
- Save options: Overwrite previous
- Build gene-MAPP associations from WikiPathways: NO
And species Homo Sapiens.
When I start the update I immediately get the message “Finished Parsing the Latest Affymetrix CSV Annotations” (within <1 sec). I don’t get the impression that it actually makes a connection to the affymetrix website (especially since both my firewall and TCPVIEW only show GO-Elite connect to something if I include the Wikipathways option). Is this normal behaviour? And no, GO-Elite is allowed to make internet connections (-;
Hi Nathan,
I started to figure that out while I was reading the database instructions. Updating the AFFX databases now!
Thanks for the quick replies!