Hey Nathan,
While manually updating my current EnsEMBL relationship databases, I came across a few changes in EnsEMBL that might need to be adressed:
- Changes on webpage:
· You are now no longer able to select the one ‘GO ID’ box under the ‘EXTERNAL’ parameter in BioMART. This has been split in three seperate ‘GO ID’ boxes, each corresponding to a main GO category. This implies that you have to download 3 seperate files that you need to merge into one!!
· “External Gene Symbol” under the ‘Attributes’ tag is now renamed to “Associated Gene Name”
o Textual changes should be made at ‘Method B’ on the website.
- The layout of the EnsEMBL table (/databases/<species>/gene/Ensembl.txt) may change if you export it directly from BioMART (column order):
GO-Elite v1.19: ID Symbol Description Chromosome Species Date Remarks
build 48: Ensembl Gene ID Description External Gene ID
build 50: Ensembl Gene ID Description Associated Gene Name
If this is not done in a proper way, the final output file will be messy (i.e. using descriptions instead of gene names - confirmed). This can be easily fixed by explaining that within BioMART the fields have to be selected in a specific order (thus point people out that they have to check first the gene name box and then the description box) and then all will be fine (:
Next to this, I’m still not able to create the EnsEMBL-MAPP.txt using the ‘Affymetrix CSV’ update window. It works perfectly when I put in the Affymetrix CSV file in the proper BuildDB directory. But doing this overwrites the Ensembl-Affymetrix relationship table and I only want to include the relationships from BioMART in my database (thus keep the old file). Although this is part of the local pathway analysis, and no high priority, I would love to know how I can recreate those gene-MAPP files without using the Affymetrix CSV (I don’t understand why this file is needed for that anyhow).
Best wishes,
-- Stan
Yes, I thought so too. The CSV updater is initiated with the following options:
* Species: Homo Sapiens
* Include gene associations from gene and uid-gene files in current directories: YES
* Extract gene-GeneOntology information from Affymetrix: NO
* Save options: OVERWRITE PREVIOUS
* Build Gene-MAPP associations from WikiPathways: YES
The EnsEMBL-Affy relationship table has been exported from BioMART and I would like to use that (since this has been updated more recently than the CSV on the Affy website). So for this exercise there is no CSV file present in my /BuildDBs/-dir, but we do have a fresh BioMART table in the /uid-gene/-dir.
After selecting the above options, a wikipathways TAB file is downloaded and then I get the following: http://ftp2.bigcat.unimaas.nl/~stan.gaj/goelite/screens/Screen01.jpg
-- Stan
Hey Nathan,
I was not aware about the EnsEMBL mapping issues and didn’t know WikiPathways supplies only one identifier (I always thought it was backed up with a gene database containing ID crosslinking). Thanks for bringing these points up, since it clarifies a lot!
So, from the official GO-elite’s point of view, the Affy CSV file is used as ‘highly recommended’ input then? If so, then I’d better stick to that as well and trust your judgement on this (since I don’t really have that much time to dig in the details here).
Thank you again for the wonderfull support after my waves of daily spam (:
Oh, and no. The screenshot was taken under Windows, but I do use GO-Elite under Ubuntu 6.06 LTS, and so I’ve encountered no real problems! Usually, if I come across any bug on my notebook (Windows), I try to reproduce it on the Linux system (workstation).