Select genes (EcoCyc IDs) on the basis of specific Gene Ontology terms / inspect the Gene Ontology tree


aless...@gmail.com

Jun 25, 2014, 7:18:38 AM
to ogg-d...@googlegroups.com
Hello everyone,
I am quite unfamiliar with Gene Ontology.
I have a list of E. coli genes of interest (about 1800 genes) in a Matlab file.
I need to select the genes that belong to specific biological functions, such as nutrient transport, stress defense, etc.
I know the biological functions, but I don't know exactly how my broad definitions map onto the Gene Ontology tree.
My idea was to use a program like OBO-Edit to better understand the parent-to-child structure of the Gene Ontology as it relates to E. coli,
in order to select the broadest GO term for each biological function and then use Matlab to identify the genes belonging to each category.
However, I have some doubts:
1) I downloaded the GO in OBO format from http://go.princeton.edu/GOTermMapper/goSlimFiles/gene_ontology.obo
but I don't know how to select only those terms that are relevant to E. coli. E.g. mitochondria-related terms should be excluded.
I also have the gene association file for E. coli in EcoCyc format. Is there an easy way to cross-reference the ontology with the E. coli genes in OBO-Edit?
Is there a better and/or easier program that can fulfill my needs?

Despite my struggles, I am really interested in understanding more about the Gene Ontology structure, and I am having some fun, but some help from you would be REALLY appreciated.

Thank you in advance for any suggestions, and for your patience!

Alessandro

Yongqun He

Jun 25, 2014, 3:25:11 PM
to aless...@gmail.com, ogg-d...@googlegroups.com
For each E. coli gene, OGG provides a GO association. For example,
take the E. coli gene accG:
http://purl.obolibrary.org/obo/OGG_3000947758
It has the annotation:
has GO association: GO_0042966

You can identify all GO associations for all E. coli genes. Then you can use the information for different applications.
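
One possible application, sketched below in Python, is the selection step asked about above: pick a broad term, collect everything beneath it in the ontology, and keep the genes annotated to any of those terms. This is only a minimal sketch under stated assumptions. The `TOY:` identifiers, the gene names and the helper functions are hypothetical placeholders, not real GO terms or OGG content, and only `is_a` links are followed (other relations such as `part_of` are ignored). Because the filter starts from the genes' own annotations, terms that no E. coli gene carries drop out without any extra step.

```python
from collections import defaultdict

# Toy ontology in OBO stanza syntax. The TOY: identifiers are placeholders,
# not real GO terms.
OBO_TEXT = """\
[Term]
id: TOY:0001
name: transport

[Term]
id: TOY:0002
name: nutrient transport
is_a: TOY:0001 ! transport

[Term]
id: TOY:0003
name: sugar transport
is_a: TOY:0002 ! nutrient transport

[Term]
id: TOY:0004
name: stress response
"""

# Hypothetical gene -> annotated terms table (in practice this would come
# from the gene association data).
GENE_TO_TERMS = {
    "geneA": ["TOY:0003"],
    "geneB": ["TOY:0004"],
    "geneC": ["TOY:0002", "TOY:0004"],
}


def parse_is_a(obo_text):
    """Return {term_id: set(parent_ids)} read from the [Term] stanzas."""
    parents = defaultdict(set)
    current = None
    in_term = False
    for raw in obo_text.splitlines():
        line = raw.strip()
        if line.startswith("["):
            # A new stanza starts; only [Term] stanzas are of interest.
            in_term = line == "[Term]"
            current = None
        elif in_term and line.startswith("id: "):
            current = line[4:].strip()
            parents[current]  # register the term even if it has no parent
        elif in_term and current and line.startswith("is_a: "):
            # Drop the trailing "! name" comment after the parent id.
            parents[current].add(line[6:].split("!")[0].strip())
    return parents


def descendants(parents, root):
    """Return root plus every term reachable from it through is_a links."""
    children = defaultdict(set)
    for child, parent_ids in parents.items():
        for parent in parent_ids:
            children[parent].add(child)
    found, stack = {root}, [root]
    while stack:
        for child in children[stack.pop()]:
            if child not in found:
                found.add(child)
                stack.append(child)
    return found


def genes_under(gene_to_terms, parents, root):
    """Return the sorted genes annotated to root or any of its descendants."""
    wanted = descendants(parents, root)
    return sorted(g for g, terms in gene_to_terms.items() if wanted & set(terms))


if __name__ == "__main__":
    print(genes_under(GENE_TO_TERMS, parse_is_a(OBO_TEXT), "TOY:0001"))
```

With the toy data, asking for `TOY:0001` keeps `geneA` and `geneC` and leaves out `geneB`, whose only annotation sits outside that branch.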

Please let me know if you have any questions.

Oliver He





Asiyah Yu Lin

Jun 25, 2014, 4:40:43 PM
to Yongqun He, aless...@gmail.com, ogg-d...@googlegroups.com
Hi Alessandro,

I extracted the E. coli genes from the OGG OWL file using OntoFox (http://ontofox.hegroup.org/). Please check the attached file (unzip it and you will see an OWL file).
Each gene is a class in OGG, and its GO association terms are annotated with the following tag:
<obo:OGG_0000000029></obo:OGG_0000000029>

It should be relatively straightforward to extract the information that you need for Matlab input.
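
A rough sketch of that extraction in Python, using only the standard library, could look like the following. The layout of the sample snippet is an assumption: it supposes each gene is an `owl:Class` whose `rdf:about` ends in the OGG ID, with an `rdfs:label` holding the gene name and the GO ID appearing somewhere in the text of the `obo:OGG_0000000029` element. The real file may differ, so `SAMPLE_OWL` and `gene_go_pairs` are illustrative names only. The gene, OGG ID and GO ID in the sample are the accG example given earlier in this thread.

```python
import re
import xml.etree.ElementTree as ET

# Standard namespace URIs for RDF/OWL plus the OBO PURL base.
NS = {
    "owl": "http://www.w3.org/2002/07/owl#",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "obo": "http://purl.obolibrary.org/obo/",
}

# Assumed layout of one gene class; the real OWL file may be structured differently.
SAMPLE_OWL = """\
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
         xmlns:owl="http://www.w3.org/2002/07/owl#"
         xmlns:obo="http://purl.obolibrary.org/obo/">
  <owl:Class rdf:about="http://purl.obolibrary.org/obo/OGG_3000947758">
    <rdfs:label>accG</rdfs:label>
    <obo:OGG_0000000029>GO_0042966</obo:OGG_0000000029>
  </owl:Class>
</rdf:RDF>
"""

# GO IDs written with an underscore, as in the thread (e.g. GO_0042966).
GO_ID = re.compile(r"GO_\d{7}")


def gene_go_pairs(owl_xml):
    """Return (ogg_id, gene_label, go_id) tuples found in the OWL text."""
    root = ET.fromstring(owl_xml)
    about = "{%s}about" % NS["rdf"]
    pairs = []
    for cls in root.findall("owl:Class", NS):
        # Keep only the last path segment of the class IRI, e.g. OGG_3000947758.
        ogg_id = cls.get(about, "").rsplit("/", 1)[-1]
        label = cls.findtext("rdfs:label", default="", namespaces=NS)
        for ann in cls.findall("obo:OGG_0000000029", NS):
            # Pull every GO ID out of the annotation text, whatever surrounds it.
            for go in GO_ID.findall(ann.text or ""):
                pairs.append((ogg_id, label, go.replace("_", ":")))
    return pairs


if __name__ == "__main__":
    # Tab-separated rows, one gene/GO pair per line.
    for row in gene_go_pairs(SAMPLE_OWL):
        print("\t".join(row))
```

Writing one gene/GO pair per tab-separated row keeps the output easy to import as a table. For the attached file, the string passed to `gene_go_pairs` would be the contents of the unzipped OWL file rather than the sample.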

Best,
Asiyah




EcoliGO_1.zip