------------------------------------------------------------------------------
"Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE
Instantly run your Selenium tests across 300+ browser/OS combos. Get
unparalleled scalability from the best Selenium testing platform available.
Simple to use. Nothing to install. Get started now for free."
http://p.sf.net/sfu/SauceLabs
_______________________________________________
Obo-discuss mailing list
Obo-d...@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/obo-discuss
Hi Erick,
Thanks for your questions. Here are my answers to your questions:
- any plans to include plant species (e.g. A thaliana)?
Answer: As you wish, I have just added the Arabidopsis thaliana one as an OGG subset:
- what's the source of the gene names? NCBI? HGBC?
Answer: It’s NCBI, primarily NCBI Gene database.
- what is the NEWENTRY ?: http://www.ontobee.org/browser/rdf.php?o=OGG&iri=http://purl.obolibrary.org/obo/OGG_3002829857
Answer: The source of the NEWENTRY definition is here:
- http://www.ncbi.nlm.gov/gene is not a valid URL
Answer: Thank you for finding this out. We have updated the URL now to:
http://www.ncbi.nlm.nih.gov/gene
Please let me know if you have any more comments, questions, and suggestions. Thanks!
Oliver
------------------------------------------------------------------------------
Hi Erick,
Yes. It’s possible to use the NCBI HomoloGene resource to construct the information for gene cross-link between species. We may try it later. Thanks for the suggestion.
For your three other questions:
- how often do you update OGG?
Answer: We will update OGG periodically, probably monthly? Later, we plan to label the date of every update in OGG as well.
- would it be possible to have the NCBI id's instead of the OGG ones ? (PURL)
Answer: It’s an alternative. We provided cross-linked NCBI IDs for the OGG terms when we can find NCBI IDs. However, for many OGG terms, we cannot find corresponding NCBI IDs, for example, ‘bacterial gene’ (or ‘gene of Bacteria’), ‘virus gene’ (or ‘gene of Viruses’), HIV protein-coding gene, protein-coding gene type, etc. Also, there may be political issues if we want to use NCBI IDs. For example, we may update our gene records later. If we use NCBI IDs, should we ask for permission every time we update OGG? What will happen if NCBI does not like our updates? Also, OGG genes cross-link to many other resources, some of which may not belong to NCBI resources. Anyway, I think we have designed a smart algorithm to manage the ontology IDs. Our program can automatically manage the IDs well. We will see how it works in the future.
- is there a way to get a local copy (VM image?) of Ontobee?
Answer: Sure. You are welcome to get a local copy of Ontobee. The Ontobee source is openly available (see the download information in our website). You can try it out and let us know if you need help.
Thanks!
Oliver
From: Erick Antezana [mailto:erick.a...@gmail.com]
Sent: Wednesday, April 30, 2014 3:18 PM
To: obo-d...@lists.sourceforge.net
Subject: Re: [Obo-discuss] OGG updates. RE: Change name to OGG from GGO.
On 30 April 2014 15:50, Asiyah Yu Lin <lini...@gmail.com> wrote:
Erick,
Homolog is about protein.
What relation you will propose for the genes of homologs?
Thanks,
Asiyah
I cannot catch all that Asiyah has said. Here I want to provide some clarification on those issues based on my understanding:
Yes, the genes in the NCBI Gene resource represents strain-specific genes. This is indeed what I want in my Vaccine Ontology and Brucellosis Ontology use cases. Those genes used in vaccines and mutant generations are also strain-specific. Strain-specific genes represent those genes belong to a strain, which can be cultured in different labs and described studies. Therefore, the strain-specific genes do represent gene entities or classes that are the abstract form of the instances of genes in the tubes or stocks in real physical labs.
A higher level of gene entities can be the homolog genes that are shared by different strains or even different species. The NCBI HomoloGene resource records the information of many such genes among eukaryotic organisms. The homolog genes are useful for various studies as Erick also indicated in an earlier email. OGG may represent such homolog genes later.
So in my mind, OGG is not generated to represent the data in the NCBI Gene resource per se. It is generated to represent the gene entities as indicated above. We don’t want to have a situation where there is one “ontology” representing the NCBI Gene resource data, and there is another “ontology” representing the Ensembl Gene resource data. As long as they represent the same gene entities for the same strain (or homolog genes among strains/species), they should be represented using the same ontology terms (in OBO Foundry). According to the ontological realism principle, ontology terms represent the entities in reality. In comparison, a resource data represent the data of entities stored in the resource. They are different. This is another reason why I don’t want to use the NCBI Gene namespace per se to represent the gene entities at the class level. Ontology terms do have difference from the resource data items at the philosophical level.
Oliver