Some triples in the irefindex namespace are being generated to the
geneid namespace when they should be pointing at the symbol namespace.
The predicates in question are [1] and [2], for example on the uri
[3]. In the example, http://bio2rdf.org/geneid:MAGEA4 should be
http://bio2rdf.org/symbol:MAGEA4
The rdfiser script [4] assumes that irefindex will always have the
following mapping 'entrez gene/locuslink' => 'geneid', and that works
for the other predicates, but the irefindex database fails to
distinguish the numeric identifiers from the textual symbols that we
put in "symbol:", which is based mostly on HGNC.
I can write a sparql rule to fix the error dynamically in the webapp
if necessary, but it would be best to fix it in the rdfiser so that
users of the sparql endpoint are not confused about the difference
between the endpoint and the webapp.
On Wed, Jul 4, 2012 at 9:55 PM, Peter Ansell <ansell.pe...@gmail.com> wrote:
> Some triples in the irefindex namespace are being generated to the
> geneid namespace when they should be pointing at the symbol namespace.
> The predicates in question are [1] and [2], for example on the uri
> [3]. In the example, http://bio2rdf.org/geneid:MAGEA4 should be
> http://bio2rdf.org/symbol:MAGEA4
> The rdfiser script [4] assumes that irefindex will always have the
> following mapping 'entrez gene/locuslink' => 'geneid', and that works
> for the other predicates, but the irefindex database fails to
> distinguish the numeric identifiers from the textual symbols that we
> put in "symbol:", which is based mostly on HGNC.
> I can write a sparql rule to fix the error dynamically in the webapp
> if necessary, but it would be best to fix it in the rdfiser so that
> users of the sparql endpoint are not confused about the difference
> between the endpoint and the webapp.
> --
> You received this message because you are subscribed to the Google Groups
> "bio2rdf" group.
> To post to this group, send email to bio2rdf@googlegroups.com.
> To unsubscribe from this group, send email to
> bio2rdf+unsubscribe@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/bio2rdf?hl=en.
-- Michel Dumontier
Associate Professor of Bioinformatics, Carleton University
Chair, W3C Semantic Web for Health Care and the Life Sciences Interest Group
http://dumontierlab.com
> so, are we using the "symbol" or "hgnc" namespace?
> m.
> On Wed, Jul 4, 2012 at 9:55 PM, Peter Ansell <ansell.pe...@gmail.com> wrote:
>> Some triples in the irefindex namespace are being generated to the
>> geneid namespace when they should be pointing at the symbol namespace.
>> The predicates in question are [1] and [2], for example on the uri
>> [3]. In the example, http://bio2rdf.org/geneid:MAGEA4 should be
>> http://bio2rdf.org/symbol:MAGEA4
>> The rdfiser script [4] assumes that irefindex will always have the
>> following mapping 'entrez gene/locuslink' => 'geneid', and that works
>> for the other predicates, but the irefindex database fails to
>> distinguish the numeric identifiers from the textual symbols that we
>> put in "symbol:", which is based mostly on HGNC.
>> I can write a sparql rule to fix the error dynamically in the webapp
>> if necessary, but it would be best to fix it in the rdfiser so that
>> users of the sparql endpoint are not confused about the difference
>> between the endpoint and the webapp.
>> --
>> You received this message because you are subscribed to the Google Groups
>> "bio2rdf" group.
>> To post to this group, send email to bio2rdf@googlegroups.com.
>> To unsubscribe from this group, send email to
>> bio2rdf+unsubscribe@googlegroups.com.
>> For more options, visit this group at
>> http://groups.google.com/group/bio2rdf?hl=en.
> --
> Michel Dumontier
> Associate Professor of Bioinformatics, Carleton University
> Chair, W3C Semantic Web for Health Care and the Life Sciences Interest Group
> http://dumontierlab.com
> --
> You received this message because you are subscribed to the Google Groups
> "bio2rdf" group.
> To post to this group, send email to bio2rdf@googlegroups.com.
> To unsubscribe from this group, send email to
> bio2rdf+unsubscribe@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/bio2rdf?hl=en.
On Thu, Jul 5, 2012 at 4:43 PM, Peter Ansell <ansell.pe...@gmail.com> wrote:
> hgnc contains the numeric identifiers from hgnc and symbol has
> contained the textual gene symbols.
> Peter
> On 6 July 2012 06:38, Michel Dumontier <michel.dumont...@gmail.com> wrote:
> > so, are we using the "symbol" or "hgnc" namespace?
> > m.
> > On Wed, Jul 4, 2012 at 9:55 PM, Peter Ansell <ansell.pe...@gmail.com>
> wrote:
> >> Some triples in the irefindex namespace are being generated to the
> >> geneid namespace when they should be pointing at the symbol namespace.
> >> The predicates in question are [1] and [2], for example on the uri
> >> [3]. In the example, http://bio2rdf.org/geneid:MAGEA4 should be
> >> http://bio2rdf.org/symbol:MAGEA4
> >> The rdfiser script [4] assumes that irefindex will always have the
> >> following mapping 'entrez gene/locuslink' => 'geneid', and that works
> >> for the other predicates, but the irefindex database fails to
> >> distinguish the numeric identifiers from the textual symbols that we
> >> put in "symbol:", which is based mostly on HGNC.
> >> I can write a sparql rule to fix the error dynamically in the webapp
> >> if necessary, but it would be best to fix it in the rdfiser so that
> >> users of the sparql endpoint are not confused about the difference
> >> between the endpoint and the webapp.
> >> --
> >> You received this message because you are subscribed to the Google
> Groups
> >> "bio2rdf" group.
> >> To post to this group, send email to bio2rdf@googlegroups.com.
> >> To unsubscribe from this group, send email to
> >> bio2rdf+unsubscribe@googlegroups.com.
> >> For more options, visit this group at
> >> http://groups.google.com/group/bio2rdf?hl=en.
> > --
> > Michel Dumontier
> > Associate Professor of Bioinformatics, Carleton University
> > Chair, W3C Semantic Web for Health Care and the Life Sciences Interest
> Group
> > http://dumontierlab.com
> > --
> > You received this message because you are subscribed to the Google Groups
> > "bio2rdf" group.
> > To post to this group, send email to bio2rdf@googlegroups.com.
> > To unsubscribe from this group, send email to
> > bio2rdf+unsubscribe@googlegroups.com.
> > For more options, visit this group at
> > http://groups.google.com/group/bio2rdf?hl=en.
> --
> You received this message because you are subscribed to the Google Groups
> "bio2rdf" group.
> To post to this group, send email to bio2rdf@googlegroups.com.
> To unsubscribe from this group, send email to
> bio2rdf+unsubscribe@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/bio2rdf?hl=en.
-- Michel Dumontier
Associate Professor of Bioinformatics, Carleton University
Chair, W3C Semantic Web for Health Care and the Life Sciences Interest Group
http://dumontierlab.com
On Thursday, 5 July 2012 03:55:39 UTC+2, Peter Ansell wrote:
> Some triples in the irefindex namespace are being generated to the > geneid namespace when they should be pointing at the symbol namespace. > The predicates in question are [1] and [2], for example on the uri > [3]. In the example, http://bio2rdf.org/geneid:MAGEA4 should be > http://bio2rdf.org/symbol:MAGEA4
I have to admit that I'm not following RDF-based work and I'm therefore not familiar with the various namespaces in use, but as someone currently responsible for the iRefIndex data, should I make sure that gene symbols use the hgnc prefix corresponding to the MI:1095 controlled vocabulary term in the PSI-MI TAB output?
> I have to admit that I'm not following RDF-based work and I'm therefore
> not familiar with the various namespaces in use, but as someone currently
> responsible for the iRefIndex data, should I make sure that gene symbols
> use the hgnc prefix corresponding to the MI:1095 controlled vocabulary term
> in the PSI-MI TAB output?
> To post to this group, send email to bio2rdf@googlegroups.com.
> To unsubscribe from this group, send email to
> bio2rdf+unsubscribe@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/bio2rdf?hl=en.
-- Michel Dumontier
Associate Professor of Bioinformatics, Carleton University
Chair, W3C Semantic Web for Health Care and the Life Sciences Interest Group
http://dumontierlab.com
In the data we produce, that term won't actually get used, but the symbols would be prefixed with hgnc as the "database name" (corresponding to the "preferred name" in the ontology term entry) as described in the PSI-MI TAB specification:
> In the data we produce, that term won't actually get used, but the symbols
> would be prefixed with hgnc as the "database name" (corresponding to the
> "preferred name" in the ontology term entry) as described in the PSI-MI TAB
> specification:
> --
> You received this message because you are subscribed to the Google Groups
> "bio2rdf" group.
> To post to this group, send email to bio2rdf@googlegroups.com.
> To unsubscribe from this group, send email to bio2rdf+unsubscribe@**
> googlegroups.com <bio2rdf%2Bunsubscribe@googlegroups.com>.
> For more options, visit this group at http://groups.google.com/** > group/bio2rdf?hl=en <http://groups.google.com/group/bio2rdf?hl=en>.
-- Michel Dumontier
Associate Professor of Bioinformatics, Carleton University
Chair, W3C Semantic Web for Health Care and the Life Sciences Interest Group
http://dumontierlab.com