Hi Nolan,
I'm no expert on neurolex - I don't understand the mapping to RDF.
However, before deleting anything you might want to check that these are
truly duplicates.
These appear to be aliases for the same underlying database entry:
http://neurolex.org/wiki/Nlx_137244
http://neurolex.org/wiki/Category:Z_offset_in_Slide_Coordinate_System
Click on 'history' to see what I mean.
You are indeed seeing two separate URIs in the RDF dump of neurolex.
This appears to be a 'feature' of neurolex (or SMWs in general?). Two
(or more) URIs are generated for each entry: one with a URI fragment
that starts "Category-3A" another that uses a Nlx or Birnlex Id. These
are connected via an owl:sameAs axiom.
I have encountered this when attempting to use the RDF dump or SPARQL
interface to perform operations over the content of neurolex. Personally
I find it easier to first merge over owl:sameAs before attempting
further processing. I would prefer it if neurolex provided a native form
of RDF that only used a single URI for each concept.
Neurolex gurus, let me know if I've misunderstood something.
Here's another example, two URIs for the cerebellum concept:
<owl:Class rdf:about="&wiki;Category-3ACerebellum">
<rdfs:label>Cerebellum</rdfs:label>
<swivt:page rdf:resource="&wikiurl;Category:Cerebellum"/>
<rdfs:isDefinedBy
rdf:resource="&wikiurl;Special:ExportRDF/Category:Cerebellum"/>
<property:Authors
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">Joseph
Altman</property:Authors>
<property:Authors
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">Shirley Ann
Bayer</property:Authors>
<property:Created
rdf:datatype="
http://www.w3.org/2001/XMLSchema#dateTime">2006-07-15T00:00:00</property:Created>
<property:CurationStatus
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">uncurated</property:CurationStatus>
<property:Definition
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">Part of the
rhombencephalon that lies in the posterior cranial fossa behind the
brain stem, consisting of the cerebellar cortex, deep cerebellar nuclei
and cerebellar white matter.
A portion of the brain that helps regulate posture, balance, and
coordination. (NIDA Media Guide Glossary)</property:Definition>
<property:ISBN
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">0849394902</property:ISBN>
<property:Id
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">birnlex_1489</property:Id>
…
<swivt:Subject rdf:about="&wiki;Birnlex_1489">
<rdfs:label>Birnlex 1489</rdfs:label>
<swivt:page rdf:resource="&wikiurl;Birnlex_1489"/>
<rdfs:isDefinedBy
rdf:resource="&wikiurl;Special:ExportRDF/Birnlex_1489"/>
<rdf:type rdf:resource="&wiki;Category-3ARegional_part_of_brain"/>
<property:Authors
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">Joseph
Altman</property:Authors>
<property:Authors
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">Shirley Ann
Bayer</property:Authors>
<property:Created
rdf:datatype="
http://www.w3.org/2001/XMLSchema#dateTime">2006-07-15T00:00:00</property:Created>
<property:CurationStatus
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">uncurated</property:CurationStatus>
<property:Definition
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">Part of the
rhombencephalon that lies in the posterior cranial fossa behind the
brain stem, consisting of the cerebellar cortex, deep cerebellar nuclei
and cerebellar white matter.
A portion of the brain that helps regulate posture, balance, and
coordination. (NIDA Media Guide Glossary)</property:Definition>
<property:ISBN
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">0849394902</property:ISBN>
<property:Id
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">birnlex_1489</property:Id>
<property:Is_part_of rdf:resource="&wiki;Category-3AHindbrain"/>
<property:Label
rdf:datatype="
http://www.w3.org/2001/XMLSchema#string">Cerebellum</property:Label>
<owl:sameAs rdf:resource="&wiki;Category-3ACerebellum"/>
…
>>> email to
neurolex+u...@googlegroups.com <javascript:>.