I'm sending this email to the VOCAB Task Group contributors and also to
the RDF Task Group email list (if it still works) in order to solicit
advice.
During the expert review of the draft Standards Documentation Specification
https://github.com/tdwg/vocab/blob/master/documentation-specification.md
one of the reviewers made this comment:
"Regarding the use of dcat:Dataset in several of your examples, I do not
think that this is semantically correct to type a resource as being both
a dcat:Dataset and a skos:ConceptScheme or an owl:Ontology (e.g.
examples 4.5.4.1, 4.4.2.3, 4.4.1.1)"
In the examples cited by the reviewer, it is "term lists" that are typed
as dcat:Dataset. The reason was to enable the use of the property
dcat:distribution to link an abstract term list to the various forms in
which it might be distributed (e.g. html, turtle, json) as shown in Fig.
4 (Section 2.2.4). Because dcat:distribution has the domain
dcat:Dataset, making the link to the distributions using
dcat:distribution entails that the term list is a dcat:Dataset whether
we like it or not, so I stated that fact explicitly in the examples. I
should also note that because dcat:Dataset is rdfs:subClassOf
dctype:Dataset, it is also entailed that the term lists are dctype:Dataset.
The question is whether it is a problem when a term list (a
dcat:Dataset) might also be declared to be an owl:Ontology as is
recommended in section 4.4.2.2 (ontology) or if a vocabulary (a
dctype:Dataset) might also be a skos:ConceptScheme (suggested as a
possibility in 4.5.4). With respect to generating inconsistencies, none
of these classes are declared to be disjoint with each other in their
defining RDF. (skos:Concept is disjoint with skos:ConceptScheme but
that isn't an issue here.) So it seems to me that the only problem
would be if there were something "incompatible" in the human-readable
definitions.
I've complied the various definitions in this document:
https://github.com/tdwg/vocab/blob/master/dataset-related-definitions.md
The main requirement in the human readable definitions seems to be that
datasets contain "data", and it seems to me that the definition of
"data" would be fuzzy enough to include vocabulary terms regardless of
whether those terms were also part of an ontology or a concept scheme.
In fact, the definition of dctype:Dataset gives "list" as an example of
a dataset, and "subject heading lists" are listed as an example of a
skos:ConceptScheme. So there seems to be at least one clear example
(subject heading list) that could be considered both a dctype:Dataset
and a skos:ConceptScheme.
Anyway, I'm inclined to just disagree with the reviewer that the
multiple typing to which he/she objects is a problem - I can't see a
reason why it would be a problem. But I'd like to have some feedback
from some experts (you) before I take this position. Agreeing with the
reviewers objection would probably require using some term other than
dcat:distribution to make the link from term lists to distributions, and
it seemed to me that dcat:distribution was just the right well-known
term for that job. I can't think of a better alternative.
Steve
--
Steven J. Baskauf, Ph.D., Senior Lecturer
Vanderbilt University Dept. of Biological Sciences
postal mail address:
PMB 351634
Nashville, TN 37235-1634, U.S.A.
delivery address:
2125 Stevenson Center
1161 21st Ave., S.
Nashville, TN 37235
office: 2128 Stevenson Center
phone:
(615) 343-4582, fax:
(615) 322-4942
If you fax, please phone or email so that I will know to look for it.
http://bioimages.vanderbilt.edu
http://vanderbilt.edu/trees