Well, right now it is based on the Open World Assumption, which means
that the fact that you don't have access to something (or don't know
that it exists) doesn't mean that it doesn't exist. However, it is true
that we could gain
by adding one attribute to the description of the datasets: something
like "accessibleAt" or "instantiatedAt", or something similar, which
would link a dataset description to one or multiple files that
instantiate the description of the records that belong to this dataset
(important: without changing the ID of that dataset).
So, we could end up with something like:
"dataset": {
    "id": "http://people.bibkn.org/wsf/datasets/106/",
    "prefLabel": "Oberwolfach Photo Collection",
    "description": "Oberwolfach Photo Collection",
    "prefURL": "http://owpdb.mfo.de/",
    "instantiatedAt": [
        "http://people.bibkn.org/drupal/data/oberwolfach/A.bibjson",
        "http://people.bibkn.org/drupal/data/oberwolfach/B.bibjson",
        "http://people.bibkn.org/drupal/data/oberwolfach/C.bibjson",
        "http://people.bibkn.org/drupal/data/oberwolfach/D.bibjson"
    ],
    "schema": "http://www.bibkn.org/drupal/bibjson/bibjson_schema.json",
    "linkage": [
        "http://www.bibkn.org/drupal/bibjson/oberwolfach_linkage.json",
        "http://www.bibkn.org/drupal/bibjson/iron_linkage.json"
    ]
},
This new attribute (instantiatedAt) has to be seen as a convenient way
to describe information about a dataset.
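To give an idea of how a consumer could read it, here is a rough Python
sketch. It assumes that the values of "instantiatedAt" are gathered in
a JSON array (Python's json module, for one, keeps only the last value
when the same key is repeated in an object). The function name
slice_locations is only for illustration, and the example is trimmed to
two files.

```python
import json


def slice_locations(description):
    """Return the file URLs listed under "instantiatedAt" (illustrative helper)."""
    dataset = description["dataset"]
    value = dataset.get("instantiatedAt", [])
    # Assumption: accept a single string as well as an array of strings.
    return [value] if isinstance(value, str) else list(value)


# Trimmed version of the dataset description, wrapped in an object so
# that it parses as a complete JSON document.
EXAMPLE = json.loads("""
{
  "dataset": {
    "id": "http://people.bibkn.org/wsf/datasets/106/",
    "instantiatedAt": [
      "http://people.bibkn.org/drupal/data/oberwolfach/A.bibjson",
      "http://people.bibkn.org/drupal/data/oberwolfach/B.bibjson"
    ]
  }
}
""")

for url in slice_locations(EXAMPLE):
    print(url)
```

The dataset ID stays the same; only the list of locations tells you
where the records can be fetched.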
Is this what you meant?
Thanks!
Take care,
Fred
> Your "instantiatedAt" proposal would get rid of this problem, although
> it could make for difficult to maintain datasets, since every file in
> a dataset would have to be updated if a single file were added or
> removed. I would prefer to see each file in the dataset point towards
> one common file which would in turn point towards each of the
> segments. Perhaps we could use the existing "metaFile" attribute in
> each segment, with each "metaFile" pointing towards a single file
> which would contain, in addition to other dataset metadata, a list of
> segments using the "instantiatedAt" attribute.
I am glad that you read the spec carefully! I think this would be a good
use of the dataset metaFiles. In fact, you would still need an
attribute such as "instantiatedAt" to let this metaFile point to all
the files. So, we would have something like this:
Dataset A, file 1:
==================
{
    "dataset": {
        "id": "http://dataset/a/",
        "metaFile": "http://dataset/a/metafile.bibjson"
    }
}
==================
Dataset A, file 2:
==================
{
    "dataset": {
        "id": "http://dataset/a/",
        "metaFile": "http://dataset/a/metafile.bibjson"
    }
}
==================
Dataset A, Metafile:
==================
{
    "dataset": {
        "id": "http://dataset/a/",
        "instantiatedAt": [
            "http://dataset/a/datasetA_1.bibjson",
            "http://dataset/a/datasetA_2.bibjson"
        ]
    }
}
==================
Note: the dataset ID should not be confused with the location of a
dataset slice file.
So, as you suggest, you only have to maintain the meta file as the
dataset grows, instead of all the dataset files, which greatly
simplifies the task. This is certainly something I would suggest doing,
but I would look for a different attribute name than "instantiatedAt".
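To make the lookup concrete, here is a rough Python sketch of how a
consumer could go from any slice to the full list of slices through the
metaFile. The FILES table and the fetch helper are hypothetical
stand-ins for real HTTP requests, resolve_slices is an illustrative
name, and I assume the metafile lists its slices in a JSON array.

```python
import json

META_URL = "http://dataset/a/metafile.bibjson"
SLICE_1 = "http://dataset/a/datasetA_1.bibjson"
SLICE_2 = "http://dataset/a/datasetA_2.bibjson"

# Hypothetical in-memory stand-in for the web: URL -> file content.
FILES = {
    SLICE_1: json.dumps(
        {"dataset": {"id": "http://dataset/a/", "metaFile": META_URL}}
    ),
    SLICE_2: json.dumps(
        {"dataset": {"id": "http://dataset/a/", "metaFile": META_URL}}
    ),
    META_URL: json.dumps(
        {"dataset": {"id": "http://dataset/a/",
                     "instantiatedAt": [SLICE_1, SLICE_2]}}
    ),
}


def fetch(url):
    """Illustrative replacement for an HTTP GET followed by JSON parsing."""
    return json.loads(FILES[url])


def resolve_slices(slice_url):
    """Follow a slice's metaFile and return every slice of its dataset."""
    dataset = fetch(slice_url)["dataset"]
    meta = fetch(dataset["metaFile"])["dataset"]
    # The dataset ID identifies the dataset; it is not a file location.
    if meta["id"] != dataset["id"]:
        raise ValueError("metaFile describes a different dataset")
    locations = meta.get("instantiatedAt", [])
    return [locations] if isinstance(locations, str) else list(locations)


print(resolve_slices(SLICE_2))
```

Adding a third slice would then mean publishing the new file and adding
one entry to the metafile; the existing slices are left untouched.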
Is this what you had in mind?
Thanks!
Take care,
Fred