@Karen: We are not dead set on adding `dateSubmitted` per se. Our main goal is having a metadata field that represents the thing we want, namely the date and time of the ingest of an event. We merely thought `dateSubmitted` might be a good fit for that. What do you think?
I also wasn't aware about the OC namespace thing. Does that allow us to define our own metadata fields in the dublincore.xml? If so, ditching `dc_dateSubmitted` and adding `oc_ingestDate` instead might be a good solution? But the constants you linked to in `[...]/DublinCores.java` are not used anywhere?
"result": {
"id": "e8060941-84e9-47a3-acb9-be17c4ac6792",
"mediapackage": {
"duration": 10722160,
"id": "e8060941-84e9-47a3-acb9-be17c4ac6792”, <— same Id as the event
"start": "2023-12-06T17:59:00Z”, <— potentially different start date than the event
},
"metadata": {
"catalog": [
{
"id": "f81f4d01-0b40-4f2b-8d5b-afbd3d3c6f70",
"type": "dublincore/episode”, <— contains the XML with additional content of the event, such as the temporary capture agent schedule time.
"mimetype": "text/xml",
},
"url": "https://example.edu/engage-player/e8060941-84e9-47a3/dab37fff-f1/episode.xml”
}]
}
"dcCreated": "2023-12-06T13:00:00-05:00”, <— Traditionally used to illustrate the start time of the class lecture, not the recording start. Student “show me the 10am class recording”.
"modified": "2023-08-29T11:37:04.505-04:00”, <— The date that the current version of this mediapackage was last published (for when you use merge publish strategy to update files and attributes)
"dcTitle": "J'Accuse”, <— The following are a subset of duplicate information from the episode catalog, also relevant to the mediapackage top level.
"dcCreator": "Donald Ostrowski",
"dcPublisher": "arafferty",
"dcContributor": "kking",
"dcSpatial": "1story-304",
"dcIsPartOf": 20240116935,
"dcType": "P15",
"dcType": "P15",…
.
<dcterms:type>P15</dcterms:type></dublincore>
Dear All
Welcome to the 12th episode of this topic, cf. the discussion at https://groups.google.com/a/opencast.org/g/dev/c/vuopMKcgJQc eight years ago when Matthias was “not yet very familiar with the Opencast code base”. That still holds true for me, so consider these comments non-technical.
I agree “created” and “startDate” should be allowed to hold different values - but not required to:
“created” refers to dcterms:Created and is the “Date of creation of the resource” where the resource is both the video and the content it shows, often a lecture (https://www.dublincore.org/specifications/dublin-core/dcmi-terms/terms/created/). This to me is a canonical date, at least for lecture recording.
“startDate” is not part of the DC terms (https://www.dublincore.org/specifications/dublin-core/dcmi-terms/) and was created to address differences between the time of a lecture and the actual recording time of a video, cf. mail attached. This to me is more to the technical side and required for scheduling, I think.
Things are different with uploads though: Here, the two values should be allowed to be both identical and different: There’s no reason to differentiate the two if the teacher uploads an explanatory video out of the blue. However, if she uploads a missing/failed lecture recording with one she did at home, she needs to backdate this for the video to have the correct date of the lecture held (dcterms:Created) in the context of the lecture series. Or antedate for a lecture they know she will be missing. So editing dcterms:Created should be allowed when uploading and the startDate becomes canonical.
In conclusion, I would argue for separating the two and using them for life cycle management as it fits your requirements: If you want to manage lecture recordings en masse use dcterms:Created. If you define a retention rule for uploads, use startDate and let users manipulate dcterms:Created to indicate the video is from a different date than the upload date. I’m not sure, but I think the life cycle management implemented gives us the freedom to do this.
Regards
Olaf A.
To unsubscribe from this group and stop receiving emails from it, send an email to users+un...@opencast.org.
Hi Olaf,
I don't have time to go into the details right now, but there are
actually three dates related to recordings: created, the
bibliographic start date (when the lecture happened/will happen)
and the technical start date (used by the capture agent for
scheduling, can differ from the bibliographic one for technical
reasons e.g. start five minutes early and end five minutes later).
If I remember correctly, the first two are part of the dublin core
catalog (though start date might be a custom term as Olaf said)
and displayed as metadata while the later is part of the schedule
information and not part of the catalog at all.
I'm not disagreeing that the usage in Opencast is occasionally nonsensical, especially where created is concerned, which is pretty useless as it is in my opinion. But this is a pretty complex topic and we need to be careful not to misrepresent the actual state of things.
And I would actually say things should be opposite from what Olaf said: The (bibliographic) start date should be when the lecture happened and can be changed (which is already the case), and created should be something that is set automatically e.g. on creation of the event and/or the actual ingest (here is where people usually disagree, I don't really have an opinion) and be read-only. It's possible this is at odds with the way created is defined by DC, but I think it's closer to the way Opencast handles things right now.
Cheers,
Katrin
Opencast-DevOps Teamlead elan e.V. Karlstr. 23 D-26123 Oldeburg elan-ev.de
Dear all
I agree here with Katrin. In short after an internal discussion how we (University of Bern) see things:
Cheers,
David Graf (UniBe)
Dear AllWelcome to the 12th episode of this topic, cf. the discussion athttps://groups.google.com/a/opencast.org/g/dev/c/vuopMKcgJQc eight years ago when Matthias was “not yet very familiar with the Opencast code base”. That still holds true for me, so consider these comments non-technical.
It’s pretty hard to continue this thread now that Martin’s request to differentiate two metadata fields for the sake of life cycle management has become a discussion on 10+ dates, but let’s try to find some common ground:
And that’s it already. Could people please comment on whether these are perspectives we share so we could move on differentiate existing datetime information along those lines? Other suggestions to structure this thread are welcome, of course.
O