Bernhard,
> Great Michael, thanks ... for putting together the memo :-) Here are my
> comments:
Thanks a bunch for the detailed comments!
> - One outcome of the discussions in Raleigh was that the notification of
> triple-level changes introduces quite some complexity for data providers
> because they need to maintain the history of triple level-changes until a
> certain point in the past. Keeping track of resource-level (entity-level)
> changes is less complex and has successfully been implemented by other
> protocols that deal with changes in data sets, such as the OAI-PMH protocol
> [1]. So if we want to keep the protocol simple and stupid we should take this
> into account. I know we already discussed that several times...but I want to
> raise that point again.
I tried to formulate an issue based on this comment but failed. May I ask
you to give it a try at [1]?
> - In the current version of the guide you refer to HTTP caching and memento
> for resource-level changes. To be honest, I don't get the point here. Can you
> explain?
Imagine a case where one deals with 10 resources from Dbpedia
(resource-level changes). Would I advise to use Dataset Dynamics? Certainly
not. One can do this with the mentioned technologies. Now, the case where
one uses several thousands resources, for example based on certain
categories in Wikipedia (dataset-level changes) - here it would make sense
to use the proposed mechanism.
Ok, or still need for clarification?
> - Thanks for the evset:ResourceChangeEvent hint. We will reconsider/fix this.
Cool, thanks.
> - for the update frequencies we should quantifiable values, i.e. numbers;
Is this covered with Issue 1 [2]? If not, can you add there?
I agree and proposed to use atom:updated [3], see Code 2. Maybe I should
make this more explicit?
> - a dady:lastUpdate (as subproperty of
http://purl.org/dc/terms/modified) is
> for the client the most valuable and for the data provider the easiest to
> implement meta-information it can offer
Not sure why this is needed. If you look at Fig 1. Dataset Dynamics overall
process there are essentially two metadata types involved: the
time-oblivious, statis voiD+dady and the time-sensitive, dynamic Atom+CS.
Where in this setup would you see the dady:lastUpdate taken place?
> - should we also consider the option to deliver the changesets embedded into
> the the atom feed? That would save lot's of HTTP requests...
Hm. Unsure about this. I guess only implementation experience can tell. I'd
typically expect the CS (as of Code 3.) to be quite large; maybe thousands
of triples in many resources would change. Delivering them by-value rather
than by-reference would indefinitely bloat the Atom feed and make it
unusable. A consumer should further first be able to understand if a certain
change is relevant for its case and not be forced to consume the entire big
feed.
Again, if you think this is a valuable requirement, please raise and issue
at [1].
> - any idea how we can get rid of SOME TITLE and SOME SUMMARY? We do not really
> need that info, right?
This is intentional. Rather than having this metadata in the CS, I propose
to have it in the Atom feed (same as with atom:updated vs cs:createdDate).
Cheers,
Michael
[1]
http://code.google.com/p/dady/issues/list
[2]
http://code.google.com/p/dady/issues/detail?id=1
[3]
http://tools.ietf.org/html/rfc4287#section-4.2.15