Hi all, maybe you have an answer to my question: How could an HTML document declare itself to be a snapshot of some resource?
Memento is all about specifying such provenance information, but
it always expresses this information in HTTP headers. If I save a
web page, in my case using freeze-dry, I
would like to add this information to the file itself. Do you know
of standards for expressing this?
As a possible solution, would it make sense move the Memento HTTP headers into link and meta tags?:
<meta http-equiv="Memento-Datetime" content="Wed, 30 May 2007 18:47:52 GMT">
<link rel="original" href="https://example.org/">
Thinking further in this direction: would it make sense to allow more/all of the defined link relation types to be given in a <link> element, instead of an HTTP Link header? The spec seems not to mention any such practice.
Gerben
Hi all, maybe you have an answer to my question: How could an HTML document declare itself to be a snapshot of some resource?
Memento is all about specifying such provenance information, but it always expresses this information in HTTP headers. If I save a web page, in my case using freeze-dry, I would like to add this information to the file itself. Do you know of standards for expressing this?
As a possible solution, would it make sense move the Memento HTTP headers into link and meta tags?:
<meta http-equiv="Memento-Datetime" content="Wed, 30 May 2007 18:47:52 GMT">
Thinking further in this direction: would it make sense to allow more/all of the defined link relation types to be given in a <link> element, instead of an HTTP Link header? The spec seems not to mention any such practice.
--Gerben
---
You received this message because you are subscribed to the Google Groups "Memento Development" group.
To unsubscribe from this group and stop receiving emails from it, send an email to memento-dev+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Thanks for thinking along, Herbert & Sawood. Some follow-up
thoughts:
I agree with you it is preferable to leave pages unchanged by saving metadata separately. To clarify/justify my intended approach, the situation in question is one without any archiving-specific infrastructure: it's just snapshotting and saving an html page, to save it locally or host it anywhere. So the alternative is not storing any information at all, and then I suppose it is worth making some changes to add some metadata (I wish MHTML was more widely supported, it allows headers..). In my scenario large changes are made anyhow, e.g. subresources are inlined as data URLs (I guess the best possible outcome would be if those changes are mostly reversible; though I haven't thought that through).
I think I have not fully understood the worry about using unspec'd types for link and meta tags; I thought the web standards are designed to be extensible, allowing for such unknown values by simply ignoring them. I would be very surprised if an unknown link relation type or unknown meta http-equiv header name would upset any browser or other platform. Especially as for both of them, the HTML 5.2 spec refers to a wiki as the authority for their name registration (though the wiki for http-equiv values now appears defunct). I would be glad to hear if I may be overseeing some potential problem here.
In any case, link and meta tags to stick with Memento vocabulary was just a suggestion. I hoped to discover some existing practice of adding provenance metadata to pages, but if that seems not to exist perhaps I will just experiment with this approach for now. Sawood's suggestion to use a custom element is another interesting approach, though I do not understand the pros and cons of that well enough; I fear it would lack the possibility to standardise the tag name.
Gerben
PS I like the idea of Reworker; I did not know this was even
possible with ServiceWorkers. :)
To unsubscribe from this group and stop receiving emails from it, send an email to memento-dev...@googlegroups.com.