All,
We’ve hesitated to publish deletes to the LR for resources it contains for a few reasons. The big one is, while it may be a valuable resource to the educational community at large, it may not meet the criteria for our audience. It wouldn’t be accurate, or appropriate, for us to assert that the resource is no longer valid when it simply doesn’t meet our criteria, so we would not be publishing those as deletes to the LR.
The reasons I’ve come up with so far for deletes are:
· Not Found
o This includes 404 errors as well as error pages which technically return a status of 200 OK but have a friendly error message indicating the resource is not found.
· Domain for Sale
o Obviously if the domain is for sale, the resource is dead.
We have run into situations where the resource leads to a page which has nothing to do with education, although the page programmatically returns 200 OK. While the end result in our system is the same, I’m not sure if the assertion should be published as a “delete” or if we should publish it with some other verb:
· Inappropriate
o Contains text which makes it inappropriate for the education community as a whole. For example, an online gaming site. It may very well be that the resource was legitimate at one time, but someone else may have purchased the domain and set up a page which is not appropriate for the LR community.
I’d like to start some discussion on how best to publish assertions for this situation so that we can come up with a standard way of handling it. This way everyone understands the assertion and can apply their business logic to it. What we’re looking at here is essentially the opposite of the “endorse” verb. What standard verb would be appropriate in this situation, so the most people can understand it?
Jerome Grimmer
Southern Illinois University Carbondale
2450 Foundation Drive Suite 100
Springfield, IL
Phone: 217-786-3010 ext. 5857
Toll-free: 1-800-252-4822 ext. 5857
NOTE: My E-mail address has changed
This email sent using 100% recycled electrons.
--
You received this message because you are subscribed to the Google Groups "Learning Registry Developers List" group.
To unsubscribe from this group and stop receiving emails from it, send an email to learningreg-d...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Illinois has found that there are resources which were published to the Learning Registry which we have found through our automated link checker are no longer available. In some cases, a request to the URL returns a 404 (Not Found) error; in other cases we’ve determined that the request is being redirected to a human-friendly error page, again indicating that the resource is no longer available. In a high percentage of cases, these are about resources that Illinois *did not* publish the metadata about. We’re simply wanting to let the LR community know that this resource is, in our experience, not available any more.
In the third scenario (the “inappropriate” one), Illinois wants to let the LR community know that this resource may not be appropriate for the education community as a whole. For example, let’s suppose that NSDL submitted a resource which at one time was valid. Now, however, the domain is owned by an online casino, and they’ve programmed their website to redirect any 404 errors to their home page. Programmatically, the URL is technically still good, as it probably redirects with a 302 Found status, but the resource is no longer appropriate for the education community. In such a case, I think the “disavow” paradata assertion might work well enough; it can be up to the consumer to decide if they want to include the resource in their system. For this, I’d think the LR would want to see a standard recipe for this paradata assertion. If everyone follows the standardized format of the assertion, it makes it easier for everyone to understand and handle as they see fit, instead of having one group do it with one verb, and another group do it with another verb.
There’s a fourth scenario that I haven’t even got into yet; it is likely that Illinois would not even publish paradata back to the LR for. For example, we found lots of resources published by the American Association of Physics Teachers. These are mostly research papers, and would definitely be valuable to certain segments of the education community; however we might want to remove them because they are resources for which there is a cost to access. Illinois’ system is about open education resources, as well as free ones which might not be open, but when we encounter a site with nothing but resources for sale, we tend to exclude it from our index. We would likely NOT publish paradata assertions about these back to the LR; we’d simply remove them from our system.
Again, in a high percentage of cases, the paradata assertions we’d be publishing back would NOT be about resources that we published metadata for, as we could simply delete those easily enough with the replaces API already in place.
Jerome Grimmer
Southern Illinois University Carbondale
2450 Foundation Drive Suite 100
Springfield, IL
Phone: 217-786-3010 ext. 5857
Toll-free: 1-800-252-4822 ext. 5857
NOTE: My E-mail address has changed
This email sent using 100% recycled electrons.
From: learnin...@googlegroups.com [mailto:learnin...@googlegroups.com]
On Behalf Of Steve Midgley
Sent: Monday, March 07, 2016 3:19 PM
To: learnin...@googlegroups.com
Subject: Re: [learningreg-dev] Publishing assertions for deletes vs. other reasons
Thanks Abraham. I think your point about not overloading the http verbs is very strong. I tend to make things too compact.
It raises the opportunity to create endpoints/apis for handling specialized metadata, which I think a "delete resource" request is. It's basically paradata, similar to "bookmarked" or "favorited."
So possibly we just need to start defining some new "paradata publish" interfaces for the system, to make it much easier to submit standardized forms of paradata?
· DELETE /documents/{doc-id} => Doc ID is specified so the document (LR envelope) is deleted
o +1 - this is right
· DELETE /documents/{doc-id}/resources/{resource-url}
o I don't agree entirely with this one b/c I think it's possible we might want to submit a "delete" for a resource we've never submitted metadata on? What about:
o DELETE /paradata/resource/{resource-url}