I've been working on integrating the 20048 titles covered by CrossRef
into the periodicals dataset. I've also taken the chance to change
some other bits and pieces and I've loaded these into the store for
review via http://periodicals.dataincubator.org:
* The VoID dataset descriptor has changed. Rather than just one
descriptor, the main descriptor now has three subsets, one each for
Highwire, NLM and CrossRef. Each subset's source now points to the
actual file the data was generated from. Each resource has one or more
dct:partOf predicates pointing back to the subset(s) it was generated
from.
* For items from the CrossRef set, and where available, a title-level
DOI is provided as a literal via bibo:doi. The DOI is also turned into a
bibo:uri by prepending the dx.doi.org prefix, and an owl:sameAs is
generated to a resource describing the DOI. See http://periodicals.dataincubator.org/journal/cellmotilityandthecytoskeleton
* Resource URIs for titles from all three sets are now based upon
lower-casing and stripping spaces and special chars from the
periodical's title. This means that data that overlaps between the
sets is naturally merged - e.g. http://periodicals.dataincubator.org/journal/cellmotilityandthecytoskeleton
now benefits from data gleaned from both the NLM (provides the
foaf:primaryTopicOf) and CrossRef (provides the DOI and publisher
info) sets.
* Added a new target in the rake file to also upload the void.rdf file
* Added dct:publisher literal where available
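
As a rough illustration of the URI and DOI handling described in the list above, here is a minimal Python sketch. It is not the actual conversion code: the function names are hypothetical, the rule for "special chars" is assumed to be "anything other than a-z and 0-9", and the DOI in the usage note is a made-up placeholder.

```python
import re

# Base URI taken from the example links in this message.
JOURNAL_BASE = "http://periodicals.dataincubator.org/journal/"
# Resolver prefix that gets prepended to a bare DOI.
DOI_PREFIX = "http://dx.doi.org/"


def journal_uri(title: str) -> str:
    """Build a resource URI from a periodical title.

    Assumption: lower-case the title, then drop every character that is
    not an ASCII letter or digit (spaces and punctuation included).
    """
    slug = re.sub(r"[^a-z0-9]", "", title.lower())
    return JOURNAL_BASE + slug


def doi_uri(doi: str) -> str:
    """Turn a bare DOI literal into a resolvable URI."""
    return DOI_PREFIX + doi.strip()
```

With these assumptions, journal_uri("Cell Motility and the Cytoskeleton") and journal_uri("Cell motility and the cytoskeleton.") give the same URI, which is why records for the same title from different sets end up merged on one resource. doi_uri("10.1234/example") simply prepends the resolver prefix to the placeholder DOI.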
On the last point, dct:publisher is currently modeled and used as a
literal. What do people think about minting URIs and creating entities
for the publishers themselves? Under what schema/ontology would these
be modeled?
Have a good weekend,
Chris
I forgot to say that I'd also sorted out the search indexing. Do any of these fit the bill?
http://periodicals.dataincubator.org/~search.html?query=social+geography
The issue of maintenance and synchronization of the Talis and SHERPA
data may still be vexing. The same issue exists for every journal data
source, another significant one being JournalSeek: http://journalseek.net/