Then, click the RDF button in the top right corner, select Add reconciliation service, Based on SPARQL endpoint. In the dialog, you tell OpenRefine how to access the controlled vocabulary. For LCSH, provide the following parameters:
http://sparql.freeyourmetadata.org/http://id.loc.gov/authorities/subjectsskos:prefLabel
Hi Richard,
I had the same trouble with the FreeYourMetadata LCSH procedure. As an alternative I tried loading one of LC's dump files into OpenRefine, thinking I could do a cell.cross to reconcile, but I could never get it to load in my computing environment (managed) even though I increased memory. I ended up using "Add column by fetching URLs" to get errors from the id.loc.gov site, which told me which headings were LCSH and which were not. This solution was inspired by Ruth Tillman Kitchin's blog post One Weird Query: Resolving LC Subject Strings to URIs Using Python (2020). Ruth goes into detail about how the LC "label" function works.
I'm not quite finished with the procedure because I messed up the logic with splitting off subdivisions, but it's 95% there.
Hope this helps!
Helen
--
You received this message because you are subscribed to the Google Groups "OpenRefine" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
openrefine+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/openrefine/2699be42-93c4-4712-a228-6253b0c2fffe%40googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to openr...@googlegroups.com.
> pip install -r requirements.txt (first time only); b> python reconcile.py --debug; My Python is Python 2.7.12)3. It will show: Running on http://0.0.0.0:5000/4. Can check in browser: http://localhost:5000/ or http://0.0.0.0:5000/ should show both LcNAf and LcSH
{
"defaultTypes": [
{
"id": "Names",
"name": "Library of Congress Name Authority File"
},
{
"id": "Subjects",
"name": "Library of Congress Subject Headings"
},
{
"id": "LoC",
"name": "LCNAF & LCSH"
}
],
"identifierSpace": "http://localhost/identifier",
"name": "LoC Reconciliation Service",
"schemaSpace": "http://localhost/schema",
"view": {
"url": "{{id}}"
5. In OpenRefine select Column to reconcile and "Add standard service" and then add http://0.0.0.0:5000/ in URL box (or your IP).
5.1 After adding we need to exit before "start reconciliation"
6. Tested with OR 3.1, 3.2 and 3.3
-----------------------------------------------------------------------
To unsubscribe from this group and stop receiving emails from it, send an email to openrefine+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/openrefine/02382c79-4849-458b-88e1-b4c5011543be%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/openrefine/02382c79-4849-458b-88e1-b4c5011543be%40googlegroups.com.
Hmmm, I think I need to learn some Python!
To unsubscribe from this group and stop receiving emails from it, send an email to
openrefine+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/openrefine/08163295-f9f9-41a2-84ca-e024af05f242%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/openrefine/DM5PR0701MB3638F0164AEE5ED4532B2B61888F0%40DM5PR0701MB3638.namprd07.prod.outlook.com.