Easy way to pull down metadata sans DGI?

61 views
Skip to first unread message

Audrey Sage Lorberfeld

unread,
Jul 6, 2017, 10:24:44 AM7/6/17
to islandora
Hi Islandora Hive Mind,

We at the Academy are wondering if any of you who subscribe to Discovery Garden know of a way to pull metadata down from your Islandora instance without having to go through DGI? It is unlikely that there is a way to do this without direct access to our Fedora server, but I thought I would inquire here first. 

Thank you!

Best,
Audrey

 

Audrey Sage Lorberfeld, MLIS

Digital Technical Specialist

Library and Center for the History of Medicine and Public Health

212.822.7330 office

 

The New York Academy of Medicine

LIBRARY

1216 Fifth Avenue |  New York, NY, 10029

Brandon Weigel

unread,
Jul 6, 2017, 10:49:19 AM7/6/17
to islandora
Hi Audrey,

There are numerous ways to export your metadata on your own, but it really depends on what you're looking for and what kind of access and capabilities you have. MODS or DC datastreams? Check Islandora Labs for things like this: https://github.com/Islandora-Labs/islandora_datastream_exporter (requires SSH access to your repository)

Another method: The Move to Islandora Kit can download MODS or DC (and, optionally, objects) from the OAI-PMH endpoint using the OAI toolchain: https://github.com/MarcusBarnes/mik (does NOT require SSH access to your repository, but you will have to install MIK on your machine, write a config file, etc.) Check the wiki for detailed instructions and sample files.

Another method you can access via the UI (which I haven't tried myself) is Islandora BagIt: https://github.com/Islandora/islandora_bagit

Hope that helps! 

- Brandon

Brian Harrington

unread,
Jul 6, 2017, 11:24:08 AM7/6/17
to isla...@googlegroups.com
Hi Audrey,

If you have a list of PIDs that you would like to get your metadata for, this script (from Don Moses) will grab your MODS.  It just runs through the list and uses curl to pull down the MODS datastream from the Islandora server.

#!/bin/bash

for PID in `cat pid_list.txt`
do
## get the MODS ds using the pattern of an islandora URL 
## and write it out to a file in the local directory
curl http:// [insert server name here] /islandora/object/$PID/datastream/MODS/download > $PID"_mods.xml"

## rename the file to get rid of the colon in the PID
mv $PID"_mods.xml" `echo $PID"_mods.xml" | sed s/:/_/` 
done

Simple, but it does the job nicely.

I hope this helps.

Brian


--
For more information about using this group, please read our Listserv Guidelines: http://islandora.ca/content/welcome-islandora-listserv
---
You received this message because you are subscribed to the Google Groups "islandora" group.
To unsubscribe from this group and stop receiving emails from it, send an email to islandora+...@googlegroups.com.
Visit this group at https://groups.google.com/group/islandora.
To view this discussion on the web visit https://groups.google.com/d/msgid/islandora/ce46c706-48da-4988-a8ab-b86d2e585135%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Mark Jordan

unread,
Jul 6, 2017, 12:30:05 PM7/6/17
to isla...@googlegroups.com
Hi Audrey,

If you don't have a list of PIDs (e.g., you want MODS for all objects or on a collection by collection basis), I'd look into Brandon's suggestion of using the Move to Islandora Kit's OAI harvesting capabilities. If you want some assistance, let me know.

Mark

Jordan Dukart

unread,
Jul 7, 2017, 6:05:44 AM7/7/17
to isla...@googlegroups.com
Depends on what exactly you are looking for / your use case. Are some modules that exist that meet particular use cases such as: https://github.com/discoverygarden/islandora_zip_download, which provides an export handler to Bookmarks to download parts of Fedora objects in ZIP files (requires Gearman). Is also: https://github.com/Islandora-Labs/islandora_datastream_exporter, which is a lighter version of the former but would require shell access to run a drush command. In terms of a bulk UI option I'm not sure there exists one in the core suite of modules as of current.

Jordan
--

Jordan Dukart

unread,
Jul 7, 2017, 6:07:28 AM7/7/17
to isla...@googlegroups.com, Mark Jordan
Whoops didn't see the above before posting.

Audrey Sage Lorberfeld

unread,
Jul 7, 2017, 6:11:24 AM7/7/17
to isla...@googlegroups.com
Hi All, 

Thank you so much for all of these helpful responses. I have to figure out some quick things on my end, and I will get back to this thread shortly! 

Thank you again!

Best, 
Audrey

On Jul 7, 2017 6:05 AM, "Jordan Dukart" <jor...@discoverygarden.ca> wrote:
Depends on what exactly you are looking for / your use case. Are some modules that exist that meet particular use cases such as: https://github.com/discoverygarden/islandora_zip_download, which provides an export handler to Bookmarks to download parts of Fedora objects in ZIP files (requires Gearman). Is also: https://github.com/Islandora-Labs/islandora_datastream_exporter, which is a lighter version of the former but would require shell access to run a drush command. In terms of a bulk UI option I'm not sure there exists one in the core suite of modules as of current.

Jordan
On 2017-07-06 11:24 AM, Audrey Sage Lorberfeld wrote:
Hi Islandora Hive Mind,

We at the Academy are wondering if any of you who subscribe to Discovery Garden know of a way to pull metadata down from your Islandora instance without having to go through DGI? It is unlikely that there is a way to do this without direct access to our Fedora server, but I thought I would inquire here first. 

Thank you!

Best,
Audrey

 

Audrey Sage Lorberfeld, MLIS

Digital Technical Specialist

Library and Center for the History of Medicine and Public Health

212.822.7330 office

 

The New York Academy of Medicine

LIBRARY

1216 Fifth Avenue |  New York, NY, 10029

--
For more information about using this group, please read our Listserv Guidelines: http://islandora.ca/content/welcome-islandora-listserv
---
You received this message because you are subscribed to the Google Groups "islandora" group.
To unsubscribe from this group and stop receiving emails from it, send an email to islandora+unsubscribe@googlegroups.com.

--
For more information about using this group, please read our Listserv Guidelines: http://islandora.ca/content/welcome-islandora-listserv
---
You received this message because you are subscribed to a topic in the Google Groups "islandora" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/islandora/q8FLsxTvnuM/unsubscribe.
To unsubscribe from this group and all its topics, send an email to islandora+unsubscribe@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages