Help with BagIt Setup

39 views
Skip to first unread message

Sherry Lake

unread,
Aug 3, 2021, 10:46:15 AM8/3/21
to Dataverse Users Community
UVa wants to use local storage for BagIt exports but are having trouble with the commands (and setup maybe?). When running this command (using dataverseAdmin's token):

curl -H "X-Dataverse-key: 33020XXXXXXXXXXXXXX7f230e" http://dvdev.lib.virginia.edu:8080/api/admin/submitDataVersionToArchive/:persistentId/?persistentId=doi:10.80100/FK2/PKSBXX/1.1

We get the error message:

{"status":"ERROR","code":404,"message":"API endpoint does not exist on this server. Please check your code for typos, or consult our API guide at http://guides.dataverse.org.","requestUrl":"http://dvdev.lib.virginia.edu:8080/api/v1/admin/submitDataVersionToArchive/:persistentId/?persistentId=doi:10.80100/FK2/PKSBXX/1.1","requestMethod":"GET"}


I am assuming the documentation (API command) is not correct??

Thanks,
Sherry

James Myers

unread,
Aug 3, 2021, 11:43:55 AM8/3/21
to dataverse...@googlegroups.com

--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataverse-community/9b4189f5-595a-4636-b194-6644c34844c2n%40googlegroups.com.

Sherry Lake

unread,
Aug 3, 2021, 1:44:58 PM8/3/21
to Dataverse Users Community
Thanks Jim.

Question if I just wanted the latest version, do I need to put the version in the URL?


--
Sherry

James Myers

unread,
Aug 3, 2021, 1:55:55 PM8/3/21
to dataverse...@googlegroups.com

Looking at the code, the version number is required. I don’t think it would be hard to either support the :latest-published tag instead of a number or just assume the latest published if the version is not sent, but neither of those is implemented now.

Sherry Lake

unread,
Aug 3, 2021, 2:16:19 PM8/3/21
to Dataverse Users Community
Hi Jim,

Sorry but my sysadmins keep thinking up new questions....

What if a dataset is not published? It doesn't have a version yet, does it?

Can you make bags of unpublished works?

---
Sherry

Sherry Lake

unread,
Aug 3, 2021, 2:42:06 PM8/3/21
to Dataverse Users Community
And one more....

Can multiple versions be combined in one bag?

I think the answer is "now" based on the URL needing an exact version #, but I was asked to ask.

James Myers

unread,
Aug 3, 2021, 3:17:47 PM8/3/21
to dataverse...@googlegroups.com

> Sorry but my sysadmins keep thinking up new questions....

 

In the job description ;-)

 

Bags were originally developed as a way to archive datasets at publication time (with archiving automated as a post-publication workflow), so they can’t currently be created for draft data. If desired, that’s probably a relatively minor change to support in the API.

 

Multiple versions is a more fundamental limit, at least in terms of standards: Bags weren’t designed to handle versions. The newer Oxford Common File Layout specification (https://ocfl.io/0.9/spec/) supports versions and might be a better/additional option for the future. (There are also some folks (e.g. https://stacks.wellcomecollection.org/how-we-store-multiple-versions-of-bagit-bags-e68499815184) who do non-standard things to allow Bags for later versions to refer to files in earlier versions to avoid duplicating the files, which would be a possible interim possibility.).

Reply all
Reply to author
Forward
0 new messages