Dear Dataverse Community,
We would like to share a small open-source tool that we are developing at the University of Bonn to simplify downloading large and complex datasets from Dataverse installations.
We are the Service Center for Research Data at the University of Bonn and operate our Dataverse installation within the university IT - Center.
The motivation behind the tool was a problem we repeatedly encountered with large or complex datasets:
downloads may fail, interrupted transfers are difficult to resume, and it can become unclear which files have already been downloaded successfully — especially when datasets are organized in hierarchical folder structures and partial downloads are difficult.
To address this, we developed a lightweight desktop downloader with features such as:
The project is fully open source and available here:
https://github.com/sergejzr/harvard-dataverse-downloaderWe have already received first positive feedback from the German Dataverse community and would be very happy to hear from others as well:
We are also currently testing direct Dataverse integration via custom deep links (e.g. opening datasets directly from the browser into the downloader application).
Feedback, ideas, and contributions are very welcome (here or directly at GitHub).
Best regards from Bonn,
Sergej Zerr & the RDM Team
University of Bonn
Service Center for Research Data / University IT
PS: See you in Barcelona next week! :)

-- Dr. Sergej Zerr Hochschulrechenzentrum Bonn Servicestelle Forschungsdatenmanagement - SFD Tel: +49 228 73-4121 Raum: 3.011 Wegelerstrasse 6 53115 Bonn www.hrz.uni-bonn.de
--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/dataverse-community/CACdcJw2qcqTTueDtuTi%2Bhu2O8Ay60xM3hNABhJV_Hvvsd1dqsg%40mail.gmail.com.
Hi Martin,
Thanks! Yes, we are facing exactly the same problem. For example, a file may download successfully from Europe, but from Australia the transfer can take slightly longer and eventually break due to the timeout.
And yes — the tool supports resuming downloads of individual files. It gives fairly complete control over the entire download process. Users can download selected subfolders or entire datasets, and interrupted downloads can be resumed at any time.
There is currently no dedicated feature to resume all failed downloads at once. However, restarting the application effectively should achieve the same result, since it automatically detects and continues interrupted downloads.
Best regards,
Sergej
To view this discussion visit https://groups.google.com/d/msgid/dataverse-community/329dead6-8a9b-4cdc-aa64-a79cbbf39d94n%40googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/dataverse-community/775e0d4f-77f3-4045-bc2e-c73b317ba533n%40googlegroups.com.