Interested in adding more dataset metadata about your repository that gives users a greater sense of the data's trustworthiness?


Julian Gautier

Dec 2, 2025, 3:51:18 PM
to Dataverse Users Community
Hi everyone,

The team at Harvard is working on changes to Dataverse so that repositories can publish machine-readable metadata, defined by research communities, that lets users know more about a dataset's fitness for reuse, provenance, sensitivity, or other domain-specific indicators of trust. We've been calling this work the Trusted Data project, and you can read more about it at https://github.com/IQSS/dataverse-pm/issues/425.

As part of this project, we're interested in working with folks who manage Dataverse repositories to learn if we should also include, in each dataset, more metadata about repositories themselves, to give users a greater sense of the trustworthiness of the data.

For example, would someone who finds a dataset feel more confident in the data if they saw that it was published in a repository or collection that had the CoreTrustSeal certification, or in a repository that met certain requirements of that certification, or that had certain characteristics from the NIH's "Desirable Characteristics for Data Repositories"? Could we work out a way to describe such repository characteristics so that they're included in each dataset's metadata?

If you're interested in working with us or have questions about this effort, please email me at julian...@g.harvard.edu by December 19, 2025. Then, during the week of January 5 (after we get back from winter holidays), we'll reach out to everyone who has expressed interest to start brainstorming and testing ideas.

Cheers!
Julian

Julian Gautier (he/him)
Product Research Specialist, IQSS

Julian Gautier

Dec 3, 2025, 1:48:58 PM
to Dataverse Users Community
Hi again everyone! I'm sharing more information about the scope of this effort. The main deliverable will be a brief white paper that lists the repository characteristics that repository managers, curators, and authors of relevant literature think signal trust, and that discusses how these characteristics might be represented and shared in Dataverse.

The GitHub issue at https://github.com/IQSS/dataverse-pm/issues/463 describes more of the scope of the effort and tasks.

Thanks to everyone who's expressed interest so far.

Vaidas Morkevičius

Dec 16, 2025, 11:57:03 AM
to dataverse...@googlegroups.com
Dear Julian,

The project that you described in the e-mail is very interesting, and we would like to help you in any way we can. So please count LiDA in as collaborators.

Thank you and best wishes,
--
Vaidas Morkevičius
Coordinator
Lithuanian Data Archive for Social Sciences and Humanities (LiDA, https://lida.dataverse.lt)
Editorial Board Member
Registry of Research Data Repositories (re3data.org)



Julian Gautier

Dec 18, 2025, 2:56:56 PM
to Dataverse Users Community
Hi Vaidas,

Thanks for your interest. I'll add your email to the email thread with everyone else who plans to help write the report.