top 25 cited datasets in your Dataverse installation
Philip Durbin
Jan 26, 2024, 12:36:41 PM
I don't know if this is common knowledge or not, but I was just chatting* with Kelly Stathis from DataCite, and if your installation of Dataverse uses DataCite (most do), you can pretty easily get a list of the 25 most cited datasets.
To back up a bit, if you go to https://commons.datacite.org/repositories/x3oc4vr, which is Harvard Dataverse's landing page in DataCite Commons, you'll see 2578 citations. Great! But which datasets have been cited? I know you can set up Make Data Count for this, but below is a quick way to check the top 25.
The main thing you need to know is your installation's client-id (Harvard Dataverse example below). I'm not 100% sure where to find this but I assume it's the value of dataverse.pid.datacite.username in domain.xml. You might also be able to find it in the list of DataCite clients at https://support.datacite.org/reference/get_clients
Anyway, once you have your client-id, here's how you can get the 25 most cited datasets:
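For anyone who would rather script this, here is a rough Python sketch of such a query against the DataCite REST API. It is not necessarily the exact URL from the conversation with DataCite: the `client-id`, `sort=-citation-count`, and `page[size]` parameters and the `citationCount` attribute are my understanding of the `/dois` endpoint, so check them against the DataCite API reference. The function names and the `example.client` id are made up for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

DATACITE_API = "https://api.datacite.org"


def top_cited_url(client_id, size=25):
    """Build a /dois query for one client's DOIs, most cited first."""
    params = {
        "client-id": client_id,
        "sort": "-citation-count",  # assumed sort key; leading "-" = descending
        "page[size]": size,         # assumed paging parameter
    }
    return f"{DATACITE_API}/dois?{urlencode(params)}"


def top_cited(client_id, size=25):
    """Fetch the query and return (doi, citationCount) pairs.

    Assumes the JSON:API response shape data[].attributes.{doi,citationCount}.
    """
    with urlopen(top_cited_url(client_id, size)) as response:
        records = json.load(response)["data"]
    return [
        (r["attributes"]["doi"], r["attributes"]["citationCount"])
        for r in records
    ]


# "example.client" is a placeholder, not a real client-id.
print(top_cited_url("example.client"))
```

Calling `top_cited("your-client-id")` makes a live request, so it needs network access; pasting the printed URL into a browser should show the same JSON.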
I checked with Kelly Stathis at DataCite and she explained that you can easily figure out the client id for your Dataverse installation by looking it up via the DOI authority/prefix. For example, UNC Dataverse has 10.15139 as the authority. You can look up the client id for 10.15139 by going to https://api.datacite.org/prefixes/10.15139 to discover that it is "gdcc.odum-dv". Then you can plug in that client id as before:
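The prefix lookup can be scripted too. The sketch below is only an illustration: the response shape (`data.relationships.clients.data[].id`) is my assumption about what `/prefixes/<prefix>` returns, the sample payload is hand-trimmed rather than a captured response, and the `sort=-citation-count` parameter is likewise an assumption to verify against the DataCite API reference. Function names are made up.

```python
from urllib.parse import urlencode

DATACITE_API = "https://api.datacite.org"


def prefix_url(prefix):
    """URL that describes a DOI prefix, e.g. 10.15139."""
    return f"{DATACITE_API}/prefixes/{prefix}"


def client_ids(prefix_payload):
    """Pull client ids out of a prefix response (assumed JSON:API shape)."""
    clients = prefix_payload["data"]["relationships"]["clients"]["data"]
    return [client["id"] for client in clients]


def top_cited_url(client_id):
    """Query for a client's DOIs sorted by citation count (assumed sort key)."""
    query = urlencode({"client-id": client_id, "sort": "-citation-count"})
    return f"{DATACITE_API}/dois?{query}"


# Hand-trimmed stand-in for the JSON at prefix_url("10.15139").
sample = {
    "data": {
        "id": "10.15139",
        "relationships": {
            "clients": {"data": [{"id": "gdcc.odum-dv", "type": "clients"}]}
        },
    }
}

for cid in client_ids(sample):
    print(top_cited_url(cid))
```

To use it for real, fetch `prefix_url("10.15139")`, parse the JSON, and pass the result to `client_ids` instead of the sample.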