number of citations working?

97 views
Skip to first unread message

Data CUHK

unread,
Sep 12, 2025, 6:23:32 AMSep 12
to Dataverse Users Community
Hi everyone,

I'm not sure if your dataverse is showing the number of citations for datasets in the repository. We have that for our datasets, but the number of citation counts is always 0 for all datasets.. I wonder what the counting mechanism behind the scenes are and if the metrics is actually working. Or it could be that our datasets haven't been cited at all. 

Thanks!

Best regards,
Qinqin

Philip Durbin

unread,
Sep 12, 2025, 10:07:56 AMSep 12
to dataverse...@googlegroups.com
Hi Qinqin,

Hmm, can you please verify your citations outside of Dataverse, by querying the DataCite API directly, using the technique described in the "top 25 cited datasets in your Dataverse installation" thread? Please see https://groups.google.com/g/dataverse-community/c/Q2PbdEaZjaA/m/vWSiGBtaBQAJ

Thanks!

Phil

--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/dataverse-community/590ab8cb-b7c4-46d8-9cf7-129b407adb75n%40googlegroups.com.


--

Julian Gautier

unread,
Sep 12, 2025, 10:17:31 AMSep 12
to Dataverse Users Community
Hi Qinqin,

The counting mechanism is described at https://guides.dataverse.org/en/latest/admin/make-data-count.html. That Make Data Count project generates the views, downloads, and citations counts, and Dataverse gets and displays those metrics on some regular basis.

On dataset pages on the CUHK Research Data Repository at https://researchdata.cuhk.edu.hk, I see view and download counts, so I'm assuming the repository is getting these Make Data Count metrics.

Whether or not DataCite records citations for datasets depends on how others are citing them and how different systems are describing those "citations" in the metadata they send to DataCite. It's definitely not perfect but groups are working on different ways of detecting citations, to improve citation counts.

Some folks have written about datasets or other research objects that they knew were "cited" in other published research objects, but DataCite wasn't aware of it, so it didn't appear in the Make Data Count database. So they tried to work out why that was, sometimes tracing the problem back to how the system, like another repository, was or wasn't sending that citation metadata to DataCite, or differences among research disciplines about what "cited" means.

Do you know that datasets in CUHK Research Data Repository have been cited? It might be helpful to look into cases like that.

All best,
Julian

Julian Gautier (he/him)
Product Research Specialist, IQSS
Interested in helping test Dataverse? Sign up for user experience research

Philip Durbin

unread,
Sep 12, 2025, 10:32:53 AMSep 12
to dataverse...@googlegroups.com
From https://api.datacite.org/prefixes/10.48668 I can tell that your client id is "wklx.uftoeh".

Let's run the following:

curl 'https://api.datacite.org/dois/?client-id=wklx.uftoeh&sort=-citation-count' | jq '.data[].attributes | "\(.citationCount) \(.url)"' -r

It claims that you have 5 datasets with 1 citation each:


However, I just visited each link above all 5 show 0 citations. I'm not sure why. Is Make Data Count set up properly? Or is my curl command wrong? Hmm.

Here's an example of a dataset in Dataverse that has 2 citations: https://data.qdr.syr.edu/citation?persistentId=doi:10.5064/F6NEGI7I

Exciting, right? This is what it's all about! Citing data!

I hope this helps!

Phil

Gautier, Julian

unread,
Sep 12, 2025, 10:43:14 AMSep 12
to dataverse...@googlegroups.com
I think your curl command is right, Phil. In the JSON at https://api.datacite.org/dois/10.48668/E6JEKD, I see that 1 citation count.

Screenshot 2025-09-12 at 10.40.49 AM.png


But those viewCounts and downloadCounts are very different than what's on the repository's dataset page at https://researchdata.cuhk.edu.hk/dataset.xhtml?persistentId=doi:10.48668/E6JEKD, so I'm a bit confused, now 😅


James Myers

unread,
Sep 12, 2025, 10:56:11 AMSep 12
to dataverse...@googlegroups.com

FWIW: The MDC processes to count views/downloads is ~separate from the process to get Citations from MDC, so just seeing MDC downloads doesn’t mean everything is configured. You need to periodically run the updateCitationsForDataset API (i.e. via the counter_weekly.sh script) to see the citations in Dataverse: see https://guides.dataverse.org/en/latest/admin/make-data-count.html#configuring-your-dataverse-installation-for-make-data-count-citations.

 

--Jim

--

You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.

Data CUHK

unread,
Sep 15, 2025, 12:23:27 AMSep 15
to Dataverse Users Community
Hi Phil, Julian and Jim,

Thank you so much for your insights! This is really helpful.
I will look into our  updateCitationsForDataset API and see if there is anything that needs to be fixed.. 

Thanks again!

Qinqin

Federico Yemurenko (ANII)

unread,
Oct 21, 2025, 10:01:24 AMOct 21
to Dataverse Users Community
Hi all,

We also have have an issue with citations counts in our DV.  

We know that some datasets do have citations, as per Datasite.  For example, the Datacite API returns that doi:10.60895/redata/JFXCLA has "citationCount": 6

However, when I try to get that figure to our DV, I get an error that does not make sense to me (Datacite IS our DOI provider) 

curl -X POST "http://localhost:8080/api/admin/makeDataCount/:persistentId/updateCitationsForDataset?persistentId=doi:10.60895/redata/JFXCLA"
{"status":"ERROR","message":"Only DataCite DOI providers are supported"}

I am double checking the configuration related to Datacite.  In the meantime, any help / suggestion will be appreciated.

Regards

Philip Durbin

unread,
Oct 21, 2025, 10:18:31 AMOct 21
to dataverse...@googlegroups.com
Hi Federico,

Hmm, https://doi.org/10.60895/redata/JFXCLA does seem to resolve fine as a proper DataCite DOI. And here it is in DataCite Commons: https://commons.datacite.org/doi.org/10.60895/redata/jfxcla

At least for Dataverse 6.8 (I'm not sure what version you're on), here's where the error is thrown: https://github.com/IQSS/dataverse/blob/v6.8/src/main/java/edu/harvard/iq/dataverse/api/MakeDataCountApi.java#L151

Do you have the ability to recompile the war file with some extra logging so we can see what pidProvider.getProviderType() is?

Thanks,

Phil

James Myers

unread,
Oct 21, 2025, 10:26:31 AMOct 21
to dataverse...@googlegroups.com

FWIW: My guess is that you’re seeing a bug in handling lower case shoulders that was fixed in Dataverse 6.5. I don’t know of any work-around for versions <= v6.4.

 

-- Jim

Federico Yemurenko (ANII)

unread,
Oct 21, 2025, 3:02:02 PMOct 21
to Dataverse Users Community
Hi,

This is Dataverse 6.4.

First, I found the the JVM option dataverse.pid.datacite.type was not set.  I added it with the value "datacite" following the definition of the constant TYPE in 
edu.harvard.iq.dataverse.pidproviders.doi.datacite.DataCiteDOIProvider.java 

public static final String TYPE = "datacite";

It was not enough to avoid the condition in edu.harvard.iq.dataverse.api.MakedataCountAPI.java that return the error

            if(!DataCiteDOIProvider.TYPE.equals(pidProvider.getProviderType())) {
                return error(Status.BAD_REQUEST, "Only DataCite DOI providers are supported");
            }

dataverse@ip-10-0-2-41:~/glassfish$ bin/asadmin list-jvm-options | grep datacite
-Ddataverse.pid.datacite.password=${ALIAS=doi_password_alias}
-Ddataverse.pid.datacite.username=ANII.ANII
-Ddataverse.pid.datacite.mds-api-url=https://mds.datacite.org
-Ddataverse.pid.datacite.rest-api-url=https://api.datacite.org
-Ddataverse.pid.datacite.type=datacite
dataverse@ip-10-0-2-41:~/glassfish$


{"status":"ERROR","message":"Only DataCite DOI providers are supported"}

Then, there is the issue with dataverse software being case sensitive.  

Returns the required dataset


The same with the API: 

fyemurenko@ip-10-0-2-41:~$ curl -X POST "http://localhost:8080/api/admin/makeDataCount/:persistentId/updateCitationsForDataset?persistentId=doi:10.60895/redata/jfxcla"
{"status":"ERROR","message":"Cuando se accede a un conjunto de datos mediante su ID persistente, debe indicarse un parámetro persistentId."}

fyemurenko@ip-10-0-2-41:~$ curl -X POST "http://localhost:8080/api/admin/makeDataCount/:persistentId/updateCitationsForDataset?persistentId=doi:10.60895/REDATA/JFXCLA"
{"status":"ERROR","message":"Cuando se accede a un conjunto de datos mediante su ID persistente, debe indicarse un parámetro persistentId."}


{"status":"ERROR","message":"Only DataCite DOI providers are supported"}

I undestand that DOIs should not be case sensitive since they are used un URLs

And have a look at this.  When I call datacite API with DOI 10.60895/redata/JFXCLA

I get the json but with ID in lowercase !!!!!!!!  May be, this is why Dataverse don´t know what to do with this response; this DOI in lowercase don´t belong to this repository

{ "data": { "id": "10.60895/redata/jfxcla", "type": "dois", "attributes": { "doi": "10.60895/redata/jfxcla", "prefix": "10.60895", "suffix": "redata/jfxcla", "identifiers": [], "alternateIdentifiers": [], "creators": [ .............. }

I can also call Datacite API with the DOI in lowercases, to get the same response.  As it should be.

We have at least two issues here, I think.  I will start by upgrading to 6.5 

If it doesn´t work, I will add the traces suggested by Phil

I ´ll let you know then

Regards
Reply all
Reply to author
Forward
0 new messages