Hi,
I was doing a citation analysis for Stockholm University publications and found a few articles which have a much higher "cited by count" in OpenAlex than in other databases. The most extreme case was a 2019 article from Scientific Reports (https://openalex.org/works/w2983201012) with 1787 citations as of today in OpenAlex, but only 30-40 in other databases (Scopus, WoS, Dimensions, Crossref). The citing articles listed in OpenAlex are mostly on unrelated topics: the article is a psychology article, while the citing articles come from other fields, and most of them are from 2020-2021. I checked some of the citing articles and they don't have this psychology article in their reference lists. So this appears to be caused by some kind of reference matching error in OpenAlex, although the exact origin of the error is unclear.
I then looked for similar cases among the highly cited publications in OpenAlex in general, and I found an Indonesian article from 2024 (https://openalex.org/works/w4402690901) with more than 2700 citations. The weird thing here is that most of the citing articles were published earlier than the cited article. According to Crossref, this article hasn't received any citations yet (https://api.crossref.org/works/10.30640/dewantara.v3i2.2644).
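In case it is useful to others, the chronology check described above (citing articles published earlier than the cited article) can be sketched in Python. This is only a minimal sketch under assumptions: each record is taken to be a dict with `id` and `publication_year` keys, mirroring field names in OpenAlex work records; the function name `citing_before_cited` and the sample records are hypothetical and not real data.

```python
from typing import Iterable, List


def citing_before_cited(cited_year: int, citing_works: Iterable[dict]) -> List[dict]:
    """Return the citing records whose publication year precedes the cited work's year."""
    suspicious = []
    for work in citing_works:
        year = work.get("publication_year")
        # Skip records without a year: they cannot be judged either way.
        if year is not None and year < cited_year:
            suspicious.append(work)
    return suspicious


# Hypothetical records for illustration only (not real OpenAlex data).
sample = [
    {"id": "W_EXAMPLE_1", "publication_year": 2021},
    {"id": "W_EXAMPLE_2", "publication_year": 2025},
    {"id": "W_EXAMPLE_3", "publication_year": None},
]

flagged = citing_before_cited(2024, sample)
print([w["id"] for w in flagged])
```

A single earlier-dated citing record is not proof of an error, since preprints and online-first versions can legitimately carry an earlier year than the version of record. It is the share of such records among all citing works that would point to a matching problem.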
Has anyone seen similar discrepancies in OpenAlex citation counts?
Gabor Schubert,
Stockholm University
Hi Gabor,
I found similar cases for paratext-like publications in OpenAlex:
For example, if you search for "Table of Contents", you will find several publications with a high number of citations, although most of these publications are just ordinary tables of contents.
Example: https://openalex.org/works/w4242011962
I also found some examples of publications that “cite” these tables of contents but have a publication year earlier than that of the cited publication.
Example: https://openalex.org/works/w4239391401 (published in 2016, but cited by an article from 1998)
Best,
Nick