In 2016 I started a project to populate Wikidata with citation relationship data, i.e., instances of one work citing another. This started as a batch script and in 2018 evolved into a bot, Citationgraph bot. The citationgraph bot pulled citation data from PubMed Central and mapped it to entries on Wikidata. Subsequently I developed Citationgraph bot 2, which does the same using data from Crossref.
At the end of 2018, the two bots went down and stayed down for over two years. As the corpus of academic paper data grew on Wikidata, I could no longer provide sufficient technical resources for the bot.
As of this month, both Citationgraph bot and Citationgraph bot 2 are back online. The original Citationgraph bot still uses the PubMed Central API, while Citationgraph bot 2 has been updated to rely on a Crossref data dump from March 2020.
What this data allows us to do is identify the most important and significant papers among the tens of millions. And if a given paper is cited on Wikipedia, having a built out citation graph on Wikidata allows you to follow the chain of provenance, from Wikipedia article to review article to underlying original sources. Alternatively, this could help in recommending sources for use in Wikipedia articles.