Indexing Datasets from Dataverse Repositories in Google Scholar?

47 views
Skip to first unread message

FREDRIK SAHLSTRÖM

unread,
Feb 19, 2026, 10:32:19 AM (5 days ago) Feb 19
to Dataverse Users Community
Dear Dataverse Community,

My name is Fredrik Sahlström, and I’m part of the admin team for the DataverseNO repository.

We are currently looking into whether and how datasets in DataverseNO are harvested and indexed by different external search engines. There is particular interest from our users to have their datasets indexed in Google Scholar. However, in Google Scholar, our datasets appear only sporadically, and typically as “citations”, which suggests they are being picked up via references in scholarly articles rather than harvested directly from the repository (see this example).

Has anyone in the community successfully had datasets from their Dataverse repository indexed directly in Google Scholar? If so, we would greatly appreciate any insights or recommendations on how to achieve this.

Many thanks in advance.

Kind regards,
Fredrik

Humberto Blanco Castillo

unread,
Feb 19, 2026, 1:07:34 PM (5 days ago) Feb 19
to dataverse...@googlegroups.com

Hi Fredrik,

Thank you for bringing this up. It is a challenge many of us in the community have encountered.

While Google Scholar focuses primarily on scholarly articles and often only indexes datasets when they are cited in those papers, Google provides a dedicated service specifically for this purpose: Google Dataset Search.

In my experience, rather than trying to force direct indexing into Google Scholar, the most effective approach is to ensure your repository's metadata is optimized for Google’s general search index. Google uses the schema.org vocabulary (specifically the Dataset type) to crawl and display these sets in Google Dataset Search.

Here are a few points that might help:

  • Schema.org Metadata: Dataverse usually exports metadata in JSON-LD format, which Google uses to discover datasets.

  • Sitemaps: Ensure your sitemap is up to date and accessible to Google’s web crawlers.

  • Google Search Console: You can use this tool to monitor how Google sees your repository and identify any indexing errors.

While it’s not exactly Google Scholar, Google Dataset Search is the standard "home" for data and is where most researchers now look for primary sources.

Best regards,


--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/dataverse-community/3835b7a7-b2d5-44fe-8e98-66e87885a9e0n%40googlegroups.com.


--
Cordialmente,


Humberto Blanco

FREDRIK SAHLSTRÖM

unread,
Feb 23, 2026, 4:34:11 AM (yesterday) Feb 23
to Dataverse Users Community
Dear Humberto,

Thank you very much for your very informative reply!

Kind regards,
Fredrik

Philip Durbin

unread,
Feb 23, 2026, 12:18:06 PM (20 hours ago) Feb 23
to dataverse...@googlegroups.com
Reply all
Reply to author
Forward
0 new messages