Mission Accomplished for Vectors of Human Disease Data Mobilization

2 views

Skip to first unread message

Prash

unread,

May 6, 2026, 6:25:26 PM (5 days ago) May 6

to bioc...@googlegroups.com

An excellent read!

͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏

Forwarded this email? Subscribe here for more

Mission Accomplished for Vectors of Human Disease Data Mobilization

The final papers coming out in the GigaByte-GBIF-TDR Vectors of Human Disease series provides a nice opportunity to look back at the lessons learned from this targeted data mobilization approach

Scott C Edmunds

May 6

READ IN APP

Regular readers may have seen my previous post covering some of the outputs from the sponsored data mobilization campaigns I facilitated at GigaScience Press that helped in the sharing of vectors of human disease data crucial in understanding and tackling some of the major killer diseases and new zoonotic outbreaks such as Oropouche virus that is spreading across the Americas. The last two papers to come from the three successive calls for papers have finally come out, so this milestone provides a nice opportunity to look back at the lessons learned from this targeted data mobilization approach. With that reason in mind our technical partners at River Valley Technologies have acknowledged this milestone on their and the ALPSP news pages, and they also kindly helped me publish a piece in Editorial Office News (EON) on how this was achieved from an infrastructure perspective. For those of you who may not keep up with the Editorial Office infrastructure literature I thought it could be useful to reshare an adapted version of these news items here.

Data sharing is important in any field, and open biodiversity data are arguably one of the most important datasets for understanding the planet; helping prevent diseases, model the effects of climate change, and figure out where we need to tackle these problems. The de facto home for this data is GBIF (the Global Biodiversity Information Facility), an international network and data infrastructure funded by the world’s governments to provide open access to data about all types of life on Earth. To create global impact, public health datasets have to be discoverable, citable, and easy to reuse (see also the FAIR principles). In practice, valuable datasets remain under-utilised because publication and data-sharing steps can be too expensive, too slow, or operationally too complex. This is particularly true for researchers in parts of the world that are disproportionately affected by these public health challenges, but have traditionally lacked the resources and expertise to tackle them effectively.

This is where we stepped in, with a WHO-supported collaboration with GBIF and our GigaByte journal that tackled the issue of this under-utilised and shared data proactively. In order to reduce the burden for contributors, the programme sponsored publication costs, a GBIF health data helpdesk, as well as hands-on support from the GigaScience Press GigaDB team for curation and data audits. The practical outcomes of these efforts have been improved accessibility, better discovery, stronger linking, and better reuse. Sponsorship removed the cost barrier (WHO covering the APCs), but scalability also came from making the process genuinely easier for contributors, through helpdesk support, data audits, and a publishing workflow we could run repeatedly.

This program enabled the following:

Three data mobilisation rounds from 2022-2025
A series of 31 data papers published (plus a commentary and Editorial)
Facilitated sharing of over 750,000 observations and 1.14 million specimens
Multilingual publishing support (including Portuguese, Spanish and French)
Interactive, data-rich outputs with embedded maps and protocols
Data audits and contributor support, improving dataset quality and reuse

This was truly a global effort, working with and sharing data from over 70 countries. The first call having a majority of papers from Latin-America, and the final call mostly publishing work from African authors (authors from Democratic Republic of Congo winning a Ben Barres Spotlight Award for their submission).

Why metadata matters in global health publishing

Metadata quality is not a technical detail. It enables discovery, linking, indexing, and machine-readability. In data-rich publishing, strong metadata practices help ensure that outputs travel properly through scholarly infrastructure and remain reusable long after publication.

This programme mobilized data and papers relevant to disease surveillance and response, including the first large-scale and open disease vector datasets for the following:

Potential Oropouche virus vectors during its first outbreak in Cuba
Chagas disease vectors outside their typical geographic range in the Americas
Rodent vector datasets relevant to Mpox in West Africa
Citizen science contributions, including the first detection of the dengue carrying Asian bush mosquito in Western Europe
Digitisation of century old historic records to support climate-driven analysis of vector distribution changes in Asia and South America

River Valley supported this work through its end-to-end publishing platform, enabling repeatable workflows across calls and supporting structured publishing. The result was multilingual and machine readable outputs, interactive content, and the richest metadata in the publishing industry (which enabled us to win an inaugral Crossref Excellence in Metadata Award.

You can see a video covering the features of the workflow we put together, and if you are looking to mobilise high-value datasets in a similar manner please get in touch with RVT and myself.

To explore the final outputs from this effort check out the 31 data papers (and one Editorial) collected together via the series page here:

https://doi.org/10.46471/GIGABYTE_SERIES_0002

Mission Accomplished for Vectors of Human Disease Data Mobilization

Prash

Mission Accomplished for Vectors of Human Disease Data Mobilization

The final papers coming out in the GigaByte-GBIF-TDR Vectors of Human Disease series provides a nice opportunity to look back at the lessons learned from this targeted data mobilization approach

Why metadata matters in global health publishing

Further Reading