We are pleased to announce a new release of Metacat (3.2.2), a turnkey data repository software platform used across the earth science community. This release introduces reliability enhancements for indexing datasets, along with enhancements to make the upgrade process more robust. It can be downloaded here.
For new installations, see the Installation notes and the Administrator’s Guide.
Metacat can also be deployed on Kubernetes, using a Helm chart. See the helm README.
Important: if you are upgrading an existing helm deployment, please be sure to read the chart upgrade section of the Release Notes.
Upgrades and Bug Fixes include:
Fixed: Bug: Bagit setting incorrect Content-Type (Issue #1953)
Fixed: Metacat DataONE admin page doesn't show the error message from CN (Issue #2040)
Fixed: Hashstore convertor may miss some objects which need to be converted (Issue #2182)
Update eml2osti.xsl for OSTI Service (Issue #2185), to incorporate bug fix for OSTI XML Sponsor List starting with semicolon causes error in OSTI submission
Upgrade Postgres from 42.7.4 to 42.7.7 (issue #2191)
Upgrade DataONE-Indexer library to 3.1.5 in metacat-index (see dataone-indexer Release Notes for details) and update Metacat Helm chart dataone-indexer sub-chart to version 1.3.2
Other Enhancements:
Ensure Metacat Startup Doesn't Fail if CN Registration Unsuccessful (Issue #2181)
Increase index.resourcemap.waitingComponent.max.attempts to 200 as a temporary fix for resourcemaps not being successfully indexed. Duplicated in Helm chart (.Values.dataone-indexer.idxworker.resourcemapMaxTries)
Set additional Ingress annotations in Helm chart, for enabling upload and download of large files and data packages, without timeouts/disconnects.
Utility Scripts for Metacat Administrators:
(Python) submit index tasks as low priority in the background, given a list of object IDs (PIDs) (Issue #2176)
(Python) Synchronize the system metadata of a list of object IDs (PIDs) from Metacat database to the corresponding files in HashStore (Issue #2166)
(Bash) Determine the subset of objects that have not been indexed, and submit them for indexing (Issue #2165)
About Metacat
Metacat (https://github.com/NCEAS/metacat) provides a standardized but customizable platform for preserving data and metadata in many formats. It helps scientists find, understand and effectively use data sets they manage. or that have been created by others. Hundreds of thousands of data sets are currently documented in a standardized way and stored in Metacat systems, enabling the scientific community to access a vast array of scientific data that can be easily searched, compared, merged, or adapted for other purposes due to its thorough and consistent description.
Metacat is compliant with the DataONE network, making it easy for organizations to participate in the global DataONE data federation, and integrates with ORCiD and DOI identifier systems. It also supports the customizable MetacatUI client-side application for searching and browsing data in Metacat and DataONE, creating an easily-deployed system with advanced search and discovery features pre-installed.
The open source Metacat system is maintained by NCEAS (https://nceas.ucsb.edu) and DataONE (https://dataone.org), and is used by repositories worldwide to manage data collections. We're an open community, collaborating on the shared development of this common data platform, to improve efficiency and sustainability in the open data ecosystem. We welcome all types of contributions, from feedback on features and bugs, to documentation, code, and everything in between. Please join us!
We hope that this software is useful to you. We welcome feedback and comments that will further improve the application. Please submit bugs and problems through our bug tracking system (https://github.com/NCEAS/metacat/issues) and send general feedback to devel...@dataone.org.
The Metacat Development Team