Dear Prof. Daniel J. Rigden,
We would like to propose the inclusion of our manuscript describing “openTECR”, the open Database on Thermodynamics of Enzyme-Catalyzed Reactions, in the upcoming database issue of NAR. You can find the abstract for this manuscript below, attached to the end of this email.
We have rigorously recurated the NIST Thermodynamics of Enzyme-Catalyzed Reactions database (TECRDB) in an unprecedented high quality. We improved the consistency of the data with the cited experimental literature, and developed an accessible and actively managed website for researchers to search and download experimental thermodynamic data.
This is a unique asset to the community because TECRDB is the largest database of experimental thermodynamic measurements, and is the basis of thermodynamic prediction tools such as eQuilibrator. However, NIST’s TECRDB provides only limited HTML-based access to the underlying data, which has led to various research teams independently web-scraping the database. Our openTECR database, in full cooperation with the developers of the original database, provides a standard for the community that can be useful to both experimentalists and computational researchers.
We also wish to highlight the fact that this effort has been undertaken by an international volunteer community and coordinated asynchronously over the past few years, representing a success story of scientific achievements through volunteer efforts.
Please let us know if we should proceed with submitting the manuscript.
With best regards,
Robert Giessmann, on behalf of the openTECR community
Title:
openTECR: A public catalog of experimental thermodynamic data of enzyme-catalyzed biochemical reactions
Abstract:
Enzyme-catalyzed metabolic reactions are ubiquitous in all domains of life, and the thermodynamics of these reactions is an essential aspect of understanding how they operate in a metabolic environment. However, published thermodynamic data for these biological reactions is sparse, heterogeneous, and often inaccessible. Robert Goldberg and colleagues published a collection of this data and the National Institute of Standards and Technology (NIST) developed a website to interface with this collection, termed TECRDB. But the database website is not developed further and its webpages have gradually corrupted and lost functionality and content over the years.
We therefore developed the “open database on Thermodynamics of Enzyme-Catalyzed Reactions” (openTECR) as a free and public resource for cataloging experimentally determined thermodynamic parameters. From a collection of over 1,000 publications, we curated a database with 5,565 data points, consisting of 773 enthalpy measurements and 4,792 measured equilibrium constants. In a first approach towards a continuously improving database, we re-curated the exact information given in the PDF version of TECRDB. We added links to the original source in the form of DOI and/or PMID and to the reaction in Rhea, where possible. This homogenized and accessible data will be invaluable for metabolic modeling and cheminformatics, among other fields. Our data set will further improve thermodynamic prediction tools such as eQuilibrator and dGpredictor and help pathway databases (e.g. Plant Reactome, BioCyc, Reactome, etc.) to include additional information relevant to their users.
openTECR is the most comprehensive experimental database of thermodynamic metabolic data, adding 838 equilibrium constants and experimental conditions for an additional 239 enthalpy measurements relative to the best-known machine-readable data set. Our open source and open data initiative invites the community to contribute by submitting and curating new experimental data, as well as enhancing the user experience. The database is available at https://w3id.org/opentecr/tecrdb and can be downloaded as a single tabular file licensed under CC-0.