About OrganiZymeDB

OrganiZymeDB is a manually curated database dedicated to enzyme stability and function in the presence of organic solvents. This page describes the rationale behind the resource, its content, and how to use it.

Rationale

The use of enzymes as catalysts is steadily growing, driven by their often remarkable selectivity, catalytic efficiency and their environmental sustainability. Yet enzymes have evolved to function under conditions that are frequently drastically different from those required for real-world applications. In particular, the use of organic solvents is often indispensable in biocatalysis: they solubilize substrates with limited water solubility, including organic synthesis intermediates and (bio-)polymers, act as stoichiometric reactants in processes such as biodiesel production, suppress unwanted hydrolytic side reactions, and can even unlock entirely new chemistries, such as peptide synthesis by proteases or transesterification by lipases.

Despite this practical importance, no central and organized repository of such data existed prior to OrganiZymeDB. This gap particularly hinders computational scientists: without accessible, structured data, it is impossible to train machine-learning algorithms, fine-tune self-supervised AI models, or calibrate physics-based approaches aiming to predict enzyme stability and function in the presence of organic solvents.

OrganiZymeDB was built to address this need, with three main goals:

A resource for biocatalysis

To provide chemists and biotechnologists with a central, extensively annotated and manually curated repository of experimental data on enzyme stability and function in the presence of organic solvents. Such a resource makes it straightforward to assess whether and how an enzyme of interest or its homologs have been characterised in such conditions.

A reference dataset for computational methods

To supply a structured dataset for the development, training and evaluation of computational methods aimed at understanding and predicting enzyme stability and function in the presence of organic solvents.

Expanding the focus of enzyme engineering

To advance research on enzyme stability and function in the presence of organic solvents, driving the use of organic-solvent tolerance as a key parameter in enzyme engineering.

OrganiZymeDB Content

Data in OrganiZymeDB originates exclusively from peer-reviewed scientific publications and preprints, and covers experiments conducted on enzymes that are neither immobilized nor chemically modified. A substantial fraction of the data was manually extracted from figures (barplots, scatterplots, heatmaps) using WebPlotDigitizer, making it available in machine-readable numerical form for the first time.

The database is organized around enzyme entries, each corresponding to the experimental data reported across one or more publications for a given enzyme. Every entry includes enzyme annotations, experimental measurements with associated experimental conditions, barplot visualizations and an AlphaFold3-modeled structure of the studied enzyme. When applicable, each experimental value is paired with a control (the same property measured in pure aqueous conditions) and, for mutants, when available, with the wild-type value under identical conditions (thus in the presence of organic solvent). Experimental assays are classified according to a controlled vocabulary described below.

Using OrganiZymeDB

Search

The search bar at the top of every page accepts queries across multiple categories: protein name, organism, EC number, article, author, substrate, product, and organic solvent. If no result is found in the selected category, the search automatically tries other categories, prioritising assay type matches.

Browse

Browse the full database by enzyme, by scientific publication, or by assay type.

BLAST

To identify whether measurements have been conducted on an enzyme of interest or its homologs, the BLAST page accepts one or more protein sequences and returns matching database entries ranked by E-value. The E-value threshold and maximum number of hits are user-configurable.

Download

The full dataset is available for download as a CSV file. A Build-my-dataset feature additionally lets users filter datapoints by assay type, organic solvent, enzyme EC class, publication year, or wild-type/mutant status before downloading.

REST API

OrganiZymeDB exposes a REST API for programmatic access to all database content. The API documentation page describes available endpoints, query parameters, and response formats, with worked examples.

Deposition

Contributions are welcome, with the Deposit page explaining how to submit new experimental data. For questions, contact us at Romain.Debruyne@ulb.be and Fabrizio.Pucci@ulb.be.

Citation & License

How to Cite

If you use OrganiZymeDB in your research, please cite:

[Authors]. OrganiZymeDB: [Title]. [Journal], [Year]. DOI: [DOI]

License

OrganiZymeDB data is distributed under the Creative Commons Attribution–NonCommercial 4.0 International (CC BY-NC 4.0) licence. For commercial use enquiries, please contact Romain.Debruyne@ulb.be and Fabrizio.Pucci@ulb.be.