Download

Download the complete OrganiZymeDB dataset, or use the filters below to build and export a custom subset.

Full Dataset

Download the complete database as a flat CSV file. Each row is one measurement, with all associated experimental conditions, protein metadata, and source references. Data is released under CC BY-NC 4.0. For commercial use, contact Romain.Debruyne@ulb.be and Fabrizio.Pucci@ulb.be.

⬇ Download all measurements (CSV)

CSV column reference

Each row is one measurement. Missing values are reported as NaN. When a measurement involves multiple solvents, substrates, or products, their respective fields contain all values separated by | .

Assay
measurement_id Unique identifier for each measurement row
property Type of property measured according to a controlled vocabulary (e.g. Stability - Tm, Specificity - Enantioselectivity)
assay_description Experimental assay description
aqueous_control Aqueous control (if mutant enzyme, assay performed with the mutant enzyme in aqueous conditions)
wt_control Wild-type control value (assay in the same conditions for the wild-type enzyme)
measured_value Experimentally measured value
unit Unit of the measured value
assay_solution Assay solution composition (buffer, additives, …)
ph Assay solution pH
temperature Assay temperature
solvent_name Organic solvent name
solvent_volume Organic solvent volume fraction (in % v/v), exceptionally organic solvent concentration (in M) and then indicated as such
substrates Substrate(s) with concentration if available (e.g. 100 mM p-nitrophenyl butyrate)
products Product(s)
cofactor Cofactor(s)
shaking Assay solution shaking
additional_annotation Additional annotation(s)
Enzyme
enzyme_name Enzyme name
species Enzyme Provenance
database_id Database accession number
is_extremophile Binary variable 1 if enzyme is annotated as originating from an extremophile organism, 0 otherwise
extremophile_annotation Additional annotation on the extremophilic nature of the organism or enzyme
mutation Amino acid substitution (with residue numbering according to the provided sequence)
sequence Enzyme amino acid sequence (including mutations if applicable)
Article
doi Digital Object Identifier of the source publication
year Publication year
authors Author list of the source publication
title Title of the source publication
journal Journal name
Organic solvent(s) : additional information
solvent_cas CAS number of the organic solvent(s)
solvent_pubchem PubChem CID of the organic solvent(s)
solvent_smiles SMILES string(s) of the organic solvent(s)
solvent_logp Octanol-water partition coefficient (log P) of the organic solvent(s)
solvent_chem21_safety CHEM21 safety score of the organic solvent(s)
solvent_chem21_health CHEM21 health score of the organic solvent(s)
solvent_chem21_environment CHEM21 environment score of the organic solvent(s)
solvent_chem21_ranking_default CHEM21 overall ranking of the organic solvent(s)
solvent_chem21_ranking_discussion CHEM21 ranking discussion / notes of the organic solvent(s)

For more information about CHEM21 scores, see https://pubs.rsc.org/en/content/articlelanding/2016/gc/c5gc01008j

Substrate(s) & Product(s) : additional information
substrate_cas CAS number of the substrate(s)
substrate_pubchem PubChem CID of the substrate(s)
substrate_smiles SMILES string of the substrate(s)
product_cas CAS number of the product(s)
product_pubchem PubChem CID of the product(s)
product_smiles SMILES string of the product(s)

Build My Dataset

Select filters to export only the measurements relevant to your work. Leave any filter empty to include all values for that dimension. The live count updates as you adjust filters.

Assay Type

Leave all unchecked to include all types.

Solvent

Enzyme Class (EC)

Publication Year

Variant Type

Matching measurements:
Adjust filters to refine your selection

Exports include all columns described in the Full Dataset section above. Missing values are reported as NaN. Results are capped at 50,000 rows; use the full-dataset button above for an uncapped export.