Download
Download the complete OrganiZymeDB dataset, or use the filters below to build and export a custom subset.
Full Dataset
Download the complete database as a flat CSV file. Each row is one measurement, with all associated experimental conditions, protein metadata, and source references. Data is released under CC BY-NC 4.0. For commercial use, contact Romain.Debruyne@ulb.be and Fabrizio.Pucci@ulb.be.
⬇ Download all measurements (CSV)CSV column reference
Each row is one measurement. Missing values are reported as NaN.
When a measurement involves multiple solvents, substrates, or products, their
respective fields contain all values separated by | .
measurement_id
Unique identifier for each measurement row
property
Type of property measured according to a controlled vocabulary (e.g. Stability - Tm, Specificity - Enantioselectivity)
assay_description
Experimental assay description
aqueous_control
Aqueous control (if mutant enzyme, assay performed with the mutant enzyme in aqueous conditions)
wt_control
Wild-type control value (assay in the same conditions for the wild-type enzyme)
measured_value
Experimentally measured value
unit
Unit of the measured value
assay_solution
Assay solution composition (buffer, additives, …)
ph
Assay solution pH
temperature
Assay temperature
solvent_name
Organic solvent name
solvent_volume
Organic solvent volume fraction (in % v/v), exceptionally organic solvent concentration (in M) and then indicated as such
substrates
Substrate(s) with concentration if available (e.g. 100 mM p-nitrophenyl butyrate)
products
Product(s)
cofactor
Cofactor(s)
shaking
Assay solution shaking
additional_annotation
Additional annotation(s)
enzyme_name
Enzyme name
species
Enzyme Provenance
database_id
Database accession number
is_extremophile
Binary variable 1 if enzyme is annotated as originating from an extremophile organism, 0 otherwise
extremophile_annotation
Additional annotation on the extremophilic nature of the organism or enzyme
mutation
Amino acid substitution (with residue numbering according to the provided sequence)
sequence
Enzyme amino acid sequence (including mutations if applicable)
doi
Digital Object Identifier of the source publication
year
Publication year
authors
Author list of the source publication
title
Title of the source publication
journal
Journal name
solvent_cas
CAS number of the organic solvent(s)
solvent_pubchem
PubChem CID of the organic solvent(s)
solvent_smiles
SMILES string(s) of the organic solvent(s)
solvent_logp
Octanol-water partition coefficient (log P) of the organic solvent(s)
solvent_chem21_safety
CHEM21 safety score of the organic solvent(s)
solvent_chem21_health
CHEM21 health score of the organic solvent(s)
solvent_chem21_environment
CHEM21 environment score of the organic solvent(s)
solvent_chem21_ranking_default
CHEM21 overall ranking of the organic solvent(s)
solvent_chem21_ranking_discussion
CHEM21 ranking discussion / notes of the organic solvent(s)
For more information about CHEM21 scores, see https://pubs.rsc.org/en/content/articlelanding/2016/gc/c5gc01008j
substrate_cas
CAS number of the substrate(s)
substrate_pubchem
PubChem CID of the substrate(s)
substrate_smiles
SMILES string of the substrate(s)
product_cas
CAS number of the product(s)
product_pubchem
PubChem CID of the product(s)
product_smiles
SMILES string of the product(s)
Build My Dataset
Select filters to export only the measurements relevant to your work. Leave any filter empty to include all values for that dimension. The live count updates as you adjust filters.