Enzyme Commission number

The Enzyme Commission number (EC number) is a standardized numerical identifier that classifies enzymes by the biochemical reactions they catalyze, with one number for each characterized enzyme-catalyzed reaction. Developed and maintained by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB), EC numbers consist of four numbers separated by periods (e.g., EC 1.1.1.1), where the first number denotes one of seven main classes of enzyme-catalyzed reactions, the second and third numbers specify subclasses and sub-subclasses based on the type of reaction or substrate involved, and the fourth number serves as a serial number for individual enzymes within that group.^[1]^[2] The system originated from the efforts of the first Enzyme Commission, established by the International Union of Biochemistry (IUB) in 1956 to create a systematic nomenclature amid growing discoveries of enzymes. The inaugural report, published in 1961, classified enzymes into six primary classes: oxidoreductases (EC 1), transferases (EC 2), hydrolases (EC 3), lyases (EC 4), isomerases (EC 5), and ligases (EC 6). It also introduced the EC numbering format, which reflects reaction types rather than enzyme sources or structures.^[3]^[1] This hierarchical approach ensured logical grouping, such as subclasses for oxidoreductases based on electron acceptors like NAD+ or quinones.^[2] Over time, the classification has evolved to accommodate new biochemical knowledge; notably, a seventh class, translocases (EC 7), was added in 2018 to cover enzymes that transport ions or molecules across membranes without covalent modification. 
The NC-IUBMB continues to oversee updates through periodic revisions, with proposals for new EC numbers submitted publicly and reviewed for acceptance based on experimental evidence of the catalyzed reaction.^[2]^[3] Deleted or transferred entries are noted in databases to maintain historical traceability, preventing reuse of numbers.^[1] EC numbers play a crucial role in bioinformatics and research, linking enzyme data across databases like ENZYME and BRENDA, and facilitating the annotation of protein sequences in genomics projects. As of October 2025, the official ENZYME database lists 6,919 active EC numbers, reflecting ongoing discoveries in enzymology.^[4]^[3] Each entry includes systematic and recommended (trivial) names, reaction equations, and references to support precise scientific communication.^[1]

Introduction

Definition and Purpose

The Enzyme Commission (EC) number is a four-part numerical code assigned to enzymes to classify them according to the chemical reactions they catalyze, rather than their amino acid sequence, three-dimensional structure, or biological source.^[2] This system provides a unique identifier for each distinct enzymatic activity, such as EC 1.1.1.1 for alcohol dehydrogenase, which catalyzes the oxidation of primary alcohols to aldehydes (or secondary alcohols to ketones) using NAD⁺ as an electron acceptor.^[5] The primary purpose of the EC numbering system is to establish a standardized, reaction-centered nomenclature that enables precise communication among researchers, supports the indexing of enzymes in biochemical databases, and facilitates interdisciplinary research in fields like biochemistry, genomics, and pharmacology.^[2] By focusing on the substrates transformed and products formed, it promotes a functional understanding of enzyme roles independent of evolutionary or structural similarities, allowing for consistent annotation across diverse datasets.^[6] The scope of EC numbers encompasses all known enzymes from any organism, prokaryotic or eukaryotic, primarily protein enzymes but also including some ribozymes that catalyze biochemical reactions.^[2]^[7] This classification emphasizes catalytic function over phylogenetic relatedness, ensuring broad applicability while maintaining a hierarchical structure for organizing thousands of entries.^[6]

Historical Context and Evolution

The Enzyme Commission (EC) system originated in 1956 when the International Union of Biochemistry (IUB), under the leadership of Professor Marcel Florkin, established an International Commission on Enzymes to address the growing chaos in enzyme nomenclature amid rapid discoveries in biochemistry.^[8] This initiative aimed to standardize classification based on reaction types rather than sources or trivial names, providing a systematic framework for the burgeoning field. The Commission's work culminated in its first report in 1961, which initially outlined six main classes of enzymes, assigning unique four-digit EC numbers to over 700 known activities.^[3] By 1965, the system evolved with the second report, which expanded the list of classified enzymes within the existing six primary classes—oxidoreductases, transferases, hydrolases, lyases, isomerases, and ligases—to better encompass diverse catalytic mechanisms, reflecting feedback from the biochemical community and additional enzyme characterizations.^[9] This hierarchical structure, with subclasses and further subdivisions, became the foundation for subsequent editions, published periodically in print form through the 1980s and 1990s. 
A major milestone occurred in 2018 with the addition of a seventh class, translocases (EC 7), to classify membrane transporters that move ions or molecules across barriers, addressing long-standing gaps for enzymes previously unassigned or shoehorned into existing categories.^[10] The EC system's evolution has paralleled advances in molecular biology, transitioning from print-based reports to digital maintenance starting in the early 2000s via online databases like ExplorEnz, enabling real-time updates and global access.^[11] The rise of genomics and proteomics in the late 20th and early 21st centuries prompted significant revisions, particularly for multi-substrate enzymes exhibiting promiscuity or moonlighting functions, as high-throughput sequencing revealed thousands of novel activities requiring refined classifications.^[6] As of October 2025, 6,919 EC numbers have been assigned, with the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (IUBMB) conducting periodic reviews to incorporate these insights while maintaining reaction-based consistency.^[4]

Classification System

Format and Nomenclature

The Enzyme Commission (EC) number is a four-part numerical code structured as EC a.b.c.d, where the four numbers are separated by periods and each may run to more than one digit (as in EC 3.4.21.4), providing a unique identifier for each enzyme activity based on the reaction catalyzed. The first number (a) denotes the main class of enzyme reaction, ranging from 1 to 7; the second (b) specifies the subclass within that class; the third (c) indicates the sub-subclass for further refinement; and the fourth (d) serves as a serial number to distinguish individual enzymes within the sub-subclass.^[2]^[3] Each EC number is associated with a systematic name that precisely describes the enzyme's action in terms of substrates and reaction type, such as "alcohol:NAD⁺ oxidoreductase" for EC 1.1.1.1, alongside an accepted name reflecting common usage, like "alcohol dehydrogenase." Additionally, the entry includes a reaction equation outlining the catalyzed transformation, for instance, RCH₂OH + NAD⁺ ⇌ RCHO + NADH + H⁺ for the oxidation of primary alcohols by EC 1.1.1.1. These naming conventions ensure standardized, unambiguous reference in scientific literature and databases.^[2]^[12] EC numbers are assigned to ensure uniqueness: no two distinct reactions share the same code, while different proteins that catalyze the identical reaction share a single EC number. If an entry's classification changes or becomes obsolete, the original number is retained in the database with notes indicating deletion, transfer, or replacement, but it is never reassigned to a new activity.^[2]^[3] For example, EC 3.4.21.4 corresponds to the enzyme trypsin, with the accepted name "trypsin" and a reaction specified as preferential cleavage of peptide bonds at Arg-|-Xaa and Lys-|-Xaa.^[13]
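The EC a.b.c.d format described above can be checked mechanically. The following is a minimal Python sketch, not part of any official tooling: the names `ECNumber` and `parse_ec` are hypothetical, and the parser accepts only complete four-part numbers whose main class lies in the range 1 to 7.

```python
import re
from typing import NamedTuple

# Four non-negative integers separated by periods, with an optional "EC" prefix.
_EC_PATTERN = re.compile(r"(?:EC\s*)?(\d+)\.(\d+)\.(\d+)\.(\d+)")


class ECNumber(NamedTuple):
    enzyme_class: int   # a: main class (1 to 7)
    subclass: int       # b: subclass within the class
    sub_subclass: int   # c: sub-subclass
    serial: int         # d: serial number within the sub-subclass

    def __str__(self) -> str:
        return (f"EC {self.enzyme_class}.{self.subclass}."
                f"{self.sub_subclass}.{self.serial}")


def parse_ec(text: str) -> ECNumber:
    """Parse a complete EC number such as 'EC 3.4.21.4' (illustrative helper)."""
    match = _EC_PATTERN.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"not a complete EC number: {text!r}")
    ec = ECNumber(*(int(part) for part in match.groups()))
    if not 1 <= ec.enzyme_class <= 7:
        raise ValueError(f"main class must be between 1 and 7, got {ec.enzyme_class}")
    return ec
```

For instance, `parse_ec("EC 3.4.21.4")` yields the fields 3, 4, 21 and 4, which shows why the components are better treated as numbers than as single digits.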

Hierarchical Levels

The Enzyme Commission (EC) classification system organizes enzymes into a four-tiered hierarchy based on the reactions they catalyze, ensuring precise and non-overlapping categorization. The first level, the class, represents the broad type of reaction, such as oxidoreductases (EC 1) for oxidation-reduction reactions or transferases (EC 2) for group transfer reactions.^[2] The second level, the subclass, further specifies the subgroup within the class, often indicating the particular bond, group, or component involved, such as EC 1.1 for oxidoreductases acting on the CH-OH group of donors.^[2] The third level, the sub-subclass, provides additional detail on the mechanism, substrate specificity, or cofactor used, for example, EC 1.1.1 for those using NAD(+) or NADP(+) as acceptors.^[2] Finally, the fourth level, the serial number, distinguishes individual enzymes or variants within the sub-subclass, such as EC 1.1.1.1 for alcohol dehydrogenase, which acts on primary or secondary alcohols.^[2] Enzymes are assigned to this hierarchy primarily according to the chemical reaction they catalyze, with the EC number reflecting the reaction rather than the enzyme's protein structure or identity.^[1] In cases of multi-functional enzymes that catalyze distinct reactions, multiple EC numbers may be assigned to capture different activities. 
This approach prioritizes reaction specificity to avoid overlap, grouping enzymes solely by catalytic function even if they share structural similarities.^[1] The hierarchical structure achieves high granularity, with an average of over 100 sub-subclasses distributed across the main classes to accommodate diverse reaction types, resulting in 6,919 unique EC entries as of October 2025.^[4] This subdivision ensures comprehensive coverage without redundancy, as assignments emphasize the catalyzed reaction over phylogenetic or sequence-based similarities.^[1] A key distinction in the EC system involves terminology for proenzymes (inactive precursors) and cofactors: classifications apply to the active enzyme form that "catalyzes" the reaction, whereas proenzymes or cofactors may be noted as facilitating "entry into" the reaction pathway but do not receive independent EC numbers unless they exhibit distinct catalytic activity post-activation.^[2]
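The four levels described above map directly onto prefixes of an EC number. The sketch below is an illustration under stated assumptions: `ec_lineage` and `in_group` are hypothetical helper names, and the handling of `-` placeholders follows the convention of writing incomplete numbers such as 3.1.-.- rather than any rule stated by the committee.

```python
def _parts(ec: str) -> list[str]:
    """Split 'EC 1.1.1.1' or '1.1.1.1' into its period-separated components."""
    return ec.removeprefix("EC").strip().split(".")


def ec_lineage(ec: str) -> list[str]:
    """Return class, subclass, sub-subclass and full entry for a complete EC number."""
    parts = _parts(ec)
    if len(parts) != 4:
        raise ValueError(f"expected four components, got {ec!r}")
    return [".".join(parts[:depth]) for depth in range(1, 5)]


def in_group(ec: str, group: str) -> bool:
    """True if `ec` falls under `group`; '-' placeholders in `group` are ignored."""
    group_parts = [part for part in _parts(group) if part != "-"]
    return _parts(ec)[:len(group_parts)] == group_parts
```

Comparing component by component, rather than testing a raw string prefix, keeps EC 1.11.1.1 out of subclass EC 1.1. For example, `ec_lineage("EC 1.1.1.1")` returns `["1", "1.1", "1.1.1", "1.1.1.1"]`.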

Top-Level Enzyme Classes

The Enzyme Commission (EC) classification system organizes enzymes into seven top-level classes based on the type of chemical reaction they catalyze, providing a foundational framework for understanding enzymatic function across biological systems. These classes, established by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (IUBMB), reflect the primary reaction mechanisms and have been refined over time to encompass emerging knowledge in biochemistry. The first six classes were defined in the original 1961 report, with the seventh added in 2018 to address membrane transport processes previously unclassified. Each class is further subdivided hierarchically into subclasses, sub-subclasses, and serial numbers, but the top-level categories emphasize the core reaction types.

Class 1: Oxidoreductases catalyze oxidation-reduction reactions, typically involving the transfer of electrons, hydrogen atoms, or oxygen atoms between substrates, often using cofactors like NAD+ or oxygen as acceptors. These enzymes are essential in metabolic pathways such as respiration and photosynthesis, where they facilitate energy transfer. A representative example is EC 1.1.1.1, alcohol dehydrogenase, which oxidizes primary or secondary alcohols to aldehydes or ketones using NAD+ as the acceptor.^[14]

Class 2: Transferases mediate the transfer of specific functional groups, such as methyl, acyl, amino, phosphate, or glycosyl groups, from a donor molecule to an acceptor substrate. This class plays a key role in biosynthetic processes, including phosphorylation and glycosylation. For instance, EC 2.7.1.1, hexokinase, transfers a phosphate group from ATP to D-hexose sugars, forming D-hexose 6-phosphate, a critical step in glycolysis.^[15]

Class 3: Hydrolases promote the hydrolysis of various chemical bonds, incorporating water to cleave esters, amides, glycosidic linkages, or other substrates into simpler products. These enzymes are ubiquitous in digestion, nucleic acid metabolism, and protein degradation. An example is EC 3.1.1.1, carboxylesterase, which hydrolyzes carboxylic esters to alcohols and carboxylates.^[16]

Class 4: Lyases facilitate the cleavage of chemical bonds through non-hydrolytic or non-oxidative elimination reactions, often resulting in the formation of double bonds or rings, or conversely, the addition of groups to double bonds. They are involved in pathways like the citric acid cycle and amino acid biosynthesis. EC 4.1.1.1, pyruvate decarboxylase, exemplifies this by decarboxylating 2-oxo carboxylates to aldehydes and CO2.^[17]

Class 5: Isomerases enable intramolecular rearrangements, converting a molecule into one of its isomers without altering the molecular formula, such as through racemization, epimerization, or cis-trans shifts. These enzymes support carbohydrate metabolism and stereochemical adjustments in biosynthesis. A typical example is EC 5.3.1.9, glucose-6-phosphate isomerase (also known as phosphoglucose isomerase), which interconverts α-D-glucose 6-phosphate and β-D-fructofuranose 6-phosphate in glycolysis.^[18]

Class 6: Ligases (also called synthetases) drive the formation of new chemical bonds, such as C-O, C-S, C-N, or C-C linkages, coupled to the hydrolysis of a nucleoside triphosphate like ATP. This class is vital for macromolecular synthesis, including protein and nucleic acid assembly. For example, EC 6.1.1.1, tyrosine-tRNA ligase, ligates L-tyrosine to tRNA^Tyr using ATP, producing L-tyrosyl-tRNA^Tyr in protein translation.^[19]

Class 7: Translocases catalyze the translocation of ions or molecules across membranes or between membrane sides, often generating electrochemical gradients without direct chemical modification of substrates. Introduced in 2018 to classify previously orphaned transport proteins, this class addresses the mechanistic distinctness of transporters from other enzymes. An illustrative entry is EC 7.1.1.1, proton-translocating NAD(P)+ transhydrogenase, which transfers hydride between NADP+ and NAD+ while translocating protons across the membrane.^[20]^[10]
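Because the first number alone determines the top-level class, a small lookup table is enough to label any EC number with its class name. The sketch below assumes nothing beyond the seven classes listed above; `EC_CLASSES` and `class_name` are hypothetical names, and the sample list simply reuses EC numbers mentioned in this article.

```python
from collections import Counter

# The seven top-level classes, keyed by the first number of an EC number.
EC_CLASSES = {
    1: "Oxidoreductases",
    2: "Transferases",
    3: "Hydrolases",
    4: "Lyases",
    5: "Isomerases",
    6: "Ligases",
    7: "Translocases",
}


def class_name(ec: str) -> str:
    """Return the top-level class name for an EC number such as 'EC 2.7.1.1'."""
    first = ec.removeprefix("EC").strip().split(".")[0]
    if not first.isdigit() or int(first) not in EC_CLASSES:
        raise ValueError(f"unknown EC class in {ec!r}")
    return EC_CLASSES[int(first)]


# Tally a handful of EC numbers cited in the text by their top-level class.
examples = ["1.1.1.1", "2.7.1.1", "3.1.1.1", "3.4.21.4",
            "4.1.1.1", "5.3.1.9", "6.1.1.1", "7.1.1.1"]
tally = Counter(class_name(ec) for ec in examples)
```

Here `tally["Hydrolases"]` is 2, since both carboxylesterase (EC 3.1.1.1) and trypsin (EC 3.4.21.4) belong to class 3.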

Assignment and Grouping Criteria

Reaction-Based Classification

The Enzyme Commission (EC) classification system fundamentally prioritizes the overall chemical reaction catalyzed by an enzyme—specifically, the transformation of substrates into products—over considerations of the enzyme's three-dimensional structure, amino acid sequence, or the biological source organism. This reaction-centric approach groups enzymes that perform analogous chemical transformations, promoting a functional taxonomy that reflects biochemical mechanisms rather than evolutionary relatedness. The International Union of Biochemistry and Molecular Biology (IUBMB) establishes standardized, recommended reaction equations for each EC entry to precisely delineate the catalyzed process, ensuring consistency across diverse enzymatic activities.^[3]^[21] Enzymes are classified into hierarchical groups based on the chemical similarity of their reactions, with subclasses defined by shared mechanistic features such as the type of bond formed or broken. For instance, all enzymes facilitating the transfer of electrons from a donor to an acceptor fall under EC 1 (oxidoreductases), and within that, EC 1.1 encompasses those acting on the CH-OH group of donors. This grouping incorporates cofactors or coenzymes only if they participate directly in the reaction stoichiometry, while excluding kinetic or regulatory details that vary between enzymes. By focusing on net reaction outcomes, the system avoids erroneous structural assumptions, recognizing that enzymes catalyzing identical reactions may lack sequence or fold homology due to convergent evolution.^[1]^[3] For enzymes exhibiting multiple catalytic activities, the primary EC number is designated according to the reaction representing the enzyme's predominant physiological role or the activity for which it was initially identified. Secondary activities receive additional EC assignments, allowing comprehensive annotation without conflating distinct functions. 
When scientific advancements necessitate reclassification—such as due to refined mechanistic understanding—superseded EC numbers are designated as "transferred entries," which redirect to the updated classification while preserving historical references.^[21]^[22] Reaction schemes are typically generalized to illustrate core principles within subclasses, emphasizing the chemical logic over specific molecular identities. A representative example for oxidoreductases is the generic redox transformation:

\ce{A(reduced) + B(oxidized) -> A(oxidized) + B(reduced)}

This notation captures the essential electron transfer without implying structural constraints, reinforcing the classification's independence from protein architecture.^[1] These reaction archetypes underpin the seven top-level EC classes, from EC 1 oxidoreductases to EC 7 translocases.^[2]
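The handling of transferred and deleted entries described in this section can be pictured as following redirects in a lookup table. The sketch below is purely illustrative: the record layout (`status`, `transferred_to`) and the function `resolve_ec` are assumptions, not the format of any real database, and the identifiers use a non-existent class 9 so that they cannot be mistaken for real entries. It also simplifies by assuming each transferred entry points to exactly one successor.

```python
from typing import Optional

# Placeholder records; "9.9.9.x" identifiers are deliberately outside classes 1 to 7.
registry = {
    "9.9.9.1": {"status": "transferred", "transferred_to": "9.9.9.2"},
    "9.9.9.2": {"status": "transferred", "transferred_to": "9.9.9.3"},
    "9.9.9.3": {"status": "active"},
    "9.9.9.4": {"status": "deleted"},
}


def resolve_ec(ec: str, records: dict[str, dict]) -> Optional[str]:
    """Follow 'transferred' redirects to the current entry; None if it was deleted."""
    seen = set()
    while True:
        if ec in seen:
            raise ValueError(f"redirect cycle involving {ec}")
        seen.add(ec)
        record = records[ec]
        if record["status"] == "active":
            return ec
        if record["status"] == "deleted":
            return None
        ec = record["transferred_to"]
```

With these placeholder records, `resolve_ec("9.9.9.1", registry)` follows two redirects to `"9.9.9.3"`, while the old identifiers stay in the table and are never reused.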

Similarity and Database Integration

In practice, sequence similarity tools such as BLAST are employed to predict EC numbers for novel enzymes by aligning query sequences against databases of known enzymes, achieving high accuracy for homologs with at least 40% identity in predicting the first three EC digits.^[23] However, these predictions require validation through experimental confirmation of the catalyzed reaction to ensure alignment with the reaction-based classification principles, as sequence similarity alone may not capture functional divergence.^[24] This approach addresses limitations in classifying non-homologous enzymes, where structural or motif-based methods supplement sequence analysis to infer function.^[25] EC numbers serve as standardized identifiers that facilitate integration across major bioinformatics databases, enabling cross-referencing for enzyme annotation and functional analysis. In BRENDA, the comprehensive enzyme information system, EC numbers link to curated data on approximately 112,000 enzymes, including kinetic parameters and organism-specific details derived from literature (as of 2025).^[26] The KEGG ENZYME database implements the full EC nomenclature, mapping enzymes to metabolic pathways and reactions for systems biology applications.^[27] Similarly, UniProt uses EC numbers in the catalytic activity section of protein entries, providing evidence-based annotations for reviewed sequences and supporting automated propagation to unreviewed ones, with a high percentage of enzyme-annotated entries in the Swiss-Prot subset having complete EC numbers.^[28] These integrations support applications in genomics, such as predicting functions for orphan genes (ORFans) lacking close homologs by inferring EC numbers from partial sequence or domain matches. 
Tools like EFICAz automate EC prediction by combining sequence similarity searches with PROSITE pattern matching and metabolic context, achieving high precision on large-scale datasets for pathway reconstruction.^[29] The official ENZYME database lists 6,919 active EC numbers as of October 2025.^[4]
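The similarity-based transfer described above can be reduced to a simple rule: if the best database hit meets an identity threshold, copy the first three numbers of its EC annotation and leave the serial number open. The sketch below is a toy under stated assumptions and does not run BLAST or reproduce EFICAz: `transfer_ec` is a hypothetical function, the hits are assumed to be already computed as (percent identity, EC number) pairs, and the 40% default is taken from the figure quoted in this section.

```python
from typing import Optional


def transfer_ec(hits: list[tuple[float, str]],
                min_identity: float = 40.0) -> Optional[str]:
    """Propose a three-level EC annotation from precomputed similarity hits.

    `hits` holds (percent_identity, ec_number) pairs for database matches.
    Returns e.g. '1.1.1.-', or None when no hit reaches the threshold.
    """
    eligible = [hit for hit in hits if hit[0] >= min_identity]
    if not eligible:
        return None
    # Use the most similar annotated hit as the source of the annotation.
    _, best_ec = max(eligible, key=lambda hit: hit[0])
    class_, subclass, sub_subclass, _serial = best_ec.split(".")
    return f"{class_}.{subclass}.{sub_subclass}.-"
```

Given invented hits such as `[(62.5, "1.1.1.1"), (35.0, "2.7.1.1")]`, the function returns `"1.1.1.-"`. Any such proposal remains a prediction until the catalyzed reaction is confirmed experimentally, as the text notes.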

Maintenance and Applications

Management by the Nomenclature Committee

The Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB) serves as the primary organizational body overseeing the assignment, classification, and maintenance of Enzyme Commission (EC) numbers, ensuring a standardized system for enzyme nomenclature worldwide.^[30] Operating under the IUBMB, the committee comprises approximately 12 expert members selected for their specialized knowledge in biochemistry and enzymology, with elections conducted to promote diversity in gender, geography, and career stage; members typically serve three-year terms to provide continuity in decision-making.^[2] This structure reflects the committee's evolution from its foundational role in the 1956 International Commission on Enzymes, maintaining consistent oversight amid advances in biochemical research.^[8]

Proposals for new EC numbers are submitted electronically through dedicated online forms available on the official IUBMB enzyme database platforms, allowing researchers to provide detailed evidence of novel enzymatic reactions, including biochemical data and references.^[31] The NC-IUBMB reviews these submissions rigorously, evaluating the uniqueness of the catalyzed reaction, its alignment with existing classifications, and supporting experimental validation, often incorporating input from collaborating experts during the process.^[32] Approved assignments are documented in annual reports published within the Enzyme Nomenclature recommendations, which detail additions, revisions, and rationale for classifications to foster transparency and scientific consensus.^[22]

In terms of ongoing oversight, the committee maintains the authoritative EC number list hosted at enzyme.expasy.org, a comprehensive repository synchronized with the IUBMB's ExplorEnz database for global accessibility.^[4] It actively manages user queries, processes correction requests for inaccuracies in nomenclature or classification, and handles deprecations for obsolete entries through the same digital submission channels, ensuring the system's accuracy and relevance.^[31] Furthermore, the NC-IUBMB collaborates closely with major sequence and enzyme databases, such as BRENDA, to integrate structural and functional data, promoting consistency across bioinformatics resources.^[33] Since the implementation of formalized digital submission systems, these processes have been streamlined, and in recent years the committee has assigned approximately 80-100 new EC numbers per year, reflecting the pace of enzymatic discoveries in molecular biology.^[2]^[6]

Updates, Revisions, and Modern Usage

The Enzyme Commission (EC) system undergoes regular revisions through a structured process managed by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (IUBMB), involving validation of new enzyme proposals, public review periods, and periodic supplements to address inconsistencies or reclassify entries for better alignment with emerging biochemical knowledge.^[2] For instance, when new discoveries reveal functional overlaps, similar entries may be merged or renumbered, as seen in updates to hydrolase classifications to incorporate novel nucleases.^[9] This process has responded to breakthroughs like CRISPR-associated enzymes, with Cas9 assigned provisional EC 3.1.-.- based on its endonucleolytic activity, enabling integration into the broader classification framework.^[34] In the 2020s, modern updates have expanded the EC system to accommodate data from high-throughput sequencing and engineering efforts, including the incorporation of metagenomic sequences to assign numbers to enzymes from uncultured microbes, which previously lacked characterization.^[35] Expansions for synthetic biology have included EC assignments for engineered catalysts, such as modified oxidoreductases designed for biofuel production, reflecting the system's adaptability to non-natural variants.^[36] These revisions are supported by computational tools that predict and validate EC numbers, ensuring the hierarchy remains relevant amid rapid advancements in genome editing and protein design. 
The EC system plays a central role in contemporary biochemical research, particularly in metabolic modeling where it standardizes reactions for techniques like Flux Balance Analysis (FBA), allowing simulation of cellular pathways in organisms from bacteria to humans.^[37] In drug design, targeting specific EC classes—such as proteases (EC 3.4)—facilitates the development of inhibitors for diseases like cancer, by mapping enzyme activities to therapeutic interventions.^[38] AI-driven predictions further enhance its utility, using machine learning to annotate EC numbers for novel proteins, accelerating functional genomics and addressing gaps in experimental data.^[39] However, the system faces critiques for incomplete coverage of regulatory mechanisms, such as allosteric effects that modulate enzyme activity beyond reaction type, and for not fully accounting for directionality in reversible reactions, which can complicate kinetic modeling.^[40]^[41] As of October 2025, there are 6,919 active EC numbers, with ongoing efforts to develop provisional schemes for non-protein catalysts like ribozymes, which catalyze reactions such as self-splicing but fall outside traditional protein-focused classifications.^[4] These initiatives aim to broaden the system's scope while maintaining its reaction-centric foundation, often integrating with databases like KEGG for pathway updates.^[42]