Fact-checked by Grok 2 weeks ago

Enzyme Commission number

The Enzyme Commission number (EC number) is a standardized numerical classification system used to identify enzymes based on the biochemical reactions they catalyze, providing a for each characterized enzyme-catalyzed reaction. Developed and maintained by the Nomenclature Committee of the International Union of Biochemistry and (NC-IUBMB), EC numbers consist of four digits separated by periods (e.g., EC 1.1.1.1), where the first digit denotes one of seven main classes of enzyme-catalyzed reactions, the second and third digits specify subclasses and sub-subclasses based on the type of reaction or substrate involved, and the fourth digit serves as a for individual enzymes within that group. The system originated from the efforts of the first Enzyme Commission, established by the International Union of Biochemistry (IUB) in 1956 to create a systematic nomenclature amid growing discoveries of enzymes. The inaugural report, published in 1961, classified enzymes into six primary classes—oxidoreductases (EC 1), transferases (EC 2), hydrolases (EC 3), lyases (EC 4), isomerases (EC 5), and ligases (EC 6)—with the EC numbering format introduced to reflect reaction types rather than enzyme sources or structures. This hierarchical approach ensured logical grouping, such as subclasses for oxidoreductases based on electron acceptors like NAD+ or quinones. Over time, the classification has evolved to accommodate new biochemical knowledge; notably, a seventh class, translocases (EC 7), was added in to cover enzymes that transport ions or molecules across membranes without covalent modification. The NC-IUBMB continues to oversee updates through periodic revisions, with proposals for new EC numbers submitted publicly and reviewed for acceptance based on experimental evidence of the catalyzed reaction. Deleted or transferred entries are noted in to maintain historical , preventing reuse of numbers. EC numbers play a crucial role in bioinformatics and research, linking enzyme data across databases like and , and facilitating the annotation of protein sequences in projects. As of October 2025, the official database lists 6,919 active EC numbers, reflecting ongoing discoveries in enzymology. Each entry includes systematic and recommended (trivial) names, reaction equations, and references to support precise scientific communication.

Introduction

Definition and Purpose

The Enzyme Commission (EC) number is a four-digit numerical code assigned to enzymes to classify them according to the chemical reactions they catalyze, rather than their sequence, three-dimensional structure, or biological source. This system provides a for each distinct enzymatic activity, such as for , which catalyzes the oxidation of primary alcohols to aldehydes (or secondary alcohols to ketones) using NAD⁺ as an . The primary purpose of the EC numbering system is to establish a standardized, reaction-centered that enables precise communication among researchers, supports the indexing of enzymes in biochemical databases, and facilitates interdisciplinary research in fields like biochemistry, , and . By focusing on the substrates transformed and products formed, it promotes a functional understanding of roles independent of evolutionary or structural similarities, allowing for consistent annotation across diverse datasets. The scope of EC numbers encompasses all known from any organism, prokaryotic or eukaryotic, primarily protein enzymes but also including some ribozymes that catalyze biochemical reactions. This emphasizes catalytic function over phylogenetic relatedness, ensuring broad applicability while maintaining a hierarchical structure for organizing thousands of entries.

Historical Context and Evolution

The Enzyme Commission (EC) system originated in 1956 when the International Union of Biochemistry (IUB), under the leadership of Professor Marcel Florkin, established an International Commission on Enzymes to address the growing chaos in enzyme nomenclature amid rapid discoveries in biochemistry. This initiative aimed to standardize classification based on reaction types rather than sources or trivial names, providing a systematic framework for the burgeoning field. The Commission's work culminated in its first report in 1961, which initially outlined six main classes of enzymes, assigning unique four-digit EC numbers to over 700 known activities. By , the system evolved with the second report, which expanded the list of classified enzymes within the existing six primary classes—oxidoreductases, transferases, hydrolases, lyases, isomerases, and ligases—to better encompass diverse catalytic mechanisms, reflecting feedback from the biochemical community and additional characterizations. This hierarchical structure, with subclasses and further subdivisions, became the foundation for subsequent editions, published periodically in print form through the and . A major milestone occurred in 2018 with the addition of a seventh class, translocases (EC 7), to classify transporters that move ions or molecules across barriers, addressing long-standing gaps for enzymes previously unassigned or shoehorned into existing categories. The EC system's evolution has paralleled advances in , transitioning from print-based reports to digital maintenance starting in the early via online databases like ExplorEnz, enabling real-time updates and global access. The rise of and in the late 20th and early 21st centuries prompted significant revisions, particularly for multi-substrate enzymes exhibiting or functions, as high-throughput sequencing revealed thousands of novel activities requiring refined classifications. As of October 2025, 6,919 EC numbers have been assigned, with the Committee of the International Union of Biochemistry and (IUBMB) conducting periodic reviews to incorporate these insights while maintaining reaction-based consistency.

Classification System

Format and Nomenclature

The Enzyme Commission (EC) number is a four-digit numerical code structured as EC a.b.c.d, where the digits are separated by periods, providing a for each based on the reaction it catalyzes. The first digit (a) denotes the main class of reaction, ranging from 1 to 7; the second (b) specifies the subclass within that class; the third (c) indicates the sub-subclass for further refinement; and the fourth (d) serves as a to distinguish individual enzymes within the sub-subclass. Each EC number is associated with a systematic name that precisely describes the enzyme's action in terms of substrates and reaction type, such as ":NAD⁺ " for EC 1.1.1.1, alongside an accepted name reflecting common usage, like "." Additionally, the entry includes a equation outlining the catalyzed transformation, for instance, RCH₂OH + NAD⁺ ⇌ RCHO + NADH + H⁺ for the oxidation of primary alcohols by EC 1.1.1.1. These naming conventions ensure standardized, unambiguous reference in and databases. EC numbers are assigned to ensure uniqueness, such that no two enzymes catalyzing identical reactions share the same code; if an enzyme's classification changes or becomes obsolete, the original number is retained in the database with notes indicating deletion, transfer, or replacement, but it is never reassigned to a new activity. For example, EC 3.4.21.4 corresponds to the trypsin, with the accepted name "" and a reaction specified as preferential of bonds at Arg-|-Xaa and Lys-|-Xaa.

Hierarchical Levels

The Enzyme Commission (EC) classification system organizes enzymes into a four-tiered based on the they catalyze, ensuring precise and non-overlapping . The first level, the , represents the broad type of , such as oxidoreductases (EC 1) for oxidation-reduction reactions or transferases (EC 2) for group transfer reactions. The second level, the subclass, further specifies the subgroup within the class, often indicating the particular bond, group, or component involved, such as EC 1.1 for oxidoreductases acting on the CH-OH group of donors. The third level, the sub-subclass, provides additional detail on the mechanism, substrate specificity, or cofactor used, for example, EC 1.1.1 for those using NAD(+) or NADP(+) as acceptors. Finally, the fourth level, the serial number, distinguishes individual enzymes or variants within the sub-subclass, such as EC 1.1.1.1 for specifically acting on primary alcohols. Enzymes are assigned to this hierarchy primarily according to the they catalyze, with the EC number reflecting the reaction rather than the enzyme's or identity. In cases of multi-functional enzymes that catalyze distinct reactions, multiple EC numbers may be assigned to capture different activities. This approach prioritizes reaction specificity to avoid overlap, grouping enzymes solely by catalytic function even if they share structural similarities. The hierarchical structure achieves high , with an average of over 100 sub-subclasses distributed across the main classes to accommodate diverse types, resulting in 6,919 unique entries as of October 2025. This subdivision ensures comprehensive coverage without redundancy, as assignments emphasize the catalyzed over phylogenetic or sequence-based similarities. A key distinction in the EC system involves terminology for proenzymes (inactive precursors) and cofactors: classifications apply to the active form that "catalyzes" the , whereas proenzymes or cofactors may be noted as facilitating "entry into" the reaction pathway but do not receive independent EC numbers unless they exhibit distinct catalytic activity post-activation.

Top-Level Enzyme Classes

The Enzyme Commission (EC) classification system organizes into seven top-level classes based on the type of they catalyze, providing a foundational framework for understanding enzymatic function across biological systems. These classes, established by the Nomenclature Committee of the International Union of Biochemistry and (IUBMB), reflect the primary reaction mechanisms and have been refined over time to encompass emerging knowledge in biochemistry. The first six classes were defined in the original 1961 report, with the seventh added in 2018 to address processes previously unclassified. Each class is further subdivided hierarchically into subclasses, sub-subclasses, and serial numbers, but the top-level categories emphasize the core reaction types. Class 1: Oxidoreductases catalyze oxidation-reduction reactions, typically involving the transfer of electrons, hydrogen atoms, or oxygen atoms between substrates, often using cofactors like NAD+ or oxygen as acceptors. These enzymes are essential in metabolic pathways such as and , where they facilitate energy transfer. A representative example is EC 1.1.1.1, , which oxidizes primary or secondary alcohols to aldehydes or ketones using NAD+ as the acceptor. Class 2: Transferases mediate the transfer of specific functional groups, such as methyl, acyl, amino, , or groups, from a donor to an acceptor . This class plays a key role in biosynthetic processes, including and . For instance, EC 2.7.1.1, , transfers a group from ATP to D-hexose sugars, forming D-hexose 6-phosphate, a critical step in . Class 3: Hydrolases promote the of various chemical bonds, incorporating to cleave esters, amides, glycosidic linkages, or other substrates into simpler products. These enzymes are ubiquitous in , , and protein degradation. An example is EC 3.1.1.1, carboxylesterase, which hydrolyzes carboxylic esters to alcohols and carboxylates. Class 4: Lyases facilitate the cleavage of chemical bonds through non-hydrolytic or non-oxidative elimination reactions, often resulting in the formation of double bonds or rings, or conversely, the addition of groups to double bonds. They are involved in pathways like the and biosynthesis. EC 4.1.1.1, pyruvate decarboxylase, exemplifies this by decarboxylating 2-oxo carboxylates to aldehydes and CO2. Class 5: Isomerases enable intramolecular rearrangements, converting a into one of its isomers without altering the molecular formula, such as through , epimerization, or cis-trans shifts. These enzymes support and stereochemical adjustments in . A typical example is 5.3.1.9, (also known as phosphoglucose isomerase), which interconverts α-D-glucose 6-phosphate and β-D-fructofuranose 6-phosphate in . Class 6: Ligases (also called synthetases) drive the formation of new chemical bonds, such as C-O, C-S, C-N, or C-C linkages, coupled to the of a like ATP. This class is vital for macromolecular synthesis, including protein and assembly. For example, EC 6.1.1.1, —tRNA ligase, ligates L-tyrosine to tRNA^Tyr using ATP, producing L-tyrosyl-tRNA^Tyr in protein . Class 7: Translocases catalyze the translocation of ions or molecules across membranes or between membrane sides, often generating electrochemical gradients without direct chemical modification of substrates. Introduced in to classify previously orphaned transport proteins, this class addresses the mechanistic distinctness of transporters from other enzymes. An illustrative entry is EC 7.1.1.1, proton-translocating NAD(P)+ transhydrogenase, which transfers between NADP+ and NAD+ while translocating protons across the membrane.

Assignment and Grouping Criteria

Reaction-Based Classification

The Enzyme Commission (EC) classification system fundamentally prioritizes the overall catalyzed by an enzyme—specifically, the transformation of substrates into products—over considerations of the enzyme's three-dimensional structure, sequence, or the biological source . This reaction-centric approach groups enzymes that perform analogous chemical transformations, promoting a functional that reflects biochemical mechanisms rather than evolutionary relatedness. The International Union of Biochemistry and (IUBMB) establishes standardized, recommended reaction equations for each EC entry to precisely delineate the catalyzed process, ensuring consistency across diverse enzymatic activities. Enzymes are classified into hierarchical groups based on the of their reactions, with subclasses defined by shared mechanistic features such as the type of formed or broken. For instance, all enzymes facilitating the transfer of electrons from a donor to an acceptor fall under EC 1 (oxidoreductases), and within that, EC 1.1 encompasses those acting on the CH-OH group of donors. This grouping incorporates cofactors or coenzymes only if they participate directly in the reaction , while excluding kinetic or regulatory details that vary between enzymes. By focusing on net reaction outcomes, the system avoids erroneous structural assumptions, recognizing that enzymes catalyzing identical reactions may lack or fold homology due to . For enzymes exhibiting multiple catalytic activities, the primary EC number is designated according to the reaction representing the enzyme's predominant physiological role or the activity for which it was initially identified. Secondary activities receive additional EC assignments, allowing comprehensive annotation without conflating distinct functions. When scientific advancements necessitate reclassification—such as due to refined mechanistic understanding—superseded EC numbers are designated as "transferred entries," which redirect to the updated while preserving historical references. Reaction schemes are typically generalized to illustrate core principles within subclasses, emphasizing the chemical logic over specific molecular identities. A representative example for oxidoreductases is the generic transformation: \ce{A(reduced) + B(oxidized) -> A(oxidized) + B(reduced)} This notation captures the essential without implying structural constraints, reinforcing the classification's independence from protein architecture. These reaction archetypes underpin the seven top-level classes, from EC 1 oxidoreductases to EC 7 translocases.

Similarity and Database Integration

In practice, sequence similarity tools such as are employed to predict EC numbers for novel enzymes by aligning query sequences against databases of known enzymes, achieving high accuracy for homologs with at least 40% identity in predicting the first three EC digits. However, these predictions require validation through experimental confirmation of the catalyzed reaction to ensure alignment with the reaction-based principles, as sequence similarity alone may not capture functional divergence. This approach addresses limitations in classifying non-homologous enzymes, where structural or motif-based methods supplement to infer function. EC numbers serve as standardized identifiers that facilitate integration across major bioinformatics databases, enabling cross-referencing for enzyme annotation and functional analysis. In , the comprehensive , EC numbers link to curated data on approximately 112,000 enzymes, including kinetic parameters and organism-specific details derived from literature (as of 2025). The database implements the full EC nomenclature, mapping enzymes to metabolic pathways and reactions for applications. Similarly, uses EC numbers in the catalytic activity section of protein entries, providing evidence-based annotations for reviewed sequences and supporting automated propagation to unreviewed ones, with a high of enzyme-annotated entries in the Swiss-Prot subset having complete EC numbers. These integrations support applications in , such as predicting functions for orphan genes (ORFans) lacking close homologs by inferring numbers from partial sequence or domain matches. Tools like EFICAz automate prediction by combining sequence similarity searches with pattern matching and metabolic context, achieving high precision on large-scale datasets for pathway reconstruction. The official database lists 6,919 active numbers as of October 2025.

Maintenance and Applications

Management by the Nomenclature Committee

The Enzyme Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB) serves as the primary organizational body overseeing the assignment, classification, and maintenance of Enzyme Commission (EC) numbers, ensuring a standardized system for enzyme nomenclature worldwide. Operating under the IUBMB, the committee comprises approximately 12 expert members selected for their specialized knowledge in biochemistry and enzymology, with elections conducted to promote diversity in gender, geography, and career stage; members typically serve three-year terms to provide continuity in decision-making. This structure reflects the committee's evolution from its foundational role in the 1956 International Commission on Enzymes, maintaining consistent oversight amid advances in biochemical research. Proposals for new EC numbers are submitted electronically through dedicated online forms available on the official IUBMB enzyme database platforms, allowing researchers to provide detailed of enzymatic reactions, including biochemical and . The NC-IUBMB reviews these submissions rigorously, evaluating the uniqueness of the catalyzed reaction, its alignment with existing classifications, and supporting experimental validation, often incorporating input from collaborating experts during the process. Approved assignments are documented in annual reports published within the Enzyme Nomenclature recommendations, which detail additions, revisions, and rationale for classifications to foster transparency and . In terms of ongoing oversight, the maintains the authoritative number list hosted at enzyme.expasy.org, a comprehensive repository synchronized with the IUBMB's ExplorEnz database for global accessibility. It actively manages user queries, processes correction requests for inaccuracies in or classification, and handles deprecations for obsolete entries through the same digital submission channels, ensuring the system's accuracy and relevance. Furthermore, the NC-IUBMB collaborates closely with major sequence and enzyme databases, such as , to integrate structural and functional data, promoting consistency across bioinformatics resources. Since the implementation of formalized digital submission systems, these processes have been streamlined, enabling the to assign approximately 80-100 new numbers each year in recent years, reflecting the pace of enzymatic discoveries in .

Updates, Revisions, and Modern Usage

The system undergoes regular revisions through a structured process managed by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (IUBMB), involving validation of new enzyme proposals, public review periods, and periodic supplements to address inconsistencies or reclassify entries for better alignment with emerging biochemical knowledge. For instance, when new discoveries reveal functional overlaps, similar entries may be merged or renumbered, as seen in updates to classifications to incorporate novel nucleases. This process has responded to breakthroughs like CRISPR-associated enzymes, with assigned provisional EC 3.1.-.- based on its endonucleolytic activity, enabling integration into the broader classification framework. In the 2020s, modern updates have expanded the EC system to accommodate data from high-throughput sequencing and engineering efforts, including the incorporation of metagenomic sequences to assign numbers to enzymes from uncultured microbes, which previously lacked characterization. Expansions for synthetic biology have included EC assignments for engineered catalysts, such as modified oxidoreductases designed for biofuel production, reflecting the system's adaptability to non-natural variants. These revisions are supported by computational tools that predict and validate EC numbers, ensuring the hierarchy remains relevant amid rapid advancements in genome editing and protein design. The system plays a central role in contemporary biochemical research, particularly in metabolic modeling where it standardizes reactions for techniques like (FBA), allowing simulation of cellular pathways in organisms from to humans. In drug design, targeting specific EC classes—such as proteases (EC 3.4)—facilitates the development of inhibitors for diseases like cancer, by mapping activities to therapeutic interventions. further enhance its utility, using to annotate EC numbers for novel proteins, accelerating and addressing gaps in experimental data. However, the system faces critiques for incomplete coverage of regulatory mechanisms, such as allosteric effects that modulate activity beyond reaction type, and for not fully accounting for directionality in reversible reactions, which can complicate kinetic modeling. As of October 2025, there are 6,919 active EC numbers, with ongoing efforts to develop provisional schemes for non-protein catalysts like ribozymes, which catalyze reactions such as self-splicing but fall outside traditional protein-focused classifications. These initiatives aim to broaden the system's scope while maintaining its reaction-centric foundation, often integrating with databases like for pathway updates.