Complementarity-determining region
Complementarity-determining regions (CDRs) are hypervariable amino acid sequences located within the variable domains of immunoglobulins (antibodies) and T-cell receptors (TCRs), forming the structural basis for antigen recognition and specific binding in the adaptive immune response.[1][2] These regions exhibit high sequence diversity due to genetic recombination and somatic hypermutation, enabling the immune system to generate a vast repertoire of specificities against diverse pathogens and foreign molecules.[1] In antibodies, each consists of six CDRs—three in the heavy chain (H1, H2, H3) and three in the light chain (L1, L2, L3)—that collectively create a paratope complementary to the antigen's epitope.[3] Similarly, TCRs feature six CDRs (three each in the α and β chains), where CDR1 and CDR2 primarily interact with major histocompatibility complex (MHC) molecules, while CDR3 loops provide peptide-specific contacts.[2] The structural diversity of CDRs arises from their loop conformations, with most adopting canonical structures determined by germline sequences, except for the highly variable CDR3 regions generated by V(D)J recombination.[2] This variability is crucial not only for antigen specificity but also for additional functions, such as direct antimicrobial, antiviral, and antitumor activities independent of full antibody context.[1] Numbering schemes like Kabat, Chothia, and IMGT standardize CDR identification across sequences, aiding in antibody engineering and therapeutic design, though differences in definitions highlight ongoing refinements in structural immunology.[4] Overall, CDRs exemplify the precision of immune recognition, underpinning applications in monoclonal antibodies, CAR-T therapies, and vaccine development.Definition and Nomenclature
Definition
Complementarity-determining regions (CDRs) are hypervariable polypeptide segments located within the variable domains of immunoglobulins (antibodies) and T-cell receptors (TCRs), characterized by high sequence diversity that enables specific recognition of antigens.[5] These regions were first identified based on their elevated variability in amino acid sequences compared to other parts of the variable domains.[3] In contrast to CDRs, the intervening framework regions (FRs) exhibit greater sequence conservation and form a beta-sheet structural scaffold that supports the positioning of the CDRs.[6] Each variable domain contains three CDRs, designated CDR1, CDR2, and CDR3, which together contribute to the antigen-binding functionality. In antibodies, the paratope—the antigen-binding site—is formed by the six CDRs from the light and heavy chains (three from each), creating a complementary surface for antigen interaction.[7] This arrangement results in six CDRs per Fab fragment and twelve CDRs in a full IgG antibody molecule, which consists of two Fab arms.[8] The hypervariability of CDRs is a critical feature that underpins antigen specificity in adaptive immunity. Evolutionarily, CDRs have been conserved across immunoglobulin superfamily members as essential elements driving the diversity required for broad immune recognition.[9]Numbering Systems
The Kabat numbering system, developed in the 1970s, provides a foundational framework for identifying complementarity-determining regions (CDRs) based on sequence variability in antibody variable domains.[10] It assigns residue positions sequentially, starting from the N-terminus of the variable domain, with CDRs defined as hypervariable segments. For the light chain, CDR-L1 spans positions 24-34, CDR-L2 positions 50-56, and CDR-L3 positions 89-97; for the heavy chain, CDR-H1 covers 31-35, CDR-H2 50-65, and CDR-H3 95-102.[11] This system prioritizes regions of high sequence diversity observed in early alignments of Bence Jones proteins and myeloma light chains, facilitating the annotation of conserved framework regions (FRs) alongside variable loops.[10] The Chothia numbering scheme, introduced in 1987, refines the Kabat approach by incorporating structural data from X-ray crystallography to emphasize the conformations of CDR loops. It adjusts insertion points and boundaries to align with canonical loop structures, particularly in CDR-H1 (26-32) and CDR-H2 (52-56), while maintaining similar definitions for light chain CDRs as in Kabat (L1: 24-34, L2: 50-56, L3: 89-97) and heavy chain CDR-H3 (95-102).[11] This structural focus ensures that equivalent positions across antibodies correspond to topologically similar sites in the three-dimensional fold, aiding in the prediction of loop geometries. The IMGT numbering system, established by the International ImMunoGeneTics information system in the late 1990s and formalized in 2003, offers a unified, species-independent scheme for immunoglobulin and T-cell receptor genes. It uses a standardized alignment of variable domain sequences, with positions marked by colons for insertions (e.g., 27.1, 27.2), and defines CDRs as follows: CDR1 (27-38), CDR2 (56-65), and CDR3 (105-117) for both light and heavy chains.[11] This approach integrates sequence, structure, and genetic data, enabling consistent annotation across diverse immune repertoires. These systems differ in their foundational principles: Kabat emphasizes sequence hypervariability, Chothia prioritizes structural alignment of loops, and IMGT provides an integrative framework for genomic and proteomic databases.[11] For instance, Kabat may include more framework residues in CDRs due to its variability-based boundaries, while Chothia and IMGT better capture conformationally equivalent positions but can vary in insertion handling.[12] A comparison of CDR boundaries across the schemes is shown below:| Chain | CDR | Kabat Positions | Chothia Positions | IMGT Positions |
|---|---|---|---|---|
| Light | 1 | 24-34 | 24-34 | 27-38 |
| Light | 2 | 50-56 | 50-56 | 56-65 |
| Light | 3 | 89-97 | 89-97 | 105-117 |
| Heavy | 1 | 31-35 | 26-32 | 27-38 |
| Heavy | 2 | 50-65 | 52-56 | 56-65 |
| Heavy | 3 | 95-102 | 95-102 | 105-117 |