Fact-checked by Grok 2 weeks ago

Epitope

An epitope, also known as an antigenic determinant, is the specific region on an —a capable of eliciting an —that is recognized and bound by components such as , B-cell receptors, or T-cell receptors, thereby triggering humoral or cellular immunity. These sites are typically small, consisting of 5–8 residues for protein antigens or 1–6 monosaccharides for antigens, and must be accessible on the antigen's surface for effective binding. Epitopes play a central role in adaptive immunity by enabling the to distinguish self from non-self , such as pathogens or foreign proteins, and are essential for processes like production and antigen clearance. Epitopes are broadly classified into two main types based on their structural nature: linear (continuous) epitopes, which consist of a sequential stretch of along the antigen's primary , and conformational (discontinuous) epitopes, which depend on the three-dimensional folding of the antigen and involve brought into proximity by . Linear epitopes are often 9–12 residues long and are more stable to denaturation, while conformational epitopes, typically 15–22 residues, predominate in native proteins and account for about 90% of B-cell epitopes. Additionally, epitopes are categorized by the immune receptor they engage: B-cell epitopes are recognized directly by antibodies or B-cell receptors to stimulate humoral responses, whereas T-cell epitopes are presented by (MHC) molecules to T-cell receptors, divided into MHC class I (for CD8+ T cells, focusing on intracellular pathogens) and MHC class II (for CD4+ T cells, aiding broader coordination). The identification and characterization of epitopes are fundamental to , underpinning design, diagnostic assays, and therapeutic development by targeting immunodominant sites that elicit protective responses. For instance, techniques, including experimental methods like and cryo-electron microscopy alongside predictions, help predict and avoid unwanted responses in biologics. Variations in epitope accessibility, influenced by factors like antigen denaturation or host (e.g., MHC alleles), can modulate immune efficacy, highlighting their dynamic role in disease and immunity.

Definition and Fundamentals

Definition

An epitope, also known as an antigenic determinant, is defined as the specific portion of an molecule that is recognized and bound by the of an or by a . For B-cell recognition, linear epitopes typically comprise 5 to 17 in continuous sequences, while T-cell epitopes are shorter peptides of 8–11 residues for or 13–25 for ; discontinuous epitopes involve spatially proximate residues from non-adjacent parts of the polypeptide chain that form a functional in the native . While often described for protein antigens, epitopes can also occur on carbohydrates (typically 1–6 monosaccharides), , or other molecules. The concept of the epitope emerged from early studies on antigen-antibody interactions, with the term first introduced by immunologist Niels Kaj Jerne in his 1960 paper "Immunological Speculations," where he described it as the surface feature of an capable of eliciting a specific . The of an epitope is influenced by several key physicochemical properties, including its surface to immune molecules, hydrophilicity that promotes solvent exposure, and flexibility allowing conformational adaptation during binding. These characteristics ensure that epitopes are positioned on the antigen's exterior and can form stable, specific interactions with immune receptors, distinguishing them from non-immunogenic regions of the same molecule. For instance, in the model protein , monoclonal antibodies such as HyHEL-10 target epitopes involving critical residues like Arg21, Asp101, and Tyr53, which contribute to the binding interface through hydrogen bonding and van der Waals contacts. While an epitope represents a targeted site on an , it is distinct from the full , which encompasses the entire immunogenic entity capable of provoking a broader .

Relation to Antigens and Antibodies

An epitope represents a discrete subset of an , serving as the specific molecular region recognized by components of the . Antigens are typically large macromolecules, such as proteins or , that contain one or more such epitopes capable of eliciting an . In contrast, haptens are small molecules, like certain chemicals or peptides, that lack inherent on their own but acquire antigenic properties when covalently attached to a larger protein, which provides the necessary structural context and T-cell epitopes for immune activation. This conjugation transforms the hapten into an epitope within the composite , enabling B-cell recognition and production. The functional relationship between epitopes and antibodies centers on the precise interaction between the epitope and the —the antigen-binding site located in the variable regions of the antibody's fragment or on T-cell receptors. This occurs through non-covalent forces, including bonds formed by polar residues like serine and threonine, van der Waals interactions dominated by aromatic side chains such as and , and electrostatic attractions that facilitate initial orientation over distances up to several nanometers. These interactions collectively bury a significant surface area (often ~1600 Ų) at the , expelling molecules and stabilizing the without covalent linkages. The strength of epitope-paratope binding is quantified by , defined as the (K_d), which measures the equilibrium between associated and dissociated states; lower K_d values indicate higher . For monoclonal antibodies, typical K_d values range from 20 to 300 against epitopes, reflecting strong monovalent interactions driven by cumulative weak forces totaling ~12 kcal/ in . , the cumulative binding strength from multiple epitope-paratope engagements (e.g., in IgM or multimeric antigens), amplifies this by orders of magnitude through cooperative effects, enhancing overall immune efficacy. This relational dynamic underpins immune specificity, analogous to the lock-and-key model, wherein the epitope's three-dimensional structure complements the 's and chemical properties for a rigid, highly selective fit with minimal conformational adjustment upon binding. Aromatic residues in the often form the core "lock," interacting with backbone and side-chain atoms on the epitope to discriminate against non-cognate structures, while surrounding hydrophilic contacts fine-tune discrimination. Such precision ensures targeted immune responses while avoiding off-target effects.

Types of Epitopes

Linear Epitopes

Linear epitopes, also known as sequential epitopes, consist of continuous stretches of , typically 5 to 15 residues long, derived from the primary structure of a protein . These epitopes are recognized by antibodies based solely on their linear sequence, without reliance on the protein's folded conformation. A key property of linear epitopes is their resistance to denaturation; unlike conformational epitopes, they remain intact and accessible even when the protein is unfolded by heat, changes, or chemical agents, making them detectable in denatured samples. Linear epitopes constitute approximately 10% of all B-cell epitopes, though they are more prevalent in certain antigens such as bacterial toxins and viral proteins. For instance, in bacterial toxins, a linear epitope spanning residues 40-62 of the epsilon toxin from Clostridium perfringens has been identified as immunogenic when fused with cholera toxin B subunit for vaccine development. In viral proteins, a well-characterized example is the linear epitope within amino acids 93-104 of the VP1 capsid protein of poliovirus type 1, which is recognized by neutralizing monoclonal antibodies and contributes to antiviral immunity. These examples highlight how linear epitopes in pathogens can elicit protective antibody responses, particularly in exposed or linear regions of the antigen. One advantage of linear epitopes is their ease of synthesis as short peptides, which facilitates their use in immunological assays and design without the need for complex . However, a disadvantage is that synthetic linear peptides may not fully mimic the native antigenic context, potentially leading to reduced or specificity compared to epitopes in the intact . In contrast, conformational epitopes require the protein's three-dimensional fold for recognition and are disrupted under denaturing conditions. Experimental evidence for linear epitopes often comes from techniques involving synthetic peptides, such as where overlapping peptides are screened for binding, confirming sequential recognition. Similarly, peptide scanning methods, which use arrays of systematically overlapping peptides covering the sequence, have identified linear epitopes in viral capsid proteins by detecting specific interactions. These approaches provide direct validation of the continuous nature of such epitopes in various pathogens.

Conformational Epitopes

Conformational epitopes consist of discontinuous residues that are spatially adjacent only in the three-dimensional structure of the native , formed by rather than contiguous sequences in the primary chain. These epitopes typically involve clusters of 10 to 20 or more brought together in close proximity, often exceeding 15 residues due to their reliance on or folding. Unlike linear epitopes, which depend solely on primary and can persist under denaturing conditions, conformational epitopes are highly sensitive to denaturation processes such as or chemical disruption, which abolish their structure and prevent antibody recognition. The structural basis of conformational epitopes lies in their location on the exposed surfaces of folded proteins, where disparate residues from secondary structural elements like beta-sheets, loops, and helices converge to form accessible binding sites for antibodies. These epitopes often span regions involving beta-sheet strands connected by loops, creating or convex patches that facilitate specific interactions. A prominent example is found in the HIV-1 glycoprotein gp120, where conformational epitopes at the CD4-binding site and variable loops (such as V1/V2) are critical for recognition by broadly neutralizing antibodies, enabling potent viral inhibition through precise targeting of the trimer's native conformation. Conformational epitopes represent approximately 90% of B-cell epitopes on native proteins, underscoring their dominance in humoral immune responses to folded . Their recognition necessitates an intact, non-denatured structure, as disruption of folding eliminates the spatial arrangement required for binding, highlighting their role in mimicking physiological . Biophysically, these epitopes are maintained by stabilizing features of the protein's tertiary structure, including disulfide bonds that covalently link distant cysteines to lock conformations and hydrophobic cores that bury nonpolar residues internally, shielding the epitope's solvent-exposed face. bridges, in particular, enhance rigidity in loop regions common to epitopes, while hydrophobic interactions contribute to the overall stability of surface patches, ensuring persistence under physiological conditions.

Immune Recognition Mechanisms

B Cell Epitopes

B cell epitopes are regions on the surface of that are directly recognized and bound by receptors (BCRs), which are membrane-bound immunoglobulins, or by secreted antibodies produced by plasma cells in the humoral immune response. This recognition occurs without the need for , focusing on native, extracellular epitopes that are accessible in their three-dimensional structure. BCR-antigen interactions typically take place in a two-dimensional environment, where the antigen may be anchored, facilitating initial binding and subsequent immune signaling. These epitopes are predominantly conformational, comprising approximately 90% of known cases, and are characterized by their solvent-exposed nature on the antigen surface, allowing interaction with the paratope of the BCR or antibody. In terms of physical dimensions, B cell epitopes typically encompass about 15 amino acid residues on average, with an elliptical surface area of roughly 400 Ų and a thickness of around 8 Å, corresponding to a diameter of 15-22 Å that fits within the antigen-binding site of an immunoglobulin. This preference for exposed, discontinuous structures arises because B cells target protruding or flexible regions rich in coils and charged residues, rather than buried or linear sequences. Upon binding, effective B cell activation requires the cross-linking of multiple BCRs by multivalent antigens, which oligomerizes the receptors and initiates intracellular signaling cascades leading to B cell proliferation, differentiation into antibody-secreting plasma cells, and affinity maturation. This cross-linking threshold ensures that only high-avidity interactions trigger a robust response, amplifying the humoral immune defense against pathogens or allergens. Representative examples include epitopes on the capsules of such as or , which are recognized by antibodies to promote opsonization and , eliciting protective . In allergic contexts, protein epitopes from allergens like Amb a 1 serve as targets for IgE antibodies, driving reactions in sensitized individuals.

T Cell Epitopes

T cell epitopes are short peptide fragments derived from intracellular or extracellular antigens that are processed and presented on the surface of antigen-presenting cells (APCs) or infected cells in complex with (MHC) molecules, enabling recognition by T lymphocytes. Unlike B cell epitopes, which are recognized directly on the native structure of antigens by B cell receptors, T cell epitopes require proteolytic processing into peptides typically ranging from 8 to 25 in length before presentation. This processing ensures that T cells monitor intracellular events, such as viral infections or aberrant protein production, by surveying the peptide-MHC (pMHC) complexes displayed on cell surfaces. The recognition process involves two primary pathways corresponding to MHC class I and class II molecules. For MHC class I presentation, endogenous antigens in the cytosol—such as viral or self-proteins—are degraded by the proteasome into peptides of 8-11 amino acids, which are then transported into the endoplasmic reticulum (ER) via the transporter associated with antigen processing (TAP), trimmed, and loaded onto MHC class I molecules for display on the cell surface to CD8+ cytotoxic T cells. In contrast, exogenous antigens taken up by endocytosis are processed in lysosomes or endosomes by proteases into longer peptides (12-25 amino acids), which bind to MHC class II molecules after removal of the invariant chain (CLIP) peptide, and are presented to CD4+ helper T cells primarily on professional APCs like dendritic cells. These pathways allow T cells to distinguish between intracellular threats (via MHC I) and extracellular pathogens (via MHC II), with peptide lengths accommodating the closed groove of MHC I and the open-ended groove of MHC II. MHC restriction is a fundamental principle dictating that T cell recognition is allele-specific, as epitopes must bind with sufficient affinity to particular MHC variants to form stable pMHC complexes; for instance, the HLA-A*02:01 , prevalent in about 50% of certain populations, frequently presents viral epitopes due to its binding motif favoring hydrophobic residues at anchor positions. This polymorphism in (HLA) genes—encoding MHC molecules—means that epitope immunogenicity varies across individuals, influencing immune responses to pathogens. Only peptides that fit the specific binding pockets of an individual's MHC s can elicit a T cell response, underscoring the of MHC diversity in population-level immunity.00701-7) Upon encounter, the (TCR) on a naïve T cell binds to the cognate pMHC complex, initiating through the CD3 complex and chain, but full activation requires a second co-stimulatory signal, such as interacting with B7 molecules on APCs, to prevent anergy or . For + T cells, this leads to into cytotoxic effectors that release perforin and granzymes to induce target cell , thereby eliminating infected or malignant cells. CD4+ T cells, conversely, become helper cells that secrete cytokines to orchestrate broader immune responses, including activation and recruitment. A well-characterized example is the matrix protein 1 (M1) epitope GILGFVFTL (residues 58-66), presented by HLA-A*02:01 in the pathway, which elicits a potent + cytotoxic response to clear infected respiratory epithelial cells.

Cross-Reactivity

Cross-reactivity in epitopes occurs when antibodies or T cell receptors (TCRs) bind to similar but non-identical epitopes on different antigens, primarily through the mechanism of molecular mimicry. In this process, epitopes share key structural residues or motifs that allow cross-binding, such as conserved sequences in viral proteins that mimic host or other structures. For instance, TCRs can recognize epitopes with partial if the core binding residues align sufficiently, enabling an against related pathogens. This phenomenon has significant implications for both protective and detrimental immune responses. Beneficially, it underpins heterosubtypic immunity in vaccines, where antibodies elicited against one provide partial protection against others due to shared epitopes, as demonstrated in studies showing cross-neutralization across subtypes. Conversely, it can drive pathological , such as in , where streptococcal M protein epitopes mimic cardiac , leading to cross-reactive antibodies that attack heart tissue. Factors influencing include exceeding 70% or structural similarity in epitope conformation, often quantified by cross-reactivity indices from assays like or , which measure binding affinity to heterologous antigens. High homology in anchor residues for MHC presentation further enhances T cell cross-reactivity. A notable example is in infections, where cross-reactive antibodies from a primary bind to epitopes on a secondary , facilitating (ADE) that worsens disease severity by promoting viral entry into immune cells. This highlights the double-edged nature of epitope cross-reactivity in immunity.

Epitope Mapping Techniques

B Cell Epitope Mapping

B cell epitope mapping involves a suite of experimental techniques designed to identify the specific regions on recognized by -derived antibodies, focusing on both linear and conformational epitopes central to . libraries represent a cornerstone method, where random or antigen fragment libraries are expressed on surfaces to screen for binding interactions, enabling the isolation of epitope mimics or direct antigen sequences that elicit specific antibody responses. This approach has been instrumental in design by revealing immunodominant B cell epitopes from pathogens. Structural methods provide atomic-level insights into epitope-paratope interfaces. determines the three-dimensional structure of -antigen complexes, often achieving resolutions below 2.5 Å to delineate contacting residues and conformational details. For instance, crystallographic studies have mapped epitopes on viral proteins, highlighting buried and solvent-exposed interactions critical for binding. Complementing this, cryo-electron microscopy (cryo-EM) excels for larger complexes, resolving structures at near-atomic resolution without crystallization; notable examples include mapping neutralizing epitopes on the , where cryo-EM revealed conformational epitopes targeted by memory B cells during . Functional assays quantify binding and validate epitope contributions. Enzyme-linked immunosorbent assay () and () measure antibody- affinity, with competition formats binning antibodies by overlapping epitopes and assessing binding kinetics in real-time. further refines mapping by systematically altering residues and monitoring loss of binding, pinpointing critical within epitopes; high-throughput variants, such as , accelerate identification of both linear and discontinuous sites. A key challenge in epitope mapping is ensuring native-like to capture conformational epitopes, which predominate in structured proteins and are often lost in linear peptide-based assays or denatured forms. Techniques like and cryo-EM mitigate this by preserving tertiary structures, but variability in antigen folding and affinity can complicate comprehensive mapping, necessitating integrated approaches for reliable results.

T Cell Epitope Mapping

T cell epitope mapping involves identifying peptides that, when presented by (MHC) molecules, elicit specific T cell responses, crucial for understanding cellular immunity. This process typically integrates functional assays to detect T cell activation with biochemical methods to assess peptide-MHC interactions, focusing on CD4+ and + T cells that recognize epitopes restricted by and class I molecules, respectively. One key technique is MHC tetramer staining, which uses fluorescently labeled MHC-peptide multimers to directly visualize and quantify epitope-specific T cells via . These tetramers, formed by biotinylated MHC-peptide complexes bound to streptavidin-fluorochrome conjugates, bind to T cell receptors with high specificity, enabling detection of antigen-specific + T cells at frequencies as low as 1:50,000 peripheral blood mononuclear cells (PBMCs) without prior . In -1 studies, MHC tetramers have mapped epitope-specific responses, revealing frequencies of 1-4% of + T cells in chronically infected individuals targeting conserved epitopes like those in protein restricted by *02:01. The enzyme-linked immunospot () assay measures release, such as interferon-gamma (IFN-γ), from T s activated by -MHC complexes, providing a sensitive readout of epitope-specific responses. In this method, PBMCs are incubated with candidate s in multi-well plates coated with -capture antibodies, where activated T s form visible spots proportional to their frequency; it can detect responses from as few as 1 in 100,000 s. is often combined with pools for initial screening, followed by to pinpoint immunogenic epitopes, and has been pivotal in validating T responses to HIV-1 antigens. Peptide libraries, particularly overlapping synthetic spanning proteins, are scanned to identify epitopes capable of MHC binding and T . Typically, libraries consist of 15-18 overlapping by 11 residues to ensure coverage of potential 8-11 mer class I or longer class II epitopes; these are tested in pools or matrices to minimize usage, with positive hits deconvoluted via iterative testing. For instance, such libraries have identified tumor-specific epitopes like those from melanoma-associated antigens by assessing T and production. In vitro binding assays quantify affinity to purified MHC molecules, often integrating predictions to prioritize candidates. Competitive binding assays, such as fluorescence polarization, measure the inhibitory concentration () required for a test to displace 50% of a labeled reference from MHC; lower values (e.g., <500 ) indicate high-affinity binders likely to form stable complexes for T recognition. These assays, performed under physiological conditions (37°C, 5.5 for class II), have refined by validating predicted binders for alleles like HLA-DR1. In HIV-1 , overlapping peptide libraries and have identified immunodominant epitopes restricted by specific HLA alleles, such as the Pol-IY11 (ILKEPVHGVYY) peptide presented by HLA-*12:02, confirmed via and T cell response assays in infected individuals. Similarly, the Nef-MY9 (MARELHPEY) epitope, also HLA-*12:02-restricted, elicits strong + T cell responses, highlighting allele-specific targeting in protective immunity. These approaches underscore the MHC dependency of T cell recognition, where epitopes must bind stably to elicit .

Practical Applications

Epitope Tags in

Epitope tags are short sequences derived from known antigens that are genetically fused to recombinant proteins to enable their detection, purification, and analysis using specific antibodies, without the need for custom antibodies against the target protein itself. These tags are particularly valuable in research settings where studying or low-abundance proteins requires reliable tools for visualization and isolation. Common epitope tags, such as , , and c-Myc, were selected for their compact size, minimal in expression hosts like mammalian or cells, and the existence of high-affinity monoclonal antibodies that recognize them with high specificity. The , derived from the protein, consists of the 9-amino-acid sequence YPYDVPDYA and was first introduced in 1988 for purifying RAS-responsive complexes in yeast. The is an 8-amino-acid synthetic sequence, DYKDDDDK, developed in 1988 as a marker for hybrid from mammalian cells, incorporating an enterokinase cleavage site for tag removal. The c-Myc tag, a 10-amino-acid EQKLISEEDL from the human c-Myc proto-oncogene product, originated from studies in 1985 and is recognized by the widely used 9E10 . In laboratory applications, epitope tags facilitate techniques such as Western blotting for protein expression confirmation and for studying protein-protein interactions. For instance, the is routinely employed to visualize recombinant proteins expressed in mammalian cells, allowing subcellular localization via without disrupting native function. These tags also support purification using anti-tag resins, enabling isolation of tagged proteins from complex lysates with high yield and purity in systems ranging from to eukaryotic cells.
TagSequenceLength (aa)Origin
HAYPYDVPDYA9 hemagglutinin
FLAGDYKDDDDK8Synthetic (enterokinase site)
c-MycEQKLISEEDL10Human c-Myc proto-oncogene
The primary advantages of epitope tags include their small size, which generally avoids significant alterations to the target protein's structure, folding, or activity, and the option for protease-mediated removal, as with the using enterokinase. This modularity makes them versatile for functional studies, where tags can be positioned at the N- or or internally if tolerated. However, limitations exist, such as potential interference with native or localization if the tag is placed in a critical region, and challenges in immunological where the tag itself may mask genuine epitopes or provoke unintended immune responses. Additionally, reliance on commercial antibodies can introduce variability in detection efficiency across different experimental contexts.

Epitope-Based Vaccines

Epitope-based vaccines are designed to elicit targeted s by incorporating specific antigenic epitopes from , rather than using whole organisms or inactivated pathogens. These vaccines typically employ subunit approaches, such as synthetic peptides representing linear T-cell or B-cell epitopes, or virus-like particles (VLPs) that display conformational epitopes to mimic native structures. For instance, the human papillomavirus () vaccine, such as , utilizes recombinant L1 capsid proteins assembled into VLPs that present conformational epitopes, inducing neutralizing antibodies against the without the risks associated with live-attenuated formulations. This design allows precise selection of immunodominant epitopes to focus the on critical pathogen components. A key advantage of epitope-based vaccines is their enhanced safety profile, as they avoid the inclusion of extraneous pathogen material that could cause adverse reactions or immune interference. By excluding non-essential antigens, these vaccines minimize side effects while enabling the creation of multi-epitope constructs that provide broad protection against variant strains, such as in polyvalent formulations targeting multiple serotypes. Additionally, their modular nature facilitates rapid adaptation to emerging threats through computational epitope prediction and synthesis. Despite these benefits, epitope-based vaccines face challenges, particularly their inherently low , which often necessitates the use of to provide T-cell help and enhance responses. For example, the RTS,S incorporates both T-cell and B-cell epitopes from the circumsporozoite protein (CSP) of , combined with the AS01 to boost production and cellular immunity, addressing the poor standalone of epitopes. Other hurdles include ensuring proper epitope processing and presentation , as well as overcoming potential in chronic infections. Clinical progress in epitope-based vaccines has demonstrated their potential, with peptide vaccines for inducing robust, epitope-specific CD8+ T-cell responses that correlate with tumor regression in phase I/II trials. These vaccines, often combining helper and cytotoxic T-lymphocyte epitopes, have shown durable immune memory and epitope spreading, where initial responses expand to additional tumor antigens, supporting their role in personalized . Ongoing advancements, including delivery systems, continue to improve efficacy in infectious disease settings beyond cancer.

Neoepitopes in Cancer

Neoepitopes are novel epitopes arising from somatic mutations in tumor cells, which generate unique antigenic peptides not present in normal tissues and capable of eliciting an antitumor immune response. These patient-specific neoepitopes typically result from point mutations, insertions, deletions, or gene fusions that alter protein sequences, leading to altered peptides that can bind to major histocompatibility complex (MHC) molecules for presentation to T cells. Unlike shared tumor antigens, neoepitopes are highly individualized, making them ideal targets for personalized cancer immunotherapies that minimize off-target effects on healthy cells. The generation of neoepitopes begins with tumor genomic sequencing to identify somatic variants, followed by bioinformatics prediction of peptides that can bind to the patient's specific HLA alleles and provoke T cell recognition. For instance, the G12D , prevalent in pancreatic ductal , produces a neoepitope that has been targeted in T cell receptor (TCR) , demonstrating tumor regression in a clinical case of metastatic . Whole-exome sequencing of tumor biopsies, combined with sequencing to confirm expression, enables the selection of high-affinity neoepitopes for therapeutic development. In therapeutic applications, neoepitope-targeted deliver patient-specific peptides or nucleic acids encoding them to stimulate T cell responses against tumors, while chimeric receptor () T cells or TCR-engineered T cells can be designed to recognize neoepitope-MHC complexes. Clinical trials of personalized neoantigen in advanced have reported objective response rates of approximately 30%, with durable responses in a subset of patients when combined with checkpoint inhibitors. As of 2025, expansion cohorts in trials like autogene cevumeran combined with have shown promising ORRs of 33.3% in CPI-naive advanced patients (n=9). Similarly, CAR-T therapies targeting neoepitope-derived peptides, such as those from mutant , have shown preclinical efficacy in solid tumors by enhancing tumor-specific . Key challenges in neoepitope-based include variability in MHC binding affinity, which depends on individual HLA types and can limit neoepitope , as well as intratumoral heterogeneity that allows antigen escape through subclonal mutations. These factors contribute to variable clinical responses, with only a fraction of predicted neoepitopes eliciting robust T cell activation . Addressing tumor and optimizing neoepitope selection remain critical for broader .

Computational Tools

Epitope Prediction Methods

Epitope prediction methods employ computational algorithms to identify potential antigenic sites from protein sequences, primarily focusing on T cell and B cell epitopes. For T cell epitopes, models such as NetMHC predict peptide affinity to ( molecules by estimating the half-maximal inhibitory concentration (), classifying peptides with IC50 values below 500 nM as strong binders likely to elicit + T cell responses. These artificial neural network-based approaches, initially developed using quantitative binding data, have evolved to incorporate pan-specific predictions for diverse MHC alleles, enhancing their utility in personalized immunotherapies. B cell epitope prediction tools, such as BepiPred, target linear epitopes by integrating sequence-based features like hydrophilicity, surface accessibility, and flexibility, which are derived from propensity scales and hidden Markov models trained on experimentally validated epitopes. This method scores segments for their likelihood of being exposed and immunogenic, prioritizing regions with high solvent exposure to facilitate recognition, though it performs best for sequential rather than discontinuous epitopes. Recent advances in epitope prediction leverage , particularly models trained on large datasets from the Immune Epitope Database (IEDB), to improve accuracy by capturing complex motifs and structural nuances. For instance, convolutional neural networks and transformers have been integrated into tools like MHCflurry and subsequent iterations, achieving predictive accuracies of approximately 70-80% for T cell epitopes when evaluated on independent benchmarks for and . These AI-driven methods outperform traditional models by reducing false positives through multi-layer feature extraction from peptide-MHC interactions. To validate predictions, especially for conformational epitopes, computational pipelines often combine sequence-based scoring with in silico molecular docking simulations that model antigen-antibody or peptide-MHC complexes. Tools employing rigid or flexible docking, such as those based on energy minimization algorithms, assess binding stability and epitope exposure, providing structural rationale for experimental prioritization and bridging the gap between linear predictions and three-dimensional antigen presentation.

Epitope Databases

The Immune Epitope Database (IEDB) serves as a central repository for experimentally validated immune epitopes, encompassing data on recognition, T cell responses, and (MHC) interactions derived from over 25,000 publications. Established in 2004 and funded by the National Institute of Allergy and Infectious Diseases (NIAID), the IEDB currently holds more than 2.2 million entries as of 2025, covering epitopes associated with infectious diseases, allergies, , and transplantation. Epitopes in the IEDB are richly annotated with detailed attributes, including sequences, associated MHC alleles (such as HLA I and II specificities), and experimental outcomes like affinities, immunogenicity measurements, and functional responses in cellular s. For instance, researchers can query the database for epitopes by specifying the , such as retrieving T cell epitopes from restricted to specific HLA alleles, facilitating targeted immunological studies. Specialized databases complement the IEDB for allergen epitopes, such as the Structural Database of Allergenic Proteins (SDAP 2.0), which curates over 1,600 sequences with associated epitope mappings, structural models, and data derived from experimental validations. Similarly, the COMPARE database, as of its 2025 release, focuses on clinically relevant protein sequences with descriptions and citation support, enabling comparative analysis for allergenicity assessment in food and environmental contexts to support safety evaluations and . These databases are instrumental in immunological research, particularly for training machine learning-based epitope prediction tools by providing large-scale, validated datasets that improve model accuracy in forecasting MHC binding and immunogenicity. They also enable benchmarking of experimental epitope mapping techniques, such as comparing high-throughput sequencing results against curated assay data to validate novel discoveries. The IEDB undergoes regular curation updates, with significant expansions post-2020 incorporating thousands of COVID-19-related epitopes from and T cell studies, reflecting the surge in research and enhancing resources for design and variant surveillance. Allergen databases like SDAP are similarly maintained with periodic releases to include emerging data from clinical trials and structural analyses.

References

  1. [1]
    State of the art in epitope mapping and opportunities in COVID-19
    Mar 6, 2023 · Epitopes or antigenic determinants are the antigen parts that bind to antibodies or to membrane receptors on lymphocytes, thus causing either ...
  2. [2]
  3. [3]
    What is an Epitope? - News-Medical
    May 10, 2021 · An epitope is the part of an antigen that the host's immune system recognizes, eliciting the immune response to an invading pathogen.Missing: authoritative | Show results with:authoritative
  4. [4]
    The antigenic structure of proteins: a reappraisal - PubMed
    The antigenic structure of proteins: a reappraisal. Annu Rev Immunol. 1984:2:67-101. doi: 10.1146/annurev.iy.02.040184.000435. Authors. D C Benjamin, J A ...
  5. [5]
    Epitope - an overview | ScienceDirect Topics
    Epitopes are defined as localized regions on the surface of an antigen that are recognized by the immune system, specifically by B- or T-cells.
  6. [6]
    An Introduction to B-Cell Epitope Mapping and In Silico ... - NIH
    Dec 29, 2016 · The main objective of epitope identification is to replace an antigen in the immunization, antibody production, and serodiagnosis. The accurate ...
  7. [7]
    High-resolution mapping of the HyHEL-10 epitope ... - PubMed - NIH
    The complex formed between hen egg white lysozyme (HEL) and the monoclonal antibody HyHEL-10 Fab fragment has an interface composed of van der Waals ...
  8. [8]
    The design and implementation of the immune epitope database ...
    Epitopes are defined as parts of antigens interacting with receptors of the immune system. Knowledge about their intrinsic structure and how they affect the ...
  9. [9]
    The effect of haptens on protein-carrier immunogenicity - PMC - NIH
    Immunodominance is described as epitopes in an antigen that are preferentially recognized by B and T cells, and hence the immune response focuses on only a few ...Protein--Hapten Conjugation · Elevation In Treg Cell... · Discussion
  10. [10]
    Biological Activity of the Carrier as a Factor in Immunogen Design ...
    Nov 14, 2018 · Only peptides can be presented via MHC molecules; therefore, proteins are the strongest immunogens, and haptens cannot elicit immune response.
  11. [11]
    Origins of specificity and affinity in antibody–protein interactions
    We found that the energetically critical epitope portions are largely composed of backbone atoms, side-chain carbons, and hydrogen bond donors/acceptors.Results And Discussion · Protein Atom Type... · Pairwise Atomistic Contacts...Missing: covalent | Show results with:covalent
  12. [12]
    Hydrophobic, hydrophilic and other interactions in epitope-paratope ...
    Microscopic scale specific interactions, e.g. between epitopes and paratopes, are also non-covalent and caused by attractive LW, AB and EL forces, which ...
  13. [13]
    Hydrophobic, hydrophilic and other interactions in epitope-paratope ...
    Upon expulsion of interstitial water of hydration between epitope and paratope, a strong interfacial bond ultimately arises which is mainly caused by LW forces.Missing: review | Show results with:review
  14. [14]
    Measuring Affinity Constants of 1450 Monoclonal Antibodies to ... - NIH
    Using this platform we measured affinity constants of over 1,410 rabbit monoclonal antibodies and 46 mouse monoclonal antibodies to peptide targets that are ...
  15. [15]
    Affinity & Avidity of Antibodies | Bio-Rad
    The affinity of monoclonal antibodies can be measured accurately because they are homogeneous and selective for a single epitope. Polyclonal antibodies are ...
  16. [16]
    Antibody Structure and Function: The Basis for Engineering ...
    In the lock and key model, the two molecules interact in a manner that minimizes changes in the conformations of the two protein surfaces from that observed in ...
  17. [17]
    Linear Epitope - an overview | ScienceDirect Topics
    B-cell epitopes are usually 5–17 amino acids in length but can be extended without adverse effects on the immune function.
  18. [18]
    Recent Progress in Antibody Epitope Prediction - PMC
    Aug 8, 2023 · Epitopes can be linear or conformational, meaning that they can either be a linear sequence of amino acids or a three-dimensional (3D) structure ...Missing: prevalence | Show results with:prevalence
  19. [19]
    AI-driven epitope prediction: a systematic review, comparative ...
    Aug 30, 2025 · Linear B-cell epitopes (continuous peptide segments) constitute only ~10% of all B-cell epitopes. In contrast, most are structural epitopes ...Missing: prevalence | Show results with:prevalence
  20. [20]
  21. [21]
    Immunization with recombinant fusion of LTB and linear epitope (40 ...
    Jul 12, 2019 · Epsilon toxin (Etx) produced by Clostridium perfringens types B and D, a major causative agent of enterotoxaemia causes significant economic ...<|control11|><|separator|>
  22. [22]
    A poliovirus type 1 neutralization epitope is located within amino ...
    We thus conclude that the neutralization epitope recognized by C3 is located within the region of amino acids 93-104 of capsid polypeptide VP1.Missing: linear | Show results with:linear
  23. [23]
    Synthetic peptides from four separate regions of the poliovirus type 1 ...
    Sep 17, 1984 · Thus, the neutralizing determinants on VP1 reside in specific noncontig- uous regions of the protein and can be defined by specific pep- tides ...
  24. [24]
    Linear Epitope Binding Patterns of Grass Pollen-Specific Antibodies ...
    Mar 30, 2022 · We developed an allergome-wide peptide microarray, aiming to track linear epitope binding patterns in allergic diseases and during AIT.Missing: prevalence | Show results with:prevalence
  25. [25]
    Identification of linear epitopes on the flagellar proteins of ... - Nature
    May 11, 2021 · In this paper, the FliC and FliD protein polypeptide epitopes were mapped in silico and by using PEPSCAN procedure. We identified four promising epitopes.
  26. [26]
    Natural immunogenic properties of bioinformatically predicted linear ...
    Nov 3, 2021 · The present study describes the immunogenic potential of those predicted epitopes that are conserved across and within the four serotypes, during natural ...Discussion · Methods · Mice Immunization And...Missing: prevalence | Show results with:prevalence
  27. [27]
    An overview of methods for the structural and functional mapping of ...
    Here, the epitope is defined as the antigenic surface recognized by a given antibody. Identifying the structures, sequences, and sequence constraints on such ...
  28. [28]
    Conformational Epitope - an overview | ScienceDirect Topics
    B-cell epitopes are usually 5–17 amino acids in length but can be extended without adverse effects on the immune function. The conformational epitopes used for ...
  29. [29]
    Screening for Conformational Epitopes Using Heat Denaturation of ...
    CONCLUSIONS: These data indicate that conformational epitopes are sensitive to heating the protein attached to plastic surfaces. Heat denaturation of allergens ...
  30. [30]
    Conformational Epitope - an overview | ScienceDirect Topics
    Loss of IgE binding to denatured allergens indicates that conformational IgE epitopes, especially from aeroallergens, are common.10 Food allergens are ...
  31. [31]
    High-resolution Mapping of Linear Antibody Epitopes Using ... - NIH
    In general, the epitopes were from 5 to 10 amino acids long (range from 4 to 12 amino acids; Fig. 5). The epitopes contained highly dominant positions where ...
  32. [32]
    Identification of conformational epitopes for human IgG on ...
    N111, located in a well defined long loop between the β2 and β3 strands, was also shown to contribute to antibody binding. Surface 1.1–1.4 are overlapping ...
  33. [33]
    Broad neutralization by a combination of antibodies recognizing the ...
    Jul 23, 2012 · Broad neutralization by a combination of antibodies recognizing the CD4 binding site and a new conformational epitope on the HIV-1 envelope protein.
  34. [34]
    Conformational B‐Cell Epitopes Prediction from Sequences Using ...
    Jun 17, 2014 · Although the proportion of linear epitopes is very small while the proportion of conformational epitopes is ~90%, the study on ...
  35. [35]
    B cell epitope prediction by capturing spatial clustering property of ...
    Nov 11, 2024 · It is known that approximately 90% of B cell epitopes are conformational epitopes, and the structure-based methods typically outperform the ...Missing: percentage | Show results with:percentage
  36. [36]
    Disulfide Bond Introduction for General Stabilization of ...
    Many theoretical and experimental studies indicated that disulfide bridges can increase the conformational stability of proteins mainly by constraining the ...
  37. [37]
    Functional Implications of the Binding Mode of a Human ...
    ... conformation-dependent epitope which is stabilized by a hydrophobic core in V2. Our results provide a binding motif signature for anti-V2i MAb 2158 and ...
  38. [38]
    Cation–π, amino–π, π–π, and H‐bond interactions stabilize antigen ...
    Feb 1, 2014 · Thus, compared with the conformational epitope regions, the protein–protein interfaces contain more hydrophobic residues and less polar and ...
  39. [39]
    B cell receptors and free antibodies have different antigen-binding ...
    In vivo, B cell receptor (BCR)–antigen interactions occur in a 2-dimensional (2D) environment with antigen anchored to a membrane.
  40. [40]
    Analysis of Virus-Specific B Cell Epitopes Reveals Extensive ... - NIH
    Jun 21, 2024 · We were surprised to find that most virus-specific B cell epitopes (64.44%) lay on viral antigens that remained enclosed in host cells or viral ...
  41. [41]
    Defining and studying B cell and T cell receptor interactions - NIH
    ... epitope. B cell receptors (BCRs) and their secreted counterparts, antibodies, recognize native three-dimensional epitopes, whereas T cell receptors (TCRs) ...
  42. [42]
    B-cell epitope prediction through a graph model - PMC
    B-cell epitopes are of two types: about 10% of them are linear B-cell epitopes and about 90% are conformational B-cell epitopes [2-4].Missing: diameter | Show results with:diameter
  43. [43]
    Advancements in the conservation of the conformational epitope of ...
    Feb 28, 2025 · Moreover, after immunization, detergents may dissociate from MP, thus resulting in denaturation and possible conformational epitope loss.
  44. [44]
    Structural analysis of B-cell epitopes in antibody:protein complexes
    Jul 10, 2012 · The average epitope was found to be made up of ~15 residues with one linear stretch of 5 or more residues constituting more than half of the ...
  45. [45]
    B-cell activation by armed helper T cells - Immunobiology - NCBI - NIH
    Thus, protein antigens binding to B cells both provide a specific signal to the B cell by cross-linking its antigen receptors and allow the B cell to attract ...
  46. [46]
    Molecular requirements of the B‐cell antigen receptor for sensing ...
    This observation was interpreted to mean that a “cross‐linking” of two monomeric BCR complexes is required for B‐cell activation, thereby supporting the cross‐ ...
  47. [47]
    The tipping points in the initiation of B cell signalling - PubMed Central
    B cells are selected by the binding of antigen to clonally distributed B cell receptors (BCRs), triggering signalling cascades that result in B cell activation.
  48. [48]
    Carbohydrate-Based Targets and Vehicles for Cancer and Infectious ...
    Bacterial carbohydrates, particularly those of the bacterial capsules of Haemophilus influenzae, Streptococcus pneumoniae and Neisseria meningitidis can be the ...
  49. [49]
    The architecture of the IgG anti-carbohydrate repertoire in primary ...
    The prominent exposure of carbohydrate structures (glycans) on the surface of cells or bacterial capsules renders them accessible to antibodies, which ...Glycan Array Analysis · Results · Glycan-Specific Igg...
  50. [50]
    Design of an Epitope-Based Peptide Vaccine Against the Major ...
    Apr 1, 2022 · Tested sera from patients with ragweed allergy showed that the ragweed specific IgE antibodies can bind to the identified B-cell epitope. ...Results · B-Cell Epitope Prediction · Docking Study Of B-Cell And...
  51. [51]
    Present Yourself! By MHC Class I and MHC Class II Molecules - PMC
    Nov 1, 2017 · T cells have antigen receptors that recognize antigenic peptide, but only in the context of MHC I or MHC II molecules that are displaying the ...
  52. [52]
    The ins and outs of MHC class II-mediated antigen processing and ...
    In this Review, we describe our current knowledge of the mechanisms of uptake and processing of antigens, the intracellular formation of peptide–MHC class II ...
  53. [53]
    Single-molecule investigations of T-cell activation - PubMed Central
    Nov 1, 2019 · T-cell activation is initiated by the direct physical interaction of the T cell antigen receptor (TCR) with cognate peptide presented by the ...
  54. [54]
    Physical detection of influenza A epitopes identifies a stealth ... - PNAS
    Using new MS technology, epitopes on infected human HLA-A2 + lung epithelium are identified and abundances characterized.
  55. [55]
    Epitopes and Mimotopes Identification Using Phage Display ... - NIH
    Jun 29, 2023 · To overcome these challenges, phage display has emerged as a valuable tool for identifying specific epitopes that could be used in vaccines.
  56. [56]
    Epitope Mapping of Antibody-Antigen Interactions with X-ray ... - NIH
    We present here a protocol for epitope mapping using X-ray crystallography to accurately determine the conformational epitope recognized by a nanobody.
  57. [57]
    B-cell epitope mapping for the design of vaccines and effective ...
    B-cell epitope mapping is a promising approach to identifying the main antigenic determinants of microorganisms, in special concern the discontinuous ...
  58. [58]
    A high-throughput shotgun mutagenesis approach to mapping B-cell ...
    We have developed a high-throughput strategy, shotgun mutagenesis epitope mapping, that enables the identification of both linear and conformational epitopes.
  59. [59]
    Multi‐perspectives and challenges in identifying B‐cell epitopes - PMC
    To further improve the accuracy of epitope prediction, methods tried combinations of different properties (e.g., hydrophilicity, flexibility, and polarity); the ...
  60. [60]
    High Throughput T Epitope Mapping and Vaccine Development - PMC
    In the present review we mention the predictive and experimental systems that can be used for T-cell epitope mapping. Since the experimental work requires ...
  61. [61]
    MHC Tetramer Analyses of CD8+ T Cell Responses to HIV and SIV
    Jan 14, 2010 · The method is extraordinarily robust, and can detect antigen-specific populations at frequencies as low as 1:5,000 CD8+ T cells (or ...<|separator|>
  62. [62]
    T-cell Epitope Mapping Using the ELISPOT Approach - PubMed
    Herein we describe the rationale, methodology, and useful hints for performing T-cell epitope mapping using ELISPOT analysis, and review typical results of such ...Missing: MHC tetramer peptide libraries
  63. [63]
    Measurement of peptide binding to MHC class II molecules by ...
    IC50 can be determined by plotting the binding competition curve. A flow chart schematizing the assay is presented in Figure 1. Figure 1. A schematic overview ...
  64. [64]
    Identification of Immunodominant HIV-1 Epitopes Presented by HLA ...
    Aug 13, 2019 · We identified a number of novel HLA-C*12:02-bound HIV-1 peptides and showed that although the majority of them did not elicit T cell responses ...
  65. [65]
  66. [66]
  67. [67]
    A game of tag: A review of protein tags for the successful detection ...
    Jun 19, 2025 · Here, we review the impact of commonly used affinity and epitope tags (e.g. Polyhistidine‐tag, Strep‐tag II, HA‐tag and FLAG‐tag) on recombinant ...
  68. [68]
    HPV vaccine: an overview of immune response, clinical protection ...
    The L1 capsid protein contains multiple overlapping epitopes, some of which may be immunodominant. The immune system responds to the antigen by producing a ...
  69. [69]
    Epitope-Based Immunome-Derived Vaccines: A Strategy for ... - NIH
    This vaccine concept is based on the identification of a minimal set of antigens that induce a competent immune response to a pathogen or neoplasm. Recognition ...
  70. [70]
    Immunoinformatics Approach for Epitope-Based Vaccine Design - NIH
    Common protocols used to design epitope-based peptide vaccines include tumor antigen identification, protein structure analysis, T cell epitope prediction, ...
  71. [71]
    Characterization of T-cell immune responses in clinical trials of the ...
    Hence RTS,S was designed to include CSP T-cell epitopes in addition to the prominent B-cell epitope made up of the asparagine-alanine-asparagine-proline ...
  72. [72]
    Peptide Vaccine: Progress and Challenges - PMC - PubMed Central
    Jul 2, 2014 · Despite several advantages, peptides are typically poorly immunogenic when used alone, requiring the next generation of adjuvants to overcome ...
  73. [73]
    Phase I/II study of immunotherapy with T-cell peptide epitopes in ...
    Previous studies in small groups of patients suggested that immunization of melanoma patients with peptide epitopes recognized by T cells could induce ...
  74. [74]
    Neoantigens: promising targets for cancer therapy - Nature
    Jan 6, 2023 · Neoantigens are newly formed antigens generated by tumor cells as a result of various tumor-specific alterations, such as genomic mutation, ...
  75. [75]
    Advances in the development of personalized neoantigen-based ...
    Jan 20, 2021 · Tumours have a high degree of genetic heterogeneity, which has fundamental implications for the efficacy of cancer therapy, including ...<|control11|><|separator|>
  76. [76]
    Neoantigen T-Cell Receptor Gene Therapy in Pancreatic Cancer
    Jun 1, 2022 · In this patient, TCR gene therapy targeting the KRAS G12D driver mutation mediated the objective regression of metastatic pancreatic cancer.
  77. [77]
    Personalized neoantigen vaccine and pembrolizumab in advanced ...
    Apr 7, 2024 · The objective response rate (modified intention-to-treat) per Response Evaluation Criteria in Solid Tumors 1.1 was 30.6% (11 of 36 patients), ...
  78. [78]
    NetMHCpan 4.0: Improved peptide-MHC class I interaction ...
    We present NetMHCpan-4.0, a method trained on both binding affinity and eluted ligand data leveraging the information from both data types.
  79. [79]
    BepiPred-2.0: improving sequence-based B-cell epitope prediction ...
    May 2, 2017 · BepiPred-2.0 is based on a random forest algorithm trained on epitopes annotated from antibody-antigen protein structures.
  80. [80]
    T Cell Epitope Prediction and Its Application to Immunotherapy
    Sep 14, 2021 · In this review, we focus on the currently available computational tools for epitope prediction, with a particular focus on tools aimed at identifying ...
  81. [81]
    Development of a Novel In Silico Docking Simulation Model for the ...
    We have developed a novel in silico epitope prediction method, based on HLA crystal structure and a peptide docking simulation model, calculating the peptide- ...
  82. [82]
    Conformational epitope matching and prediction based on protein ...
    May 31, 2021 · A conformational epitope (CE) is composed of neighboring amino acid residues located on an antigenic protein surface structure.
  83. [83]
    Immune Epitope Database (IEDB): 2024 update - Oxford Academic
    Nov 18, 2024 · Over the past 20 years, the IEDB has extracted antibody, T cell, and major histocompatibility complex (MHC) experimental data from >25 000 ...
  84. [84]
    Integrating machine learning to advance epitope mapping - PMC - NIH
    Sep 30, 2024 · 224,000 entries (B-cell & T-cell), https://cedar.iedb.org/, (23). 5, IEDB, T-cell & B-cell epitopes, 1,610,000 entries (B-cell & T-cell), https ...
  85. [85]
    IEDB.org: Free epitope database and prediction resource
    The Immune Epitope Database (IEDB) is a freely available resource funded by NIAID. It catalogs experimental data on antibody and T cell epitopes.Database Export · T Cell Prediction - Class I · IEDB Solutions · IEDB User WorkshopMissing: deep | Show results with:deep
  86. [86]
    The Immune Epitope Database (IEDB): 2024 update - PMC - NIH
    Nov 18, 2024 · The Immune Epitope Database (IEDB): 2024 update ... (A) The most growth has been noted in the numbers of assays per epitope and per publication.
  87. [87]
    Structural Database of Allergenic Proteins: SDAP 2.0
    SDAP is a Web server that integrates a database of allergenic proteins with various computational tools that can assist structural biology studies related ...
  88. [88]
    The updated Structural Database of Allergenic Proteins (SDAP 2.0 ...
    SDAP 2.0 is a public resource with 3D models of allergens, computational tools, and 1657 curated sequences, 334 structures, and 1565 3D models.
  89. [89]
    The COMPARE Database: A Public Resource for Allergen ... - Frontiers
    Aug 5, 2021 · SDAP for example, can be used to find sequence and structural neighbors for an allergen, and to search for the presence of an epitope (18–20).