HLA-DR4
HLA-DR4 is a serotype within the human leukocyte antigen (HLA) class II system, specifically recognizing the gene products encoded by the HLA-DRB104 alleles, which form the beta chains of major histocompatibility complex (MHC) class II molecules responsible for presenting peptide antigens to CD4+ T lymphocytes in the adaptive immune response. These molecules are expressed on antigen-presenting cells such as dendritic cells, macrophages, and B cells, and their role in immune regulation makes HLA-DR4 a key player in distinguishing self from non-self antigens. The HLA-DR4 serogroup is one of the most polymorphic in the HLA-DR locus, with more than 300 alleles identified as of 2019, including prominent subtypes like DRB10401, *0404, *0405, and *0408.[1] Many HLA-DR4 alleles share a conserved amino acid motif known as the shared epitope (SE), consisting of sequences such as QKRAA, QRRAA, or RRRAA at positions 70–74 in the DRB1 beta chain's third hypervariable region, which influences antigen-binding specificity and T-cell activation.[2] This structural feature contributes to the serogroup's complexity and its variable implications in immune-mediated conditions. HLA-DR4 haplotypes exhibit distinct structural diversity in their DR and associated DS beta chains, leading to five distinct haplotypes that differ in their peptide presentation capabilities.[3] HLA-DR4 is strongly associated with susceptibility to autoimmune diseases, particularly rheumatoid arthritis (RA), where SE-positive alleles like DRB1*0401 (relative risk ~6) and *0404 (relative risk ~5) increase disease risk by facilitating the presentation of arthritogenic self-peptides, such as type II collagen fragments, to autoreactive T cells.[2] A dose effect is observed in RA, with individuals carrying two SE alleles showing higher severity compared to those with none, who often experience milder forms. In type 1 diabetes (T1D), HLA-DR4 subtypes defined by specific residues at beta chain positions 71, 74, and 86 form motifs that either confer risk, neutrality, or protection; for instance, certain configurations at β74 strongly correlate with increased T1D susceptibility in large cohort studies involving thousands of patients.[4] These associations highlight HLA-DR4's role in modulating immune tolerance and its potential as a biomarker for disease prediction and therapeutic targeting.[2]Molecular and Serological Characteristics
Gene Locus and Structure
The HLA-DR4 antigen is encoded by genes situated on the short arm of human chromosome 6 at locus 6p21.31, within the major histocompatibility complex (MHC) class II region. This genomic region spans approximately 1000 kb and houses a cluster of highly polymorphic genes responsible for immune recognition. Specifically, HLA-DR4 expression arises from the HLA-DRA gene, which produces the invariant alpha chain, and the HLA-DRB1 gene, which encodes the highly polymorphic beta chain characterized by DRB1*04 alleles. The alpha chain gene is monomorphic, exhibiting minimal sequence variation across individuals, whereas the beta chain gene displays extensive polymorphism, particularly in the first exon encoding the peptide-binding domain. The mature HLA-DR4 protein forms a transmembrane heterodimer consisting of a glycosylated alpha chain (molecular weight 34-35 kDa) and beta chain (29-32 kDa). Each chain features two extracellular domains—alpha1 (residues 1-90) and alpha2 (91-181) for the alpha chain, and beta1 (1-91) and beta2 (92-182) for the beta chain—followed by a transmembrane helix and a short cytoplasmic domain. The alpha1 and beta1 domains fold into an immunoglobulin-like structure, creating a peptide-binding groove with two parallel alpha-helices (one from each chain) that flank an antiparallel beta-sheet floor composed of eight strands. Disulfide bonds within the alpha2 and beta2 domains stabilize the overall fold, while N-linked glycosylation sites, such as Asn78 in the alpha1 domain and Asn19 in the beta1 domain, contribute to proper assembly, trafficking, and stability; certain DRB1*04 allelic variants may exhibit subtle differences in glycosylation occupancy or accessibility due to nearby polymorphisms. High-resolution crystal structures provide detailed insights into the HLA-DR4 architecture. For instance, the 2.05 Å structure of HLA-DRA01:01/HLA-DRB104:01 bound to a human collagen type II peptide (PDB entry 7NZE, released 2022) illustrates the open-ended peptide-binding cleft accommodating 13-25 residue peptides, with anchor residues at positions P1, P4, P6, P7, and P9. Key beta-chain residues, including 67 (leucine), 70 (glutamine), and 71 (arginine/lysine), line the groove and influence peptide anchoring and specificity, underscoring the structural basis for allelic diversity in antigen presentation.Serological Typing
HLA-DR4 is defined as a serotype of the human leukocyte antigen (HLA) class II DR locus, identified by the reactivity of antisera specific to epitopes encoded primarily by HLA-DRB1*04 alleles.[5] These antisera bind to the β-chain of the HLA-DR heterodimer expressed on B lymphocytes and antigen-presenting cells, distinguishing DR4 from other DR serotypes such as DR1 and DR5 through unique epitope recognition patterns. Serological typing of HLA-DR4 historically relied on complement-dependent cytotoxicity (CDC) assays, where patient lymphocytes are incubated with anti-DR4-specific alloantisera or monoclonal antibodies, followed by complement addition to induce cell lysis in positive reactions.[6] Microlymphocytotoxicity variants of these assays, using purified B-cell populations, improved specificity for class II typing by reducing interference from class I antigens.[7] Correlation with molecular typing shows strong concordance, as most common DRB1*04 alleles (e.g., *04:01 to 04:16) react positively with DR4 antisera, though rare variants like DRB104:17 to *04:60 lack defined serological reactivity due to limited testing data.[5] HLA-DR4 antisera exhibit minimal cross-reactivity with DR1 (associated with DRB101 alleles) or DR5 (DRB111/*12), as these serotypes are defined by distinct β-chain hypervariable region motifs that alter antibody binding affinity.[8] However, subtle epitope sharing can occur in heterozygous individuals, necessitating confirmatory testing to resolve ambiguities.[7] The serological identification of HLA-DR4 emerged in the early 1980s through international histocompatibility workshops, with the 8th Workshop in 1980 standardizing DR antigen definitions using panel-reactive antisera from immunized donors.[9] This built on earlier cellular typing efforts, but serological methods dominated until the 1990s, when polymerase chain reaction (PCR)-based genotyping began replacing them for higher resolution.[10] Despite its foundational role, serological typing offers lower resolution than molecular methods, often failing to distinguish between DRB1*04 subtypes and providing blanks or ambiguities in up to 25% of class II typings due to weak antigen expression or reagent limitations.[7] It also incompletely covers rare alleles, which may not elicit detectable serological responses, limiting its utility in diverse populations or high-resolution matching scenarios.[11]Allelic Variants and Frequencies
The HLA-DR4 specificity is encoded by alleles of the HLA-DRB104 group, following the standardized IMGT/HLA nomenclature, which assigns unique identifiers based on nucleotide sequence differences.[12] The primary alleles range from DRB104:01 to DRB104:17, with DRB104:01 representing the most widespread variant across diverse populations and DRB1*04:05 showing elevated prevalence in East Asian groups.[13] This nomenclature facilitates precise identification of molecular variants, distinguishing them from serological definitions.[12] Sequence polymorphisms within the DRB104 alleles are concentrated in exon 2, which codes for the alpha-helical and beta-sheet structures of the peptide-binding groove.[14] These variations influence antigen presentation potential; for instance, DRB104:01 carries lysine at beta chain position 71, while DRB104:04 has arginine at this site, altering the groove's electrostatic properties.[15] As of 2025, the IMGT/HLA database recognizes over 300 DRB104 subtypes, reflecting ongoing high-throughput sequencing efforts that have added numerous variants since 2020.[16][12] Population-specific allele frequencies for DRB104 variants exhibit significant diversity, as documented in global databases. DRB104:01 typically ranges from 5-10% allele frequency in European-descended populations, such as 13.5% in English cohorts and 12.5% in Dutch samples, but rises to approximately 30% in certain Native American groups like the Yupik of Alaska.[17][18] In contrast, DRB104:02 occurs at 2-5% in Caucasians, based on registry data from diverse U.S. subpopulations.[19] DRB104:05 is notably common in Asians, reaching 15.5% allele frequency in Japanese populations from Kyoto and Osaka, and around 9-14% in other East Asian groups like Koreans and Taiwanese.[20][21]| Allele | Example Population | Approximate Allele Frequency (%) | Source |
|---|---|---|---|
| DRB1*04:01 | Europeans (e.g., Dutch) | 12.5 | allelefrequencies.net |
| DRB1*04:01 | Native Americans (e.g., Yupik) | 23.2 | allelefrequencies.net |
| DRB1*04:02 | Caucasians (U.S. registry) | 2-5 | PubMed |
| DRB1*04:05 | Japanese | 15.5 | allelefrequencies.net |
Functional Role
Antigen Presentation Mechanism
HLA-DR4 molecules, as part of the MHC class II family, follow the canonical biosynthetic pathway for antigen presentation. Newly synthesized HLA-DR4 αβ heterodimers assemble in the endoplasmic reticulum (ER), where they associate with the invariant chain (Ii) to form a nonameric complex (Ii₃:(αβ)₃), preventing premature binding of endogenous peptides and facilitating proper folding and trafficking.[22] The Ii chain binds via its class II-associated invariant chain peptide (CLIP) region to the peptide-binding groove of HLA-DR4, acting as a placeholder. This complex is then transported from the ER through the Golgi apparatus to late endosomal/lysosomal compartments known as MHC class II compartments (MIICs).[23] In the acidic environment of endosomes (pH ~5-6), the Ii chain undergoes stepwise proteolytic degradation, primarily by cathepsin S, which cleaves Ii into smaller fragments, ultimately generating the CLIP peptide that remains bound to the HLA-DR4 groove.[24] Cathepsin S activity is pH-dependent, with optimal function in mildly acidic endosomal conditions, ensuring efficient Ii processing across the endocytic pathway.[25] HLA-DM, a peptide editor molecule, then catalyzes the removal of CLIP and facilitates the exchange for antigenic peptides derived from endocytosed proteins, which are degraded by lysosomal proteases into 13-25 amino acid fragments suitable for binding. This exchange is also pH-dependent, occurring optimally at endosomal acidity to promote stable peptide-MHC complexes.[23][25] Specific to HLA-DR4, the open-ended peptide-binding groove accommodates extended peptides of 13-25 amino acids, with binding stabilized by hydrophobic anchor residues preferentially at positions P1, P4, P6, and P9 within the peptide core.[26] HLA-DM plays a crucial role in editing the peptide repertoire for HLA-DR4 by promoting dissociation of low-affinity peptides and favoring high-stability complexes, thereby shaping the immunopeptidome presented on antigen-presenting cells. In certain contexts, such as HLA-DR4-restricted presentation of collagen-derived peptides, HLA-DM exerts negative regulation by inhibiting surface presentation in the recycling pathway, reducing T cell recognition even at low expression levels.[23][27] Once loaded, HLA-DR4-peptide complexes traffic to the cell surface for CD4+ T cell recognition, with surface stability influenced by peptide affinity; high-affinity peptides confer a half-life of approximately 15-25 hours on monocyte-derived dendritic cells.[28] Recycling of surface HLA-DR4 molecules back to endosomes allows for additional peptide exchange, particularly for immunodominant epitopes, maintaining presentation efficiency without de novo synthesis. This recycling is mediated by cytoplasmic tail motifs and depends on endosomal pH for dynamic peptide editing.[29][25]Role in Immune Response
HLA-DR4 contributes to adaptive immunity by presenting peptide antigens derived from extracellular pathogens and self-proteins to CD4+ T cells via their T cell receptors (TCRs), initiating a cascade of immune signaling that amplifies the response to foreign threats while maintaining homeostasis.[30] This presentation stabilizes the interaction between antigen-presenting cells (APCs) and CD4+ T cells through HLA-DR4's binding to CD4, enhancing TCR signaling and promoting T cell proliferation and differentiation.[30] Upon antigen recognition, HLA-DR4-restricted CD4+ T cells undergo activation, leading to the release of key cytokines such as interleukin-2 (IL-2) and interferon-gamma (IFN-γ), which drive clonal expansion and effector functions.[31] HLA-DR4 plays a specific role in biasing differentiation toward Th1 and Th17 subsets; for instance, certain DR4 alleles promote Th1 effector memory CD4+ T cells that secrete IFN-γ, while others enhance Th17 polarization through IL-17 production, facilitating defense against intracellular pathogens and extracellular bacteria, respectively.[32][33] HLA-DR4 also participates in immune tolerance to prevent excessive reactivity. In central tolerance, thymic APCs expressing HLA-DR4 present self-peptides to developing CD4+ T cells, inducing deletion of high-affinity autoreactive clones or their diversion into regulatory T cells (Tregs).[34] Peripherally, HLA-DR4-mediated presentation can trigger anergy in low-affinity self-reactive T cells or expand FoxP3+ Tregs that suppress unwanted responses through cytokine modulation and cell-cell contact.[34] Specific DR4 variants, such as DRB1*04:02, enhance these mechanisms by increasing thymic deletion and Treg numbers, thereby strengthening self-tolerance.[34] Polymorphisms in HLA-DR4, particularly the shared epitope (SE) in alleles like DRB104:01 and DRB104:04, influence T cell selection and reactivity; the SE motif directly contacts the TCR during peptide presentation, potentially favoring the escape of autoreactive T cells from tolerance checkpoints and promoting their activation.[35] This structural feature alters peptide-MHC-TCR interactions, leading to biased TCR repertoires that may heighten responsiveness to certain antigens.[32] Full CD4+ T cell activation by HLA-DR4 requires co-stimulatory signals from APCs, where CD80 (B7-1) and CD86 (B7-2) bind CD28 on T cells, providing the second signal necessary to prevent anergy and sustain cytokine production. Without these interactions, HLA-DR4 presentation alone may induce tolerance rather than immunity. Activated HLA-DR4-restricted CD4+ T cells further support humoral immunity by providing help to B cells; they recognize antigens presented on B cell MHC class II (including DR4) and deliver CD40L and cytokines to promote B cell proliferation, class switching, and antibody production.[36] Overall, HLA-DR4 is essential for coordinating pathogen-specific humoral and cellular responses, enabling effective clearance of infections through Th1/Th17-mediated inflammation and antibody-mediated neutralization. However, carriers of certain DR4 alleles exhibit hyper-responsiveness, characterized by amplified T cell activation and cytokine output, which can intensify immune vigilance but risks overactivation.[33]Disease Associations
Autoimmune Disease Links
HLA-DR4 is strongly associated with increased susceptibility to rheumatoid arthritis (RA), with the DR4 serotype conferring an odds ratio (OR) of approximately 3-4 for disease development.[37] This risk is primarily driven by shared epitope (SE)-bearing alleles such as HLA-DRB1*04:01 and 04:04, which encode a conserved amino acid sequence in the beta-chain that enhances presentation of citrullinated peptides, contributing to autoimmunity.[35] A review of SE mechanisms highlights how these alleles promote T-cell responses to arthritogenic self-antigens in RA pathogenesis.[38] Furthermore, the heterozygous genotype HLA-DRB101:01/04:01 is linked to higher RA mortality, particularly from ischemic heart disease, underscoring its role in disease severity.[39] In Japanese populations, HLA-DRB104:05 shows a pronounced association with RA susceptibility and progression, reflecting allele-specific variations across ethnic groups.[40] In type 1 diabetes (T1D), HLA-DRB104:01 in combination with the DQ8 haplotype (DQA103:01-DQB1*03:02) confers one of the highest genetic risks, with ORs ranging from 10 to 15, due to enhanced presentation of islet autoantigens like insulin and GAD65.[41] This DR4-DQ8 combination accounts for a substantial portion of T1D heritability, particularly in early-onset cases.[42] Recent analyses of stratified genetic risk in DR4 carriers reveal heterogeneity in progression pathways, with distinct non-HLA loci influencing outcomes in DR4-positive individuals.[43] Associations with other autoimmune diseases vary by allele and population. In systemic lupus erythematosus (SLE), HLA-DR4 acts as a protective factor in certain ethnic groups, such as Asians, with reduced allele frequencies and ORs below 1 observed in meta-analyses.[44] For pemphigus vulgaris, HLA-DRB104:02 significantly elevates risk (OR 5-10), promoting autoantibody production against desmoglein 3 through altered peptide binding in the antigen groove.[45] Pemphigoid gestationis, a rare pregnancy-associated autoimmune bullous dermatosis, demonstrates a strong genetic link to HLA-DR4 alongside DR3. Molecular analyses indicate that MHC class II antigens DR3 and DR4 are present in a majority of affected individuals, contributing to autoantibody production against hemidesmosomal proteins like BP180. This association underscores HLA-DR4's role in aberrant immune tolerance during gestation, though specific odds ratios vary across studies without consistent quantification exceeding 20-fold risk.[46][47] In multiple sclerosis, HLA-DRB104:01 modestly increases susceptibility (OR ~1.5), potentially via presentation of myelin-derived peptides, though this effect is less dominant than HLA-DRB1*15:01.[48] Mechanistically, HLA-DR4 alleles contribute to autoimmunity by facilitating enhanced presentation of autoantigens; for instance, in RA, SE-positive DR4 binds and presents collagen II peptides, as revealed by crystal structures showing key interactions in the peptide-binding cleft.[49] This structural basis supports T-cell activation against joint tissues, linking genetic variation to pathological immune responses across these conditions.Infectious and Oncologic Associations
HLA-DR4 exhibits associations with both susceptibility and protection in various infectious diseases, primarily through its influence on antigen presentation to CD4+ T cells. In SARS-CoV-2 infection, the HLA-DRB104:01 allele is enriched among asymptomatic carriers, particularly in populations of European descent. A 2021 study in North East England compared 69 asymptomatic healthcare staff to 49 patients with severe respiratory failure, finding DRB104:01 frequency at 16.7% in the asymptomatic group versus 5.1% in severe cases (p = 0.003, adjusted for age and sex), with ordinal logistic regression confirming a protective effect (coefficient -0.326, p = 0.001). This suggests enhanced viral peptide binding and T cell activation in DRB1*04:01 carriers, reducing progression to severe disease.[50] Mechanistically, HLA-DR4's polymorphic peptide-binding groove can alter presentation of viral epitopes, influencing immune evasion or efficacy. Post-2020 research on COVID-19 reveals that DR4 variants may facilitate broader recognition of SARS-CoV-2 peptides, such as those from the spike protein, potentially mitigating immune escape by variants like Omicron through diversified T cell responses. However, this comes at the cost of heightened inflammation in some contexts, though no DR4-specific evasion patterns dominate recent data.[51][52] In oncology, HLA-DR4 haplotypes confer variable risks for certain malignancies, often intersecting with chronic inflammation. The DR8-DQ4 haplotype is implicated in elevated susceptibility to papillary thyroid carcinoma, where it promotes dysregulated immune surveillance of thyroid follicular cells.[53] A large cohort analysis identified DR8-DQ4 (related to DR4 networks) as an independent marker, but overlapping DR4 associations appear in Hürthle cell variants, with DR4 detected in up to 86% of cases, suggesting impaired tumor antigen presentation.[54] For gastric cancer, HLA-DR4 correlates with aggressive features, including increased lymph node metastasis in poorly differentiated subtypes. A serological study of advanced cases reported higher DR4 prevalence among those with nodal involvement (odds ratio approximately 2-3), linking it to deficient anti-tumor T cell responses against Helicobacter pylori-associated antigens. No seminal 2005 Hashimoto-led study confirms a direct DR4-gastric link, but earlier works highlight this pattern.[55] Recent investigations (2022-2025) reveal no major breakthroughs in HLA-DR4-oncology ties, but stratification of type 1 diabetes (T1D) cohorts by DR4 highlights potential oncology implications, as shared autoimmune pathways may elevate cancer surveillance needs in DR4 carriers without altering core risk metrics (as of 2025).[56]Associations by Haplotype and Genotype
HLA-DR4 haplotypes, such as DRB104:01-DQA103:01-DQB1*03:02 (commonly denoted as DR4-DQ8), confer a substantially elevated risk for type 1 diabetes (T1D), with odds ratios exceeding 10 in heterozygous combinations with DR3 haplotypes.[57] This haplotype's influence extends beyond isolated alleles, as its cis configuration enhances antigen presentation of islet autoantigens, amplifying autoimmune destruction in T1D pathogenesis.[58] Genotypic configurations of HLA-DR4 further refine disease risk and severity profiles. Homozygosity for DR4 alleles carrying the shared epitope (SE), such as DRB1*04:01/04:01, is linked to increased RA severity, including accelerated joint erosion and higher concordance rates in monozygotic twins compared to heterozygous or non-SE genotypes.[59] Compound heterozygotes like DRB104:01/*04:04 show elevated production of anti-cyclic citrullinated peptide (anti-CCP) antibodies, correlating with more aggressive RA phenotypes and poorer radiographic outcomes.[60] These genotypic effects arise from codominant expression of HLA-DR molecules, where both alleles contribute equally to the immunopeptidome, as evidenced by twin studies demonstrating stronger genotype-phenotype correlations in concordant pairs for autoimmune traits.[61] Pleiotropic effects of DR4 haplotypes underscore their broad impact across diseases, with some configurations increasing risk for multiple autoimmune conditions while others confer protection. A 2025 study in the American Journal of Human Genetics analyzed 1,750 HLA haplotypes across 647 diseases, revealing that DR4-linked variants exhibit trade-offs, such as heightened T1D and RA susceptibility alongside reduced odds for certain infections (as of 2025).[62] Recent updates highlight genotypic heterogeneity within DR4 carriers for T1D; a 2025 medRxiv preprint stratified cases by DR3/DR4 status, identifying distinct progression pathways, including moderate genetic correlations (rg=0.6) between DR4-T1D and DR3-T1D subsets, with implications for personalized risk prediction (as of 2025).[43]Population and Evolutionary Aspects
Global Allele Distribution
HLA-DR4, encompassing the HLA-DRB1*04 allele group, exhibits marked variation in prevalence across global populations, reflecting historical migrations, genetic drift, and potential selective pressures such as pathogen exposure. In indigenous Amerindian groups, DR4 allele frequencies are notably high, often reaching 50-70%, as observed in populations like the Argentine Kolla (42%) and Brazilian Kaingang (27%), with overall estimates for Native American groups around 34-63% based on aggregated data.[63][64] In contrast, frequencies are lower in sub-Saharan African populations, typically under 10%, with specific examples including 4.4-4.7% among ethnic groups in northern Ghana such as the Kassem and Nankam. Among Europeans and those of Caucasian descent, DR4 allele frequencies range from 20-30%, with representative values of 24% in southwestern United Kingdom populations and 20% in ethnic Norwegians. In East Asian populations, the overall DR4 frequency is approximately 15-20%, dominated by the HLA-DRB1*04:05 subtype, which accounts for the majority of cases and reaches up to 28% in certain Japanese cohorts, though general population estimates hover around 18-22% in groups like Japanese and South Koreans. South Asian populations show intermediate frequencies, around 15%, as seen in Tamil Nadu Dravidians at 25%. These patterns are compiled from the Allele Frequencies Net Database (AFND), which aggregates data from over 1,300 population studies involving millions of individuals as of 2025.[64][65][66]| Population Group | Representative Frequency (Allele, %) | Example Populations | Source |
|---|---|---|---|
| Amerindians | 34-63 | Argentine Kolla (42), Brazilian (63.4 max) | AFND; Arnaiz-Villena et al. (2007)[64][67] |
| Europeans/Caucasians | 20-30 | UK Southwest (24), Norwegians (20) | AFND[64] |
| East Asians | 15-20 | Japanese (22), South Koreans (20) | AFND; Tsuchiya et al. (2023)[64][66] |
| Sub-Saharan Africans | <10 | Ghana Kassem/Nankam (4.5) | AFND; Gyan et al. (2008)[64] |
| Global Average | ~20 | Aggregated worldwide | AFND[64] |