The WD40 repeat is a short structural motif found in numerous eukaryotic proteins, typically comprising 40–60 amino acid residues and characterized by a conserved glycine-histidine (GH) dipeptide near the N-terminus and a tryptophan-aspartate (WD) dipeptide at the C-terminus. These repeats fold into four anti-parallel β-strands (labeled A–D) that form individual blades in a β-propeller architecture, most commonly with seven blades arranged in a circular scaffold. First identified in 1994 as an ancient regulatory motif in G-protein β-subunits, the WD40 repeat serves primarily as a non-enzymatic platform for mediating protein–protein interactions without intrinsic catalytic activity.[1][2]WD40 domain-containing proteins are among the most abundant and versatile in eukaryotic genomes, with over 260 non-redundant members identified in humans alone, often grouped into subfamilies based on additional domains or functions. The β-propeller structure provides a stable, rigid scaffold with distinct surfaces—the top for specific ligand binding, the bottom for core interactions, and the sides for inter-domain contacts—enabling recognition of short linear motifs, post-translational modifications like phosphorylation, and even nucleic acids. This adaptability allows WD40 proteins to participate in diverse cellular processes, including signal transduction (e.g., in G-protein signaling), cell cycle regulation (e.g., via CDC20 in the anaphase-promoting complex), apoptosis (e.g., through TRAF proteins), and chromatin modification (e.g., in histone deacetylase complexes).[3][1][1]Notable examples include the G-protein β-subunits, which were pivotal in the motif's discovery and facilitate heterotrimeric G-protein activation in response to G-protein-coupled receptors, and the COP1 protein, which uses its WD40 domain to target transcription factors for degradation in plant photomorphogenesis and animal circadian rhythms. In disease contexts, mutations in WD40 repeat proteins like WDR45 are linked to neurodegeneration, such as beta-propeller protein-associated neurodegeneration (BPAN), highlighting their essential roles in cellular homeostasis. Overall, the WD40 repeat's evolutionary conservation across eukaryotes underscores its fundamental importance in assembling multiprotein complexes for regulatory functions.[1][4][1]
Structure and Sequence
Sequence Motif
The WD40 repeat is a short structural motif typically comprising 40–60 amino acid residues, from which the name "WD40" is derived. This motif is characterized by a conserved glycine-histidine (GH) dipeptide near the N-terminus and a tryptophan-aspartate (WD) dipeptide at the C-terminus.[2][5]The sequence of the WD40 repeat exhibits considerable variability, particularly in the flexible loop regions between beta-strands, while maintaining a coreconsensus pattern featuring the GH and WD dipeptides with intervening hydrophobic residues, as captured by models like the PROSITE pattern [LIVMSTAC]-[LIVMFYWSTAGC]-[LIMSTAG]-[LIVMSTAGC]-x(2)-[DN]-x-{P}-[LIVMWSTAC]-{DP}-[LIVMFSTAG]-W-[DEN]-[LIVMFSTAGCN].[6] This pattern reflects the essential sequence features that define the motif across diverse proteins.A typical WD40 domain comprises 4–9 repeats, with 7 being the most common number, which collectively fold into a closed propeller architecture. Identification of WD40 repeats in protein sequences relies on computational methods such as hidden Markov models (HMMs), which model the probabilistic patterns of the motif to predict its presence with high accuracy.[7]
Three-Dimensional Structure
The WD40 repeat domain adopts a circular β-propeller fold, typically consisting of 4 to 8 blades arranged symmetrically around a central axis, with each blade formed by four anti-parallel β-strands designated A (innermost) to D (outermost). This architecture creates a toroidal, tapered structure that serves as a stable scaffold for protein interactions. The overall domain spans approximately 4-6 nm in diameter, with a central tunnel of about 1-2 nm wide.[8][9]Due to circular permutation in the sequence, the β-strands within each blade are contributed by adjacent repeats: the D strand from one repeat pairs as the outer strand with the A, B, and C strands from the next repeat to complete the blade. The inner A and B strands form hydrogen bonds with each other, while the outer C and D strands are linked by flexible loops, enabling the skewed arrangement that propagates around the propeller. Closure of the ring occurs via a "velcro" mechanism in the final blade, where N-terminal strands supply the missing outer elements, facilitating velocity-dependent folding and enhancing stability during biosynthesis.[8][10][11]The propeller's stability is reinforced by a hydrophobic core formed by packed side chains from the β-strands, supplemented by inter-blade salt bridges and hydrogen bonds that rigidify the interface between repeats. Crystal structures exemplify this fold, such as the 7-bladed propeller of human WDR5 (PDB: 2GNQ), which demonstrates the symmetric blade arrangement and conserved core interactions. Variations exist, including incomplete propellers with fewer than 4 full blades that may oligomerize to approximate a complete ring, or hybrid domains where WD40 motifs fuse with other folds like TPR repeats for specialized functions.[8][12][13][14]
Function and Interactions
Protein-Protein Interactions
WD40 repeats facilitate protein-protein interactions primarily through the exposed surfaces of their β-propeller fold, where the top and bottom faces, along with interconnecting loops and blade edges, provide versatile platforms for molecular recognition and binding. These surfaces are characterized by conserved hotspot residues that enable stable or transient associations with partner proteins, allowing WD40 domains to act as central hubs in complex assemblies.[1][12]In their adaptor role, WD40 repeats serve as scaffolds for multi-protein complexes, recruiting key regulatory elements such as ubiquitin ligases and transcription factors to coordinate sequential or cooperative interactions. The modular nature of the β-propeller architecture supports simultaneous binding of multiple partners, enhancing the efficiency of signal transduction and regulatory processes at the molecular level.[15][16]Binding specificity is dictated by distinct electrostatic and hydrophobic patches distributed across the propeller surface, which selectively engage complementary features on target molecules; for example, clusters of charged residues on the top face can interact with oppositely charged motifs like those on histone tails. Hydrophobic grooves and rings formed by conserved residues further stabilize these interfaces, ensuring partner selectivity amid diverse cellular contexts.[16][17][12]The dynamics of WD40-mediated interactions are supported by flexible loops that undergo localized conformational changes upon binding, which refine the binding pocket and enhance overall affinity without disrupting the core propeller rigidity. These adaptive adjustments allow for fine-tuned regulation of interaction strength and duration.[18]Co-immunoprecipitation assays have confirmed the formation of these multi-component complexes, revealing dependencies on specific surface residues for partner recruitment. Complementarily, NMR spectroscopy has elucidated the transient and dynamic aspects of WD40 interactions, capturing intermediate states and loop movements that underpin binding mechanisms.48475-4/fulltext)[19]
Roles in Cellular Processes
WD40 repeat-containing proteins serve as versatile scaffolds that integrate and modulate diverse cellular pathways, facilitating signal transduction, protein turnover, gene expression control, and cell fate decisions. These proteins often act as hubs in multiprotein complexes, enabling precise regulation of cellular homeostasis through their β-propeller structures that coordinate interactions without enzymatic activity themselves.[20]In signal transduction, WD40 repeats contribute to pathway modulation by recognizing phosphorylated substrates and linking them to degradation machinery, as seen in the Wnt/β-catenin pathway where they facilitate β-catenin turnover to prevent aberrant signaling. For instance, the WD40 domain in substrate adaptors binds phosphorylated β-catenin, promoting its ubiquitination and degradation, thereby maintaining low cytoplasmic levels in the absence of Wnt ligands and allowing rapid response to signaling cues. This scaffolding role ensures tight control over pathway activation, preventing constitutive signaling that could lead to uncontrolled cell proliferation.[21]WD40 repeats play a critical role in protein degradation by functioning as substrate adaptors within cullin-RING E3 ubiquitin ligase complexes, such as CUL4-DDB1, where they selectively recruit targets for ubiquitination and proteasomal breakdown. These adaptors, often containing DWD motifs, dock onto the DDB1 subunit via specific sequence elements, enabling the ligase to target a wide array of substrates involved in DNA repair, replication, and cell cycle progression. This adaptability allows a single cullin scaffold to assemble numerous distinct E3 ligases, each tailored to specific cellular needs, thus regulating protein levels with high fidelity.[22]In transcription regulation, WD40 repeats mediate the recruitment of co-activators and repressors to chromatin, influencing histone modifications and gene expression patterns. For example, certain WD40 proteins recognize unmodified or symmetrically dimethylated arginine 2 on histone H3, facilitating the assembly of Trithorax group complexes that deposit activating H3K4 methylation marks at promoters and enhancers. Conversely, other WD40 domains bind repressive marks like H3K27me3, aiding polycomb repressive complex 2 in silencing developmental genes, thereby balancing activation and repression to fine-tune transcriptional outputs across cellular contexts.[23]WD40 repeats are integral to apoptosis and cell cycle control, particularly through their involvement in checkpoint mechanisms that ensure genomic integrity. In the anaphase-promoting complex/cyclosome (APC/C), WD40-containing activators bind substrates bearing destruction motifs, triggering their ubiquitination to drive mitotic progression and prevent aneuploidy. For instance, the seven WD40 repeats form a β-propeller that recruits securin and cyclin B for degradation, enabling sister chromatid separation and mitotic exit, while dysregulation can activate apoptotic pathways in response to prolonged mitotic arrest.[24]Dysregulation of WD40 repeat proteins is implicated in various diseases, notably cancer and endocrine disorders, where altered scaffolding disrupts normal pathway fidelity. In oncogenesis, aberrant WD40 function can stabilize pro-proliferative signals or impair degradation of oncoproteins, contributing to tumorigenesis across multiple tissues. In endocrine contexts, mutations or misregulation of these proteins interfere with nuclear receptor signaling, leading to disruptions in hormone-responsive gene expression and syndromes like congenital hypothyroidism or adrenal insufficiency.[25][26]
Occurrence and Evolution
Distribution Across Organisms
WD40 repeats are ubiquitous across eukaryotic organisms, serving as a fundamental structural motif in diverse cellular contexts. Genome-wide analyses have identified approximately 262 non-redundant WD40-containing genes in the human genome, reflecting their extensive role in complex multicellular eukaryotes.[3] In simpler eukaryotes such as the yeast Saccharomyces cerevisiae, the count is notably lower, with 83 WD40 proteins annotated, indicating a scaling with organismal complexity.[27]In contrast, WD40 repeats are rare in prokaryotes, with systematic surveys identifying only about 4,000 such proteins across bacterial and archaeal genomes, primarily in select phyla like Actinobacteria and Proteobacteria.[10] This scarcity supports the hypothesis of a predominantly eukaryotic origin and expansion of the WD40 family, though isolated prokaryotic instances suggest possible ancient horizontal transfer or convergent evolution.[28]Among plants, WD40 repeats exhibit expansions tailored to specific physiological demands, such as growth and metabolism. A 2024 genome-wide study in Capsicum annuum (pepper) revealed 269 CaWD40 genes, many implicated in developmental and metabolic pathways.[29] Similarly, a 2025 pan-WD40ome analysis across 26 diverse maize inbred lines identified 6,849 WD40 genes, highlighting line-specific structural and functional diversity that underscores adaptive variations in plant genomes.[30]In vertebrates, WD40 gene numbers generally exceed those in invertebrates, correlating with increased regulatory complexity; for instance, human counts surpass the ~172 in silkworm (Bombyx mori).[31] Comprehensive annotation resources like WDSPdb facilitate cross-species comparisons, cataloging over 600,000 predicted WD40 proteins from 4,426 species, including detailed structural predictions for eukaryotic and rare prokaryotic entries.[32]
Evolutionary Conservation
The WD40 repeat domain traces its origins to the last eukaryotic common ancestor (LECA), where it emerged as a versatile structural module essential for protein interactions across diverse cellular functions. Phylogenetic analyses indicate that core WD40 repeats have been highly conserved throughout eukaryotic evolution, maintaining structural integrity and functional roles in phyla ranging from protists to metazoans. This deep conservation underscores the domain's fundamental importance, with homologs identifiable in nearly all eukaryotic lineages but absent or rudimentary in prokaryotes, suggesting an eukaryotic-specific innovation.[33][28]Sequence conservation of WD40-containing proteins exhibits gradients correlated with the number of repeats, where proteins harboring more repeats (typically 7 or greater) display elevated evolutionary stability compared to those with fewer (often 4-6). Recent pan-genome analyses of maize inbred lines reveal that WD40 proteins with higher repeat counts are more frequently retained in core orthogroups across diverse accessions, reflecting stronger purifying selection and functional constraints. In contrast, proteins with fewer repeats show greater variability and are prone to lineage-specific losses, highlighting how repeat multiplicity influences resilience to evolutionary pressures. These patterns align with observations in other eukaryotes, where multi-repeat architectures correlate with enhanced conservation under selective regimes.[30][10]The expansion of WD40 repeats primarily occurred through tandem and segmental gene duplications, enabling the diversification of β-propeller structures while preserving core motifs. Tandem duplications within genes facilitated the addition of repeats, promoting architectural complexity, whereas whole-gene duplications contributed to family-wide proliferation. Horizontal gene transfer appears rare, confined mostly to prokaryotic contexts and not a significant driver in eukaryotic WD40 evolution.[3][28]Adaptive variations in WD40 repeats reflect lineage-specific evolutionary pressures, particularly in specialized metabolic and signaling pathways. In plants, such as Capsicum annuum, WD40 proteins have evolved to regulate secondary metabolism, including anthocyanin biosynthesis, as evidenced by genome-wide identifications linking specific CaWD40 genes to pigmentation and stress responses. In animals, lineage-specific expansions of the WD40 family have supported adaptations in signaling cascades, with duplications enhancing roles in pathways like those involving kinases and apoptosis regulators. These divergences illustrate how conserved scaffolds accommodate functional innovations without compromising structural fidelity.[34][35]
Examples and Applications
Notable Proteins Containing WD40 Repeats
One prominent example of a protein featuring WD40 repeats is WDR5, which forms a seven-bladed β-propeller structure essential for its role in histone methylation complexes. WDR5 serves as a core subunit in the MLL/COMPASS complex, where its WD40 domain binds to the histone H3 tail, facilitating H3K4 trimethylation and thereby promoting transcriptional activation.[36] This interaction is critical for the complex's catalytic activity, as structural studies reveal that the central cavity of the WDR5 β-propeller accommodates the unmodified arginine 2 of histone H3, stabilizing substrate presentation to the methyltransferase.[37]In cell cycle regulation, CDC20 exemplifies a WD40 repeat-containing activator of the anaphase-promoting complex/cyclosome (APC/C). CDC20 possesses a C-terminal WD40 domain that forms a β-propeller, enabling it to bind substrates and activate APC/C ubiquitination during mitosis.[38] This domain recognizes destruction motifs like D-boxes and KEN-boxes on mitotic regulators, ensuring timely degradation for metaphase-to-anaphase transition and mitotic exit.DDB1, involved in DNA damage response, is a key adaptor in the cullin4-DDB1 E3 ubiquitin ligase complex, characterized by three WD40 β-propeller domains (BPA, BPB, and BPC) that form a unique triple-propeller architecture. The BPA and BPC propellers create a docking platform for substrate receptors, while BPB interacts with cullin4, facilitating ubiquitination of proteins like histones and replication factors in response to UV-induced damage.[39]The G-protein signaling pathway highlights transducin β (GNB1), where the protein's seven WD40 repeats assemble into a canonical seven-bladed β-propeller that mediates heterotrimer formation with Gα and Gγ subunits. This structure is pivotal in phototransduction, as the propeller's blades provide interaction surfaces for effector binding, transmitting signals from activated rhodopsin to downstream cyclic nucleotidephosphodiesterase in rod cells.[1]In plants, TTG1 (TRANSPARENT TESTA GLABRA1) in Arabidopsis thaliana is a WD40 repeat protein that regulates trichome development through its β-propeller domain, which interacts with MYB and bHLH transcription factors to form an activator complex.[40] Mutations in TTG1 disrupt this complex, leading to glabrous phenotypes and impaired epidermal cell differentiation, underscoring its role in cell fate specification.[41]Proteins with hybrid domains, such as COP1 in plants, combine WD40 repeats with other motifs to integrate signaling pathways. COP1 features a C-terminal WD40 β-propeller alongside an N-terminal RING finger and coiled-coil domain, enabling it to function as an E3 ubiquitin ligase in photomorphogenesis by targeting transcription factors like HY5 for degradation in the dark.[42] The WD40 domain in COP1 facilitates substrate recognition and complex assembly with SPA proteins, balancing light-dependent de-etiolation.[43]
Therapeutic and Research Applications
WD40 repeat domains, traditionally viewed as challenging targets due to their involvement in protein-protein interactions (PPIs) considered "undruggable," have emerged as viable candidates for small-molecule intervention through systematic ligandability assessments. A 2025 evaluation systematically screened and characterized ligands for over 100 WD40 repeat (WDR)-containing proteins, identifying potent binders for several family members and demonstrating that these β-propeller structures can accommodate drug-like molecules at conserved sites, thereby opening avenues for modulating PPIs in disease contexts.[44] This approach builds on initiatives like the Structural Genomics Consortium's Target 2035 project, which from 2020 to 2025 focused on WDR proteins to develop chemical probes and advance dark proteome targeting.[45]Proteolysis-targeting chimeras (PROTACs) exploiting WD40 repeats have shown promise for selective protein degradation, particularly via DCAF1 and WDR5. In 2025, a small-molecule probe, OICR-41103, was developed as a potent, selective binder to the DCAF1 WD40 domain, enabling recruitment to CRL4 E3 ligase for targeted degradation of neo-substrates in cellular models.[46] For WDR5, DCAF1-based PROTACs were reported in 2024, with crystal structures of ternary complexes revealing how these bifunctional molecules induce WDR5 ubiquitination and proteasomal degradation, reducing oncogenic signaling in cancer cells.[47] Similarly, VHL-recruiting WDR5 PROTACs discovered in 2023-2024 achieved nanomolar potency in degrading WDR5, suppressing proliferation in pancreatic ductal adenocarcinoma models.[48]In cancer therapeutics, modulation of WD40 repeats, especially WDR5, has been linked to oncogenesis through its role in MLL complexes and MYC stabilization. A 2023 review highlighted WDR5 inhibitors and degraders that impair histone methylation and gene expression in leukemias and solid tumors, with expanded profiling in 2024 confirming their efficacy in disrupting MYC-driven proliferation across multiple cancer cell lines.[49][50] Pharmacologic WDR5 degradation via PROTACs in 2025 further suppressed synovial sarcoma growth in vitro and in vivo by targeting its interaction with SS18-SSX fusion proteins.[51] Emerging links to endocrine dysregulation involve WD40-mediated hormone signaling, though therapeutic modulation remains preclinical.Research tools leveraging WD40 repeats have advanced functional genomics and structural studies. CRISPR-Cas9 knockout screens have identified essential WD40 proteins in viral replication and cellular pathways; for instance, a 2022 screen pinpointed WDR81 as a host factor for reovirus infection, enabling dissection of WD40 roles in endosomal trafficking.[52] Cryo-electron microscopy (cryo-EM) has provided high-resolution insights into WD40-containing complexes, such as the 2024 structure of the Spo11 core complex bound to DNA, revealing how WD40 repeats from Ski8 organize meiotic double-strand break machinery.[53] These tools facilitate targeted perturbations and visualization of dynamic assemblies.Challenges in WD40-targeted therapies include achieving specificity amid multi-WD40 architectures and off-target effects in the expansive WDR family. Pan-WD40ome analyses, such as a 2025 study across maize inbred lines, underscore structural diversity and conservation, informing human efforts but highlighting the need for isoform-selective ligands.[30] Future directions emphasize comprehensive WD40ome mapping and integrative approaches combining AI-driven ligand design with high-throughput screens to overcome these hurdles.[44]