Fact-checked by Grok 2 weeks ago

Structural motif

A structural motif is a conserved three-dimensional arrangement of secondary structural elements in biomolecules, such as proteins and nucleic acids, that recurs across diverse sequences without direct evolutionary relatedness and often serves as a modular building block for larger functional architectures. In proteins, structural motifs typically consist of one or more secondary structure elements—like α-helices, β-strands, or loops—connected in specific geometries, forming supersecondary or tertiary patterns that can influence folding, stability, and interactions. Examples include the motif, common in DNA-binding proteins for sequence-specific recognition, and the beta-alpha-beta unit, a recurrent motif in nucleotide-binding enzymes like dehydrogenases. These motifs are smaller than domains but larger than individual secondary structures, enabling the prediction of protein function from atomic models in databases like the . In nucleic acids, particularly , structural motifs are recurrent folds involving base-pairing, stacking, and backbone conformations that stabilize secondary and tertiary structures, independent of primary sequence. Key examples encompass tetraloops (such as GNRA or UNCG types), which cap helical stems and mediate long-range interactions, and the kink-turn motif, which introduces sharp bends essential for assembly. These motifs underpin RNA's modular architecture, facilitating diverse roles in , , and molecular . Overall, structural motifs highlight the principle of structural in , where similar folds arise from unrelated sequences to fulfill analogous functions, aiding in bioinformatics tools for motif detection, , and understanding biomolecular .

Overview and Fundamentals

Definition and Characteristics

A structural refers to a recurring three-dimensional () arrangement of atoms, residues, or structural elements within biomolecules, such as proteins or nucleic acids, that appears across evolutionarily unrelated molecules despite lacking . These motifs are typically stabilized by non-covalent interactions, including hydrogen bonds, hydrophobic effects, and van der Waals forces, which enable the folding of disparate primary sequences into similar spatial configurations. Unlike larger structural domains, motifs often represent smaller, modular units that serve as fundamental building blocks in biomolecular architecture, such as compact features like beta turns or more extended patterns like beta hairpins. Key characteristics of structural motifs include their structural independence from primary sequence, allowing for where similar 3D folds emerge in proteins or RNAs with minimal shared ancestry. They play crucial roles in guiding biomolecular folding pathways and enabling functional properties, such as sites or catalytic regions, by providing stable points for higher-order . For instance, motifs facilitate the transition from secondary structural elements (e.g., helices or sheets) to folds, often through supersecondary structures that combine multiple motifs into cooperative units. Structural motifs are distinct from sequence motifs, which are defined by specific patterns in the linear primary structure, such as conserved or sequences that may predict function but do not necessarily dictate 3D geometry. In contrast, structural motifs emphasize and can occur in molecules with divergent sequences, highlighting their reliance on physicochemical principles rather than genetic inheritance. Archetypal examples include the in proteins, a simple anti-parallel beta-sheet loop stabilized by hydrogen bonding, and the stem-loop in , a double-stranded capped by a single-stranded loop that aids in tertiary interactions.

Historical Development

The concept of structural motifs in proteins emerged in the through advancements in , which allowed researchers to visualize and classify recurring three-dimensional arrangements of secondary structures. Jane S. Richardson's seminal 1977 analysis of β-sheet topologies in known protein structures identified patterns such as parallel and antiparallel sheets, establishing these as fundamental motifs that reflect evolutionary relatedness among proteins. Her work, based on the limited but growing set of atomic-resolution structures available at the time, shifted focus from linear sequences to spatial architectures, laying the groundwork for motif-based classification in . In the 1980s, the recognition of structural motifs expanded to nucleic acids, particularly through crystallographic studies of (tRNA), where stem-loop hairpins were identified as conserved elements critical for folding and function. Additional tRNA structures solved during this decade, building on earlier low-resolution models, highlighted these motifs as modular units in RNA architecture. Concurrently, the (PDB), established in 1971, experienced exponential growth, reaching over 10,000 entries by 1999, which facilitated systematic motif detection across diverse biomolecules. Influential contributions included Alexander Rich's elucidation of alternative DNA conformations, such as left-handed in synthetic , which demonstrated motifs' role in DNA dynamics and regulation. Ada Yonath's pioneering crystallization of ribosomal subunits in 1980, culminating in high-resolution structures by the early 2000s, revealed intricate RNA and protein motifs within the , underscoring their functional interdependence. Post-2000 developments integrated (NMR) spectroscopy and (cryo-EM) to resolve motifs in larger, dynamic complexes previously inaccessible to methods. These techniques enabled the mapping of interactions and quaternary assemblies, enriching motif catalogs in the PDB. The understanding of motifs evolved from sequence-driven to structure-centric paradigms, revealing conserved folds amid sequence divergence and informing evolutionary models. The 2021 release of profoundly accelerated motif prediction by achieving near-experimental accuracy for protein structures, democratizing access via the AlphaFold Database and enabling motif annotation at scale. Between 2023 and 2025, AI advancements, including 3's multimodal predictions and generative models for , further propelled AI-driven motif databases, enhancing discovery of novel motifs in non-canonical biomolecules.

Structural Motifs in Nucleic Acids

Motifs in RNA Structures

RNA structural motifs are recurring three-dimensional folds that contribute to the functional versatility of RNA molecules, often forming through base pairing and tertiary interactions despite sequence variability. These motifs enable RNA to adopt compact architectures for roles in gene regulation, , and molecular recognition. In RNA, motifs like stem-loops and pseudoknots build secondary and structures, while elements such as kink-turns and A-minor motifs introduce bends and stabilize helices, facilitating dynamic conformational changes. The -loop, also known as the motif, is the most common secondary structure in , consisting of a double-stranded formed by Watson-Crick base pairing between complementary sequences and a single-stranded loop at the apex. This motif arises through intramolecular base pairing, typically involving A-U and G-C pairs in an A-form helix, with the loop size ranging from 3 to 8 nucleotides to minimize steric strain. Stem-loops are prevalent in microRNAs (miRNAs), where the characteristic structure is processed by to generate mature miRNAs that regulate via mRNA targeting. In , such as the from satellite tobacco ringspot virus, stem-loops form part of the catalytic core, positioning substrates for phosphodiester bond cleavage through reversible ligation. Pseudoknots represent a more complex involving stem-loops that create non-nested topologies, where a single-stranded pairs with a loop from another stem to form a second . This structure is stabilized by bonds, such as base triples (e.g., C+·(C-G)) and loop- interactions, which enhance thermal stability and compactness. Pseudoknots play a critical role in programmed ribosomal frameshifting, particularly -1 frameshifting in viruses, by acting as a mechanical barrier that pauses the over slippery sequences, allowing alternative ; for instance, the pseudoknot in (PLRV) achieves 15-20% frameshift efficiency to produce fusion proteins essential for replication. Beyond frameshifting, pseudoknots contribute to activity in the hepatitis delta virus (HDV) , where a nested forms the catalytic core for self-cleavage. Other notable RNA motifs include kink-turns and A-minor interactions, which modulate helical geometry and packing. The kink-turn (K-turn) is a recurrent motif comprising two helices connected by a three-nucleotide bulge and tandem sheared G·A base pairs, inducing a sharp ~120° bend in the RNA axis by juxtaposing minor grooves and widening the major groove. Structural parameters include C1′–C1′ distances of ~10.2 Å for canonical N1-class pairs and ~8.9 Å for tighter N3-class pairs, with the motif requiring Mg²⁺ ions or protein binding (e.g., L7Ae) for full folding. K-turns occur in ribosomal RNAs (e.g., Kt-7 in 23S rRNA) and spliceosomal U4 snRNA, facilitating compact folding. The A-minor motif involves the insertion of adenine's minor-groove edges from loops into the minor groove of adjacent A-form helices, forming hydrogen bonds with ribose 2′-OH groups of receptor base pairs, predominantly G-C steps. Classified into types I (multiple H-bonds) and II (single/double H-bonds), it stabilizes coaxial stacking of helices, as seen in 16S/23S rRNAs where 65% correlate with helix junctions, without altering groove widths but enhancing interhelical packing. These motifs underpin RNA's functional diversity, enabling and while exhibiting evolutionary . In self-splicing introns, motifs like pseudoknots and kink-turns in group introns (e.g., D6 branch ) position the bulged nucleophile for lariat formation, catalyzing excision via a two-metal-ion conserved from bacterial introns to eukaryotic spliceosomes. Regulatory roles include miRNA-mediated silencing via stem-loops and pseudoknot-driven frameshifting in viral genomes, allowing adaptive . Despite divergence, these motifs are evolutionarily conserved, as evidenced by shared catalytic cores (e.g., J5/6 linker in group introns) across distant , underscoring RNA's ancient role in splicing and highlighting structural primacy over in .

Motifs in DNA Structures

Structural motifs in DNA primarily manifest as deviations from the canonical right-handed B-form double helix, adopting stable three-dimensional configurations that influence genomic function. These non-B-form structures, such as cruciforms, G-quadruplexes, D-loops, and H-DNA, arise from specific sequence elements like inverted repeats or G-rich tracts and are often stabilized by negative supercoiling or environmental factors. Unlike the transient folds in RNA, DNA motifs tend to be more static and supercoil-dependent, playing roles in static genomic architecture rather than catalysis. Cruciform structures form X-shaped conformations from inverted repeat sequences in double-stranded DNA, where opposing strands each extrude an intra-strand hairpin, creating two paired arms. Extrusion occurs via two mechanisms in negatively supercoiled DNA: a concerted C-type pathway involving simultaneous arm formation or a stepwise S-type pathway with sequential hairpin development. Folded cruciforms feature base stacking within the hairpin stems for stability, while unfolded variants exhibit sharp bends at the four-way junction. Negative supercoiling significantly stabilizes these structures by relieving torsional stress. Cruciforms serve as models for Holliday junctions, facilitating homologous recombination by promoting strand exchange. G-quadruplexes (G4s) emerge from G-rich sequences, where two or more planar G-tetrads—each comprising four guanines linked by Hoogsteen hydrogen bonds—stack to form quadruplex helices stabilized by monovalent cations like potassium. These structures adopt diverse topologies, including parallel or antiparallel strands, and are prevalent in telomeres, where they cap chromosome ends, and in gene promoters, modulating oncogene expression. As of 2025, investigations highlight G4s as anticancer targets, with ligands disrupting their formation to inhibit tumor cell proliferation by inducing DNA damage and altering transcription, including recent progress on novel G4-stabilizing agents and their clinical potential. Other notable motifs include D-loops, H-DNA, and , which involve triple-helical arrangements or helical inversions. D-loops, or displacement loops, occur when a single-stranded DNA invades a duplex, displacing one strand to form a three-stranded intermediate, often during . H-DNA represents an intramolecular in polypurine-polypyrimidine tracts with interruptions, where one strand folds back to bind via Hoogsteen pairing, yielding H-y ( motif) or H-p ( motif) conformations and leaving a single-stranded region. is a left-handed double-helical structure that forms preferentially in sequences with alternating s and s, such as repeats, under conditions of negative supercoiling or high salt concentrations; it features a zigzag backbone with alternating syn-anti conformations and dinucleotide repeat units. motifs are implicated in , , and immune responses, with anti-Z-DNA antibodies associated with autoimmune diseases like systemic . Structural metrics, such as propeller twist angles, distinguish these motifs; for instance, enhanced negative propeller twists in regions reduce base-pair planarity and enhance groove accessibility compared to B-DNA's typical -5° to -15° range. These motifs exert profound biological influence, often impeding replication fork progression to cause pauses that heighten rates and genomic instability. In transcription, structures like H-DNA block , thereby repressing at specific loci. Evolutionarily, non-B DNA motifs drive genome diversification by promoting recombination hotspots and regulatory innovations, underscoring their significance in gene regulation across species.

Structural Motifs in Proteins

Secondary Structure Motifs

Secondary structure motifs in proteins are local, recurring patterns in the polypeptide backbone stabilized primarily by hydrogen bonds between backbone atoms, serving as fundamental building blocks for higher-order folds. These motifs include , , and turns, which collectively enable the protein chain to adopt compact, functional conformations despite sequence variability. The is a right-handed coiled structure in which the polypeptide backbone forms a cylinder with approximately 3.6 residues per turn and a pitch of 5.4 . It is stabilized by intramolecular hydrogen bonds between the carbonyl oxygen of residue i and the hydrogen of residue i+4, creating a regular pattern along the helix axis. The characteristic backbone angles are φ ≈ -60° and ψ ≈ -45°, positioning these residues in a favored region of the . In membrane proteins, alpha helices often exhibit amphipathic character, with hydrophobic residues facing the and hydrophilic ones oriented toward the aqueous environment or protein interior. Beta sheets consist of two or more strands—extended polypeptide segments with φ ≈ -120° and ψ ≈ +120°—aligned either or antiparallel to form a pleated surface stabilized by interstrand bonds between carbonyl oxygens and hydrogens. Antiparallel sheets feature more linear hydrogen bonding patterns, while sheets have slightly offset bonds, contributing to overall stability. All observed beta sheets display a right-handed , typically around 10–20° per residue, arising from the intrinsic of L-amino acids and optimizing side-chain packing. A common within beta sheets is the , formed by two adjacent antiparallel strands connected by a short turn of 2–5 residues, allowing the chain to reverse direction compactly. Turns and loops connect secondary structure elements, with beta turns being the most prevalent type for reversing chain direction over four consecutive residues. Classified into types I, II, II', and I' based on angles and ing patterns (e.g., type I features a between residues i and i+3 with specific φ/ψ values for positions i+1 and i+2), these turns accommodate diverse side chains while maintaining backbone flexibility. Type IV turns, lacking a standard i to i+3 , provide additional variability. Loops, often unstructured, link distant elements but contribute to functional sites. In globular proteins, secondary structure motifs account for approximately 50% of residues, with the remainder in loops or irregular regions; this prevalence is assessed through Ramachandran plots, which map allowed φ/ψ angles and highlight secondary structure clusters./01:_Unit_I-_Structure_and_Catalysis/04:_The_Three-Dimensional_Structure_of_Proteins/4.02:_Secondary_Structure_and_Loops)

Tertiary and Quaternary Motifs

structural motifs in proteins represent compact, three-dimensional folds that integrate multiple secondary structural elements to form functional units, often enabling specific interactions such as binding or enzymatic activity. These motifs extend beyond local secondary structures by incorporating spatial arrangements that stabilize higher-order architectures, frequently involving hydrophobic cores, hydrogen bonding networks, and sometimes metal ion coordination. In contrast to isolated alpha helices or beta sheets, tertiary motifs like the and exemplify how proteins achieve precise recognition of macromolecules such as DNA. The (HTH) consists of two connected by a short turn of three to four , forming a recognition that inserts into the major groove of for sequence-specific binding. This is stabilized by hydrophobic interactions between the helices and often operates as part of a three- bundle, where the second helix makes direct contacts with DNA bases via side-chain atoms. A classic example is the lambda repressor protein from bacteriophage lambda, where the HTH binds to sequences (O_L1), regulating viral through lock-and-key complementarity and induced fit mechanisms. Zinc finger motifs, particularly the C2H2 subtype, feature a compact beta-beta-alpha fold coordinated by a zinc ion bound to two cysteine and two histidine residues, creating a stable domain approximately 25-30 amino acids long. The zinc stabilizes a short beta hairpin followed by an alpha helix, with the helix presenting key residues (often at positions -1, 2, 3, and 6 relative to the helix) that contact three to four DNA base pairs in the major groove. This modular architecture allows tandem arrays of C2H2 fingers to recognize extended DNA sequences (20-40 base pairs), facilitating RNA binding in some cases, as seen in transcription factors like CTCF that regulate chromatin boundaries. Among other tertiary motifs, the Greek key represents a topological of four antiparallel strands connected by a characteristic crossover , forming of many beta-barrels and beta-sandwiches. This motif, often left-handed in beta-barrels with five or six strands, enforces specific folding constraints and is prevalent in half of analyzed beta-barrel structures, contributing to protein stability through edge-to-edge strand packing. The omega , a nonregular secondary structure, appears as a surface-exposed segment of 6-16 residues lacking regular dihedral angles but rich in bonds, adopting a shape that remains flexible yet functionally critical. These loops often act as lids over active sites, enabling substrate access in enzymes and molecular in binding proteins. Nest and niche motifs provide small, precise sites for binding within folds, utilizing main-chain carbonyl or groups from residue triads or tetrads. A nest, typically a three-residue , binds anions like phosphates via bridging NH atoms from positions i and i+2, commonly found in Schellman loops and iron-sulfur clusters. In contrast, niches—such as the three-residue niche3 (with alpha-beta conformation at i+2 and i+3) or four-residue niche4 (alpha-alpha-beta at i+2 to i+4)—accommodate cations like +, +, or Ca2+ through carbonyl oxygens, occurring in about 7% of soluble protein residues and often at beta-turns or alpha-helix C-termini to support in enzymes like the +/+-. Quaternary motifs extend tertiary elements across subunit interfaces, promoting multimer assembly and functional regulation. The , a hallmark of basic region-leucine zipper (bZIP) transcription factors, forms a coiled-coil dimer via alpha helices with residues at the d positions of a heptad repeat, stabilized by hydrophobic knobs-into-holes packing and electrostatic interactions at e and g positions. This motif drives reversible homodimerization or heterodimerization (e.g., Fos-Jun pairs), enabling over 1,400 potential dimers among human bZIPs and facilitating DNA binding through allosteric coupling that induces basic region helix formation. In assemblies like GCN4, the zipper supports short-lived dimers (<1 second lifetime) essential for transcriptional regulation and higher-order complex formation.

Motifs in Other Biomolecules

In Carbohydrates

In carbohydrates, structural motifs primarily arise from the arrangement of monosaccharide units linked by glycosidic bonds, which dictate the overall three-dimensional conformation of polysaccharides and glycoconjugates. The α-1,4-glycosidic linkage, as seen in amylose, a component of , promotes a left-handed helical structure with approximately six glucose units per turn, facilitating compact packing and enzymatic accessibility for energy storage. In contrast, the β-1,4-glycosidic linkage in cellulose results in extended, linear chains that form rigid, hydrogen-bonded microfibrils, providing tensile strength in plant cell walls. These linkage-specific motifs influence solubility, digestibility, and biological roles, with α-linkages generally yielding more flexible, soluble polymers compared to the insoluble β-forms. Branching motifs further diversify carbohydrate structures, particularly in complex glycans attached to proteins. Glycoclusters, multivalent arrays of glycan units, and antennae—the branched extensions in N-linked glycoproteins—create spatial arrangements that enhance avidity in molecular recognition. At the monosaccharide level, the predominant chair conformation (⁴C₁ for D-glucopyranose) minimizes steric hindrance, positioning hydroxyl groups equatorially for optimal hydrogen bonding and linkage formation. These motifs, often biantennary or triantennary in glycoproteins, contribute to conformational flexibility and surface display on cells. Specific examples illustrate how modifications enhance functional motifs. In heparin, a sulfated glycosaminoglycan, distinct sulfation patterns—such as 3-O-sulfation at glucosamine residues—form binding motifs that selectively interact with proteins like antithrombin III, regulating coagulation. Similarly, glycan folds recognized by lectins, such as the β-sandwich domains in galectins binding β-galactoside motifs, mediate cell adhesion and signaling through precise stereochemical complementarity. Structural analysis of these motifs relies on advanced techniques to resolve torsion angles and dynamics. Nuclear magnetic resonance (NMR) spectroscopy determines glycosidic torsion angles φ (H1'-C1'-O-Cn) and ψ (C1'-O-Cn-Hn), revealing preferred conformations like the ⁴C₁ chair with exoanomeric effects stabilizing linkages. These motifs play critical roles in recognition, as in blood group antigens where ABO-specific terminal glycans (e.g., α-GalNAc for A antigen) serve as motifs for antibody binding and transfusion compatibility. Post-2023 cryo-electron microscopy (cryo-EM) studies have provided insights into glycan shields on viruses, such as the densely branched N-glycans on SARS-CoV-2 spike protein forming a protective lattice that modulates immune evasion and antibody access.

In Lipids and Membranes

In lipid bilayers, a fundamental structural motif is the lamellar phase, where amphiphilic lipids self-assemble into stacked sheets with hydrophobic tails oriented inward and hydrophilic headgroups facing aqueous environments on both sides. This arrangement minimizes exposure of nonpolar regions to water, forming a bilayer thickness typically around 4-5 nm for common phospholipids like . The stability of this motif arises from van der Waals interactions between tails and electrostatic repulsion between headgroups, as detailed in structural studies of hydrated bilayers. Within these lamellae, specialized domains such as emerge as cholesterol- and sphingolipid-enriched motifs, creating liquid-ordered phases that differ from the surrounding liquid-disordered regions due to tighter packing and reduced fluidity. These rafts, often 10-200 nm in size, facilitate lateral segregation and are implicated in membrane organization. Lipid shapes dictate alternative motifs beyond bilayers, governed by the packing parameter P = \frac{V}{a \cdot l}, where V is the hydrophobic tail volume, a the headgroup area, and l the tail length. Cylindrical s with P \approx 1 favor lamellar bilayers or micelles, while conical shapes with P > 1 promote inverted hexagonal phases (H_{II}), featuring lipid cylinders with heads outward and water channels inside. These non-lamellar motifs are critical for fusion events, as the negative curvature of H_{II} phases lowers the energy barrier for stalk formation between fusing bilayers. Inverted micelles (P > 1) form in apolar environments, with headgroups facing inward to enclose an aqueous core and tails interacting with the nonpolar solvent. Specific lipid motifs include clusters in mitochondrial membranes, where this diphosphatidylglycerol forms oligomeric assemblies that stabilize high-curvature regions like cristae, comprising up to 20% of inner membrane lipids. Lipid rafts often incorporate (GPI)-anchored lipids, enhancing domain rigidity through saturated acyl chains that align with . Recent cryo-EM studies as of 2025 have resolved lipid motifs around membrane-embedded proteins, revealing clusters binding via specific pockets in pores and scaffolds, with densities indicating dynamic interactions at 3-4 Å resolution. Functionally, these motifs support signaling, as seen with (PIP2) headgroups clustering to recruit effectors in plasma membranes, and contribute to curvature by lipids like promoting hexagonal transitions that aid fusion without protein involvement. motifs similarly influence mitochondrial dynamics through curvature stabilization.

Identification and Applications

Computational Methods

Computational methods for detecting and predicting structural motifs in biomolecules rely on algorithms that analyze three-dimensional structures, often drawing from large databases of experimentally determined conformations. These approaches address the challenge of identifying recurring patterns across diverse sequences, enabling the of motifs in proteins, nucleic acids, and other biomolecules. Template-based and fragment-based detection methods form the core of motif identification, while predictive tools leverage to infer motifs . Template-based search methods, such as the server, facilitate the detection of structural motifs by performing 3D alignments between a query structure and entries in the (PDB). The server compares protein structures using distance-matrix alignment, identifying similarities that reveal conserved motifs even among distantly related sequences. This tool is particularly effective for unifying protein families and detecting motifs like helices or beta-sheets through global or local superimpositions. Fragment-based approaches complement this by focusing on partial matches, allowing detection of motifs within larger, non-homologous structures. For instance, DeepFold employs deep convolutional neural networks to learn low-dimensional representations of structural fragments, enabling efficient retrieval and classification of motifs such as zinc fingers or leucine zippers from protein databases. Prediction of structural motifs has advanced significantly with models that generate atomic-level structures from sequences. AlphaFold3, released in 2024, predicts multi-molecule complexes including proteins, nucleic acids, and ligands, thereby inferring motifs like DNA-binding domains or hairpins through end-to-end diffusion-based modeling. For secondary structures, RNAfold uses dynamic programming to compute minimum free-energy folds, identifying common motifs such as stem-loops or pseudoknots based on thermodynamic parameters. In glycans, post-2023 predictors like CandyCrunch apply to data, reconstructing trees and motifs with up to 90% accuracy in top-ranked predictions, addressing the branching complexity unique to carbohydrates. Key databases underpin these methods by providing curated repositories of motifs. For proteins, and CATH classify domains hierarchically into classes, architectures, topologies, and superfamilies, facilitating motif searches based on evolutionary relationships. RNA STRAND compiles validated secondary structures from diverse RNA types, enabling statistical analysis of motifs across organisms. The PDB serves as a central archive for all biomolecular structures, where motif similarity is quantified using (RMSD), with thresholds below 2 indicating high structural conservation. Challenges in computational motif detection include accommodating structural flexibility, particularly in loops and disordered regions that deviate from rigid templates. Machine learning advances, such as 2025 diffusion models for membrane proteins, are addressing this by generating ensemble predictions that capture lipid-induced conformations and motifs in dynamic environments like bilayers.

Biological and Therapeutic Importance

Structural motifs play crucial roles in biological specificity, particularly in protein-DNA interactions. Zinc finger motifs, for instance, are essential for transcription factors that bind specific DNA sequences to regulate gene expression, enabling precise control over cellular processes such as development and response to environmental signals. These motifs exemplify how recurrent structural elements confer functional modularity, allowing proteins to interact with diverse targets while maintaining stability. Evolutionary modularity is a key feature of structural motifs across biomolecules, where they act as reusable building blocks that facilitate and functional . In protein , motifs enable the recombination of domains, promoting evolvability by allowing incremental changes without disrupting overall . This is evident in the conservation of motifs like beta-sheets and helices, which support diverse functions from to signaling, underscoring their role in the diversification of biomolecular systems. However, disruptions in motif folding can lead to diseases; in prion disorders, misfolded alpha-helical motifs convert to beta-sheet-rich conformations, propagating aggregates that cause neurodegeneration. Therapeutically, structural motifs serve as targets for drug design, enhancing selectivity and efficacy. G-quadruplex motifs in DNA and RNA oncogenes are stabilized by small molecules like CX-5461 (Pidnarulex), which inhibits RNA polymerase I and induces DNA damage in cancer cells, earning FDA Fast Track Designation for advanced solid tumors. In vaccine development, glycan motifs on the SARS-CoV-2 spike protein are manipulated to improve immunogenicity; glycan-masked receptor-binding domain vaccines elicit broader neutralizing antibodies by shielding non-neutralizing epitopes and exposing conserved sites. Protein engineering leverages motifs for de novo design, using tools like to create custom structures with predefined folds, such as novel binders or enzymes, by assembling secondary structure motifs into stable scaffolds. In gene editing, CRISPR-Cas9 relies on protospacer-adjacent motif (PAM) recognition for target specificity, where the NGG sequence adjacent to the guide RNA directs cleavage, enabling precise genomic modifications. Emerging applications in 2025 integrate with motif analysis for diagnostics, where models predict protein and structural motifs to identify disease-associated variants, accelerating discovery in and infectious diseases. In synthetic biology, motifs drive interdisciplinary innovations, such as designing artificial protein complexes that mimic natural assemblies for biosensors or therapeutic delivery systems.

References

  1. [1]
    Protein Structural Motifs in Prediction and Design - PMC
    Apr 28, 2017 · Referred to as a TERM (tertiary motif), this motif is defined as the union of the local backbone fragment around the central residue (e.g., ± 2 ...
  2. [2]
    [PDF] RNA structural motifs: building blocks of a modular biomolecule
    Jul 3, 2006 · A comprehensive definition of an RNA structural motif should be based on and consist of not only base-pairing or secondary structure ...
  3. [3]
    Structural motifs of biomolecules - PNAS
    Biomolecular structures are assemblies of emergent anisotropic building modules such as uniaxial helices or biaxial strands. We provide an approach to ...Missing: review paper
  4. [4]
    Real-time structural motif searching in proteins using an inverted ...
    Within proteins, structural motifs are characteristic arrangements of amino residues, which may or may not be near one another in the linear polypeptide chain.
  5. [5]
    Structural Motifs in RNA - Annual Reviews
    Jul 1, 1999 · An RNA motif is a discrete sequence or combination of base juxtapositions found in naturally occurring RNAs in unexpectedly high abundance.Missing: definition paper
  6. [6]
    RNA Structural Motifs - Zhang - Wiley Online Library
    Aug 15, 2011 · An RNA structural motif is defined as a collection of residues that fold into a stable three-dimensional (3D) structure and which can be found in naturally ...
  7. [7]
    Protein Structural Motifs - an overview | ScienceDirect Topics
    Protein structural motifs are defined as specific arrangements of peptide chains, such as α-helices, β-strands, and polyproline helices, that serve as building ...
  8. [8]
    β-Sheet topology and the relatedness of proteins - Nature
    Aug 11, 1977 · β-Sheet topology and the relatedness of proteins. Jane S. Richardson. Nature volume 268, pages 495–500 (1977)Cite ...
  9. [9]
    Enhancing our Understanding of Protein Structure: the Work of Jane ...
    Dec 12, 2005 · In addition to the accomplishments above, Jane is widely known for her creation of ribbon drawings to schematize protein 3D structures, first ...
  10. [10]
    An RNA-centric historical narrative around the Protein Data Bank
    In the 1980s, additional tRNA structures were solved as well as the first RNA ... A stem capped by a loop is called a hairpin. The numbering follows that ...
  11. [11]
    The Protein Data Bank - PMC - PubMed Central - NIH
    The breakdown of the types of structures in the PDB is shown in Table 2. As of September 14, 1999, the PDB contained 10 714 publicly accessible structures with ...Missing: motifs | Show results with:motifs
  12. [12]
    Cruciform structures are a common DNA feature important for ...
    Aug 5, 2011 · Cruciform structures are fundamentally important for a wide range of biological processes, including replication, regulation of gene expression, nucleosome ...Missing: Alexander | Show results with:Alexander
  13. [13]
    BenchMarks The Ribosome at Atomic Resolution - ScienceDirect.com
    Dec 11, 2009 · Ada Yonath reported the first crystals of the 50S ribosomal subunit in 1980, a crucial step that would require almost 20 years to bring to ...
  14. [14]
    The future of integrated structural biology - Cell Press
    Sep 17, 2024 · The developments of cryo-ET raise the possibility that single-particle cryo-EM, X-ray, and NMR structure determination become routine techniques ...
  15. [15]
    Evolving concepts of the protein universe - ScienceDirect.com
    Mar 21, 2025 · In this Perspective, we discuss ideas and developments that emerged over the past three decades regarding the protein structure-function paradigm.
  16. [16]
    AlphaFold two years on: Validation and impact - PNAS
    Here, we discuss some of the latest work based on AlphaFold, with a particular focus on its use within the structural biology community.
  17. [17]
    Generative artificial intelligence performs rudimentary structural ...
    Aug 21, 2024 · We show the current capacity of natural language generative AI to perform basic structural biology modeling and interaction analysis with atomic-scale accuracy.
  18. [18]
    The roles of structural dynamics in the cellular functions of RNAs
    RNAs such as microRNAs readily fold into stem-loop secondary structures, which can be recognized by protein, and other RNAs such as ribozymes and ...
  19. [19]
    Predicting RNA secondary structures from sequence and probing data
    Jul 1, 2016 · The resulting structures are composed of two fundamental building blocks: paired regions (mostly A-form helices), and unpaired loops.
  20. [20]
    Structures, Kinetics, Thermodynamics, and Biological Functions of ...
    Oct 15, 2007 · First, we review the structure of hairpins including diversity in the stem, loop, and closing base pair. The function of RNA hairpins in biology.Missing: paper | Show results with:paper
  21. [21]
    Structure and function of the hairpin ribozyme - ScienceDirect.com
    A minimal hairpin ribozyme contains four base-paired helices, H1 through H4, and two unpaired loops, A and B, with the reactive phosphodiester located within ...Missing: miRNA paper
  22. [22]
    Frameshifting RNA pseudoknots: Structure and mechanism - PMC
    The subject of this review is RNA pseudoknots and related RNA motifs that simulate −1 programmed ribosomal frameshifting (−1 PRF). An important goal of these ...Missing: interlocking paper
  23. [23]
    Pseudoknots: RNA Structures with Diverse Functions | PLOS Biology
    Jun 14, 2005 · A pseudoknot is an RNA structure that is minimally composed of two helical segments connected by single-stranded regions or loops.Missing: seminal | Show results with:seminal
  24. [24]
    The kink-turn in the structural biology of RNA
    Apr 12, 2018 · The k-turn is a common structural motif in many functional RNA species, that introduces a sharp bend into the axis of duplex RNA.
  25. [25]
    The kink-turn: a new RNA secondary structure motif - PMC - NIH
    This helix–internal loop–helix motif has an unusual conformation, which includes a kink in its phosphodiester backbone that bends the RNA helix axis by ∼120°.Missing: review | Show results with:review
  26. [26]
  27. [27]
    Structural insights into intron catalysis and dynamics during splicing
    Nov 22, 2023 · These findings shed light on the evolution of splicing and highlight the conservation of structural components, catalytic mechanism and ...
  28. [28]
    RNA structure in splicing: An evolutionary perspective - PMC - NIH
    RNA structure replaces the need for essential factors in splicing. In our recent report published in Genome Research,56 we identified evolutionarily conserved ...
  29. [29]
    Cruciform structures are a common DNA feature important for ...
    Aug 5, 2011 · DNA cruciforms play an important role in the regulation of natural processes involving DNA. These structures are formed by inverted repeats, and ...
  30. [30]
    Effects of Replication and Transcription on DNA Structure-Related ...
    Jan 5, 2017 · In this article, we discuss the interactions of non-B DNA with the replication and/or transcription machinery, particularly in disease states (eg, tumors)
  31. [31]
    Interaction of Proteins with Inverted Repeats and Cruciform ...
    Cruciforms occur when inverted repeat sequences in double-stranded DNA adopt intra-strand hairpins on opposing strands. Biophysical and molecular studies of ...
  32. [32]
    The mechanism of cruciform formation in supercoiled DNA
    There are two alternative pathways by which inverted repeat sequences in supercoiled DNA molecules may extrude cruciform structures, called C-type and S-type.
  33. [33]
    Interarm Interaction of DNA Cruciform Forming at a Short Inverted ...
    Inverted repeat sequences are known to form cruciform structures in negatively supercoiled DNA (for review, Wells, 1988; Sinden, 1994). They are widespread ...
  34. [34]
    Dynamics of cruciform extrusion in supercoiled DNA - NIH
    An inverted repeat has been created in a plasmid by ligation of two 13 nucleotide synthetic oligonucleotides into the cloning vector pAT153.
  35. [35]
    Structure and dynamics of supercoil-stabilized DNA cruciforms
    Jul 3, 1998 · In addition, cruciforms provide a model system for structural studies of Holliday junctions, intermediates in homologous DNA recombination.
  36. [36]
    DNA G-Quadruplexes as Targets for Natural Product Drug Discovery
    DNA guanine (G)-quadruplexes (G4s) are unique secondary structures formed by two or more stacked G-tetrads in G-rich DNA sequences.
  37. [37]
    G-Quadruplexes in Tumor Immune Regulation - NIH
    Apr 27, 2025 · G-tetrad is formed by Hoogsteen hydrogen bonding. Stability ... Targeting G-quadruplexes in gene promoters: A novel anticancer strategy?
  38. [38]
    A Phenotypic Approach to the Discovery of Potent G-Quadruplex ...
    Aug 1, 2024 · G-quadruplexes are higher-order DNA (and RNA) structures formed by the association of successive short G-tracts [17,18,19,20,21] and have ...<|separator|>
  39. [39]
    Dynamic Processing of Displacement Loops during ...
    Displacement loops (D-loops) are pivotal intermediates of homologous recombination (HR), a universal DNA double strand break (DSB) repair pathway.Missing: triple | Show results with:triple
  40. [40]
    Triplex H-DNA structure: the long and winding road from the ...
    An early, crucial indication of H-DNA's biological relevance is the fact that H-DNA interacts differently with cellular machinery compared to B-DNA.
  41. [41]
    DNA Triple Helices: biological consequences and therapeutic ... - NIH
    For example, H-DNA, or intramolecular triplex DNA is a structure in which half of the pyrimidine tract swivels its backbone parallel to the purine strand in ...
  42. [42]
    Delineation of the DNA Structural Features of Eukaryotic Core ... - NIH
    Feb 9, 2022 · Meanwhile, more negative propeller twist angles weaken hydrogen bonding patterns between base pairs. Furthermore, the reduction in helical ...
  43. [43]
    Transcription blockage by stable H-DNA analogs in vitro - PMC
    Jun 22, 2015 · DNA sequences that can form unusual secondary structures are implicated in regulating gene expression and causing genomic instability.
  44. [44]
    Non-canonical DNA structures are drivers of genome evolution - PMC
    Non-B DNA motifs affect mutation rate and facilitate genome instability. Non-B DNA structures can pose obstacles for replicative polymerase progression during ...
  45. [45]
    The structure of proteins: Two hydrogen-bonded helical ... - PNAS
    Two hydrogen-bonded helical structures for a polypeptide chain have been found in which the residues are stereochemically equivalent.
  46. [46]
    FOLDING AMPHIPATHIC HELICES INTO MEMBRANES - NIH
    High amphiphilicity is a hallmark of interfacial helices in membrane proteins and membrane-active peptides, such as toxins and antimicrobial peptides.<|control11|><|separator|>
  47. [47]
    Structure of β-sheets: Origin of the right-handed twist and of the ...
    β-Sheets in observed protein structures sometimes have a larger twist than those in the structures computed here. This may be due to irregularities in amino ...
  48. [48]
    A systematic analysis of the beta hairpin motif in the Protein Data Bank
    Jan 2, 2021 · The beta hairpin motif is a ubiquitous protein structural motif that can be found in molecules across the tree of life.<|control11|><|separator|>
  49. [49]
    Helix-Turn-Helix - an overview | ScienceDirect Topics
    The helix-turn-helix is defined as a simple DNA-binding motif consisting of two α helices connected by a short extended chain of amino acids, ...
  50. [50]
  51. [51]
    An Altered Specificity Mutation in the Lambda Repressor Induces ...
    Apr 8, 1994 · The lambda repressor exhibits structural characteristics of lock and key complementary through the helix-turn-helix motif, and of induced ...
  52. [52]
    C2H2 Zinc Finger Proteins: The Largest but Poorly Explored Family ...
    Each of the three C2H2 domains binds to three or four DNA nucleotides via amino acids at the same α-helical positions (Fig. 1B): arginine at position –1, as ...
  53. [53]
  54. [54]
    A comprehensive analysis of the Greek key motifs in protein beta ...
    The Greek key motifs are the topological signature of many beta-barrels and a majority of beta-sandwich structures.
  55. [55]
    Omega loops: nonregular secondary structures significant in protein ...
    It has become clear that omega-loops are often involved in protein function and molecular recognition. One motif, an omega-loop lid, that is flexible and ...
  56. [56]
    A Novel Main Chain Motif in Proteins Bridged by Cationic Groups
    The discovery2 of the nest, a common three residue motif in proteins in which anionic or δ– atoms or groups bridge pairs of main chain NH atoms, leads to the ...
  57. [57]
  58. [58]
    The Case of Basic Region Leucine Zipper Transcriptional Regulators
    This review discusses how conformational flexibility “built” into the amino acid sequence allows bZIP proteins to interact with a large number of diverse ...
  59. [59]
  60. [60]
    Amylose - an overview | ScienceDirect Topics
    Amylose typically forms a left-handed α-helical structure, in which six anhydroglucose units make up one turn of the helix. The molecular weight (Mw) of ...
  61. [61]
    9.3: Polysaccharides - Biology LibreTexts
    Jan 17, 2025 · Cellulose, a structural homopolymer of glucose in plants, has β 1,4 main chain links without branching. Multiple chains are held together by ...
  62. [62]
    The Conformation of Glycosidic Linkages According to Various ...
    Jul 10, 2024 · The MD simulations concerned the homooctamers of d-glucopyranose residues linked by the three types of glycosidic linkages: α(1 → 4), β(1 → 3), ...
  63. [63]
    Mining High-Complexity Motifs in Glycans: A New Language To ...
    With the text-based definitions of motifs, we could search for the presence of motifs in a set of glycans and quantify the association of lectin binding with ...Missing: glycoclusters | Show results with:glycoclusters
  64. [64]
    Monosaccharide Diversity - Essentials of Glycobiology - NCBI - NIH
    The preferred conformation of a pyranose ring is the “chair” conformation, similar to the structure of cyclohexane. The conversion from Haworth projection to ...MONOSACCHARIDES: BASIC... · MONOSACCHARIDES EXIST...
  65. [65]
    Complex N-Glycan Number and Degree of Branching Cooperate to ...
    Apr 6, 2007 · The number of N-glycans (n) is a distinct feature of each glycoprotein sequence and cooperates with the physical properties of the Golgi N-glycan-branching ...N-Glycan Branching Is... · Hexosamine/n-Glycan... · Supplemental DataMissing: glycoclusters | Show results with:glycoclusters
  66. [66]
    Dissecting structure-function of 3-O-sulfated heparin and ... - Science
    Dec 22, 2021 · To date, over 400 heparin/HS-binding proteins have been identified (53), yet only a handful of 3-O-sulfated structures have been identified or ...
  67. [67]
    C-type Lectins - Essentials of Glycobiology - NCBI - NIH
    C-type lectins are CA++-dependent glycan-binding proteins that share primary and secondary structural homology in their carbohydrate-recognition domains ...
  68. [68]
    Conformational Preferences at the Glycosidic Linkage of ...
    Jan 5, 2024 · The glycosidic torsion angles are defined as follows: φ =H1′-C1′-On-Cn and ψ=C1′-On-Cn-Hn where n denotes the linkage position; torsion angles ...
  69. [69]
    Structures Common to Different Glycans - Essentials of Glycobiology
    The A, B, and H blood group antigens are glycans presented on Type-1 or Type-2 structures (Figure 14.2), on O-GalNAc glycans (Type-3), or on glycolipids (Type-4) ...REGULATED... · THE A, B, AND H HUMAN... · LEWIS BLOOD GROUPS
  70. [70]
    Spike N354 glycosylation augments SARS-CoV-2 fitness for human ...
    Glycan shield density analysis reveals a strong correlation that viruses historically classified as 'evasion strong' [9] had significantly elevated glycan ...
  71. [71]
    Structure of lipid bilayers - PMC - PubMed Central
    The uncertainty in structural results for lipid bilayers is being reduced and best current values are provided for bilayers of five lipids.
  72. [72]
    Sphingolipids and lipid rafts: Novel concepts and methods of analysis
    In the original “lipid raft” model formalized by Simons in 1997, lipids such as cholesterol and sphingomyelin form a liquid ordered phase (Lo) in the membrane, ...
  73. [73]
    Complementary molecular shapes and additivity of the packing ...
    A dimensionless packing parameter, S, given by S = V/al, where V is the hydrocarbon volume, a is the area of the head group, and l is the critical length of the ...
  74. [74]
    Lipid Polymorphisms and Membrane Shape - PMC - NIH
    In this article, we will discuss the importance of lipid polymorphisms in the shaping of membranes and its role in controlling cellular membrane morphology.
  75. [75]
    The role of cardiolipin in the structural organization of mitochondrial ...
    The underlying principle is that cardiolipin clusters of a certain size, but not individual cardiolipin molecules, stabilize the geometry of curved regions of ...Missing: motifs | Show results with:motifs
  76. [76]
    Cryo-EM structures of a protein pore reveal a cluster of cholesterol ...
    Mar 26, 2025 · We show that the octameric pore interacts with 112 lipids in the upper leaflet of the membrane, reveal the roles of lipids, and demonstrate that the ...
  77. [77]
    PI(4,5)P2: signaling the plasma membrane - PMC - PubMed Central
    Nov 11, 2022 · PI(4,5)P2 is central to plasma membrane function, acting as a cofactor for signaling, transport, and structure, and is a substrate for second ...<|control11|><|separator|>
  78. [78]
    Zinc finger proteins: insights into the transcriptional and post ...
    Jul 24, 2021 · Zinc finger proteins are basically thought of as transcription factors meant primarily to bind DNA. However, some of the zinc finger proteins ...
  79. [79]
    Structures and biological functions of zinc finger proteins and their ...
    Jan 9, 2022 · ZFPs primarily function as transcription factors in tumorigenesis and tumor progression. Transcription factors (TFs) are proteins that play a ...
  80. [80]
    Quantifying Modularity in the Evolution of Biomolecular Systems - PMC
    Functional modules are considered the primary building blocks of biomolecular systems. Here we study to what extent functional modules behave cohesively ...
  81. [81]
    The evolutionary origins of modularity - Journals
    Mar 22, 2013 · A key driver of evolvability is the widespread modularity of biological networks—their organization as functional, sparsely connected subunits ...
  82. [82]
    Mechanism of misfolding of the human prion protein revealed by a ...
    Mar 17, 2021 · The misfolding and aggregation into amyloid fibrils of the prion protein (PrP) have been strongly linked with a group of neurodegenerative disorders that ...
  83. [83]
    The G-quadruplex ligand CX-5461: an innovative candidate for ...
    Apr 18, 2025 · CX-5461, also known as Pidnarulex, is a First-in-Class anticancer drug that has received 'Fast Track Designation' approval from the FDA.
  84. [84]
    Development of Glycan-masked SARS-CoV-2 RBD vaccines ... - NIH
    Sep 26, 2024 · Glycan masking is a promising strategy in vaccine design, leveraging glycans to shield less important regions and direct the immune system ...
  85. [85]
    De novo design of protein structure and function with RFdiffusion
    Jul 11, 2023 · Here we show that by fine-tuning the RoseTTAFold structure prediction network on protein structure denoising tasks, we obtain a generative model ...
  86. [86]
    Importance of the PAM sequences in CRISPR-Cas9 gene editing
    Dec 12, 2023 · The protospacer-adjacent motif (PAM) is a short, conserved sequence on the targeted strand of DNA adjacent to the protospacer (the target DNA).
  87. [87]
    Protein structure prediction via deep learning: an in-depth review
    Apr 2, 2025 · The application of deep learning algorithms in protein structure prediction has greatly influenced drug discovery and development.
  88. [88]
    Hierarchical design of artificial proteins and complexes toward ...
    A central goal of “synthetic structural biology” is the design and construction of artificial proteins and protein complexes as desired. In this paper, I review ...