Fact-checked by Grok 2 weeks ago

Cis-regulatory element

A cis-regulatory element (CRE) is a noncoding DNA sequence located on the same chromosome as the gene it regulates, essential for controlling the spatiotemporal expression of that gene by providing binding sites for transcription factors and other regulatory proteins. These elements act in cis, meaning they influence gene activity without diffusing away from their genomic location, and they are distinct from trans-acting factors such as soluble transcription factors that bind to them. CREs are fundamental to the genome's regulatory architecture, enabling precise patterns of gene expression that underpin cellular differentiation, development, and responses to environmental cues across organisms. The concept of cis-regulatory elements originated in the 1960s with François Jacob and Monod's discovery of sequences in that control in response to environmental signals. In the following decades, research in eukaryotes revealed more complex elements, such as enhancers identified in the 1970s and 1980s, which can function over long distances and independently of orientation. These findings laid the foundation for understanding gene regulation through combinatorial control modules, with significant contributions from studies on developmental systems like embryos in the 1990s. CREs encompass several types, including promoters, which are proximal to the transcription start site and initiate assembly; enhancers, which can activate transcription from distances up to hundreds of kilobases away; silencers, which repress activity; and insulators, which block unwanted interactions between regulatory elements and . Often organized into cis-regulatory modules—clusters of motifs targeted by specific transcription factors—these elements integrate signals from multiple inputs to fine-tune expression levels in a context-dependent manner. In eukaryotic genomes, CREs number in the tens of thousands, with their sequences encoding combinatorial logic that dictates when, where, and how much a is expressed. The study of CREs has revealed their critical role in and , as sequence variations within these elements can alter without changing protein-coding regions, contributing to phenotypic diversity and . Advances in genomic technologies, such as and massively parallel reporter assays, have enabled the identification and functional validation of CREs at scale, highlighting their dynamic nature across cell types and conditions. Dysregulation of CREs is implicated in numerous disorders, including developmental abnormalities and cancers, underscoring their biomedical significance.

Introduction

Definition and Characteristics

Cis-regulatory elements (CREs) are sequences located on the same DNA molecule as the genes they regulate, functioning to control the transcription of those genes by influencing initiation, enhancement, or repression of activity. These elements are essential for the precise spatiotemporal expression of genes, ensuring that transcription occurs at the appropriate time, place, and level during development or in response to environmental cues. Key characteristics of CREs include their position-dependency, where they can act as proximal elements near the transcription start site or as distal elements located thousands of base pairs away from the regulated . They contain sequence-specific motifs, such as binding sites for transcription factors, which allow for targeted interactions that dictate regulatory outcomes. Additionally, CREs often exhibit tissue-specific or developmental stage-specific activity, enabling differential across cell types or during . In contrast to trans-regulatory elements, which are diffusible proteins such as transcription factors that can act on genes across different DNA molecules or chromosomes, are fixed genomic sequences that exert their effects only in , meaning on the same DNA strand. The basic structure of CREs typically includes core motifs like the , a short AT-rich sequence that serves as a for the in promoter regions to facilitate assembly of the transcription initiation complex.

Historical Development

The foundational concepts of cis-regulatory elements emerged from studies on prokaryotic gene regulation in the mid-20th century. In 1961, François Jacob and proposed the model for the in , introducing the as a DNA sequence that binds a repressor protein to control transcription of adjacent structural genes, marking the first recognition of a cis-acting regulatory element.80072-7) This model was experimentally validated in 1966 when and Benno Müller-Hill isolated the protein, confirming its specific binding to the site and solidifying the operator-repressor mechanism as a key paradigm for negative regulation. The identification of promoters as positive cis-regulatory elements followed in the 1970s, building on prokaryotic insights but extending to eukaryotes through in vitro transcription assays. In bacteria, promoters were defined earlier as RNA polymerase binding sites, but eukaryotic promoters were characterized via reconstituted transcription systems, with Robert Roeder's 1969 discovery of distinct nuclear RNA polymerases paving the way for mapping promoter sequences in genes like those encoding ribosomal RNA and beta-globin by the mid-1970s. These studies revealed that eukaryotic promoters, unlike their prokaryotic counterparts, often required additional factors for accurate initiation, highlighting early differences in regulatory complexity. A major advance came in the 1980s with the discovery of enhancers, distal cis-regulatory elements that activate transcription independently of position and orientation. The first enhancer was identified in 1981 within the simian virus 40 () genome by Joy Banerji, Sergio Rusconi, and Walter Schaffner, who showed that a 72-base-pair repeat sequence dramatically boosted beta-globin in transfected cells, even when located thousands of base pairs away.90413-8) This finding extended to cellular genes, with Susumu Tonegawa's group demonstrating in 1983 an enhancer in the immunoglobulin kappa light chain locus that drove tissue-specific expression in B cells, linking enhancers to developmental regulation in eukaryotes. These discoveries shifted focus from proximal prokaryotic controls to long-range eukaryotic mechanisms. By the 1990s, assays in transgenic models further illuminated distal cis-regulatory elements, revealing their prevalence in complex genomes. Experiments using or beta-galactosidase reporters fused to genomic fragments identified enhancers and silencers acting over megabase distances, as seen in studies of clusters and beta-globin loci. Simultaneously, insulators emerged as boundary elements preventing inappropriate enhancer-promoter interactions; Paul Schedl and colleagues described the scs insulator in in 1991, which blocked enhancer activity in a position-dependent manner. This period marked a conceptual from simple prokaryotic operons to the intricate, combinatorial networks of eukaryotic cis-regulatory elements, emphasizing chromatin architecture and tissue-specific control.

Classification

Promoters

Promoters are cis-regulatory elements located immediately upstream of the transcription start site (TSS), typically spanning 50 to 200 base pairs, and serve as the primary sites for initiating basal transcription of protein-coding genes in eukaryotes. They are proximal regulatory sequences that direct the assembly of the pre-initiation complex (PIC) at the core promoter region, which is generally confined to about -40 to +40 bp relative to the TSS. The core promoter contains specific sequence motifs that facilitate transcription initiation. The TATA box, a well-characterized element, is located approximately 25 to 30 bp upstream of the TSS and has the TATA(A/T)A(A/T), which binds the (TBP), a subunit of the general transcription factor TFIID. The Initiator (Inr) element overlaps the TSS (from -2 to +5 bp) and features the consensus YYANWYY (where Y is a and N/W is A or T), enabling precise selection of the transcription start and interaction with TFIID components like TAF1 and TAF2. The Downstream Promoter Element (DPE), positioned 28 to 32 bp downstream of the TSS, has the consensus (A/G)G(A/T)CGTG and is recognized by TAF6 and TAF9 within TFIID, particularly in TATA-less promoters to enhance initiation efficiency. These core elements collectively recruit (Pol II) and general transcription factors () to form the . TFIID, initiated by TBP binding to the or Inr/DPE motifs, nucleates the assembly of TFIIA, TFIIB, Pol II-TFIIF, TFIIE, and TFIIH, positioning Pol II at the TSS for promoter melting and transcription elongation. This basal machinery ensures accurate and efficient RNA synthesis, with the absence or presence of specific motifs modulating the promoter's responsiveness. Promoters exhibit variations suited to different gene functions. Housekeeping genes, which maintain constitutive expression, often feature TATA-less promoters enriched with CpG islands—GC-rich regions spanning several hundred base pairs around the TSS that lack TATA boxes but promote open and broad accessibility. In contrast, tissue-specific promoters incorporate motifs that confer selective activation in particular cell types, such as heart or liver, through binding sites for lineage-restricted transcription factors. Promoter strength, which determines transcriptional output levels, correlates with the density and combination of core motifs; strong promoters typically include multiple elements like and Inr for robust formation, while weak promoters rely on fewer or suboptimal motifs, resulting in lower basal activity. For instance, /Inr-equipped promoters can drive up to several-fold higher transcription rates compared to those with isolated elements.

Enhancers

Enhancers are distal cis-regulatory elements that activate transcription of target by interacting with promoters over long genomic distances, often spanning tens to hundreds of kilobases or even up to megabases away. Unlike proximal elements, enhancers function independently of their orientation relative to the gene and can be located upstream, downstream, or within introns of the regulated gene. This positional flexibility allows enhancers to integrate signals from various transcription factors to modulate in a context-specific manner. Structurally, enhancers typically consist of clusters of binding sites for multiple transcription factors, enabling cooperative interactions that amplify regulatory output. A specialized subset known as super-enhancers comprises large clusters of such sites, often spanning several kilobases, characterized by exceptionally high occupancy of complex and master transcription factors, which drive robust activation of cell identity genes. These super-enhancers exhibit enhanced stability and transcriptional potency compared to typical enhancers. In addition, stretch enhancers represent another class of extended regulatory regions, typically at least 3,000 base pairs long, that emerge during to orchestrate cell-type-specific gene programs, particularly in developmental contexts. Enhancers are broadly classified into constitutive types, which maintain activity across multiple cell types to support housekeeping functions, and cell-type-specific enhancers, which are activated in particular lineages through lineage-determining transcription factors. Enhancers boost the transcription rate of target genes by 10- to 1000-fold through mechanisms involving looping that brings them into physical proximity with promoters, facilitating recruitment of and coactivators. Recent advances, including studies from 2024, have highlighted the role of in conferring enhancer-promoter specificity, where transcription factors and form condensates that selectively stabilize interactions and enhance bursting frequency at cognate promoters. Enhancers are commonly identified by epigenetic markers such as H3K27 (H3K27ac), which distinguishes active enhancers from poised ones marked solely by H3K4 monomethylation, and DNase I hypersensitive sites, which indicate open accessible to regulatory proteins. These markers enable genome-wide mapping and validation of enhancer activity across cell types.

Silencers

Silencers are cis-regulatory elements () that repress transcription by binding proteins, functioning as the negative counterparts to enhancers that promote expression. These sequences can be located proximally near the promoter or distally, often thousands of base pairs upstream or downstream, and exert their effects independently of orientation or position relative to the target . Silencers are classified into constitutive types, which maintain repression under basal conditions, and inducible types, which activate repression in response to specific cellular signals or developmental cues. The primary mechanisms of silencer action involve the recruitment of transcriptional repressors, such as Polycomb group (PcG) proteins, which form Polycomb repressive complexes (PRCs) to modify structure. PRC2, containing , catalyzes trimethylation of at lysine 27 (), promoting formation and transcriptional silencing, while PRC1 ubiquitylates to compact and block access. These modifications can spread bidirectionally from the silencer, enabling long-range repression through chromatin looping that brings the silencer into proximity with target promoters. In mammals, a notable example is the silencer element within the Igf2/H19 locus, which enforces by repressing the maternal Igf2 while allowing paternal expression of Igf2 and maternal expression of H19. Deletion of this silencer disrupts imprinting, leading to biallelic Igf2 expression independent of changes. Another example involves H3K27me3-rich regions acting as distal silencers that loop to target genes like Igf2, repressing expression; CRISPR-mediated excision of such regions upregulates the targets and alters interactions. Unlike enhancers, which facilitate open and activator recruitment to boost transcription, silencers promote chromatin compaction and binding to inhibit it, though some silencers exhibit bifunctionality by acting as enhancers in alternative cellular contexts, such as in bidirectional promoters where they overlap. Recent studies have elucidated silencer roles in allele-specific silencing during X-chromosome inactivation, where PcG-mediated deposition by Xist-recruited complexes establishes and maintains repression on the inactive .

Insulators

Insulators, also known as boundary elements, are cis-regulatory elements that prevent inappropriate interactions between enhancers and promoters or block the spread of repressive states. They function by either blocking enhancer-promoter communication (enhancer-blocking activity) or acting as barriers to propagation (barrier activity), thereby organizing the into distinct functional domains. Insulators are typically located between genes and regulatory elements, spanning 100 to 500 base pairs, and bind specific proteins such as (CCCTC-binding factor) in vertebrates, which facilitates their chromatin looping and architectural roles. In eukaryotes, well-studied examples include the chicken β-globin , which contains -binding sites and protects the locus from position effects when integrated into transgenes. Insulators contribute to higher-order organization, such as topologically associating domains (TADs), where they delineate boundaries to ensure precise gene regulation during development and cell differentiation. Dysregulation of insulators, such as mutations, is associated with developmental disorders and cancers by altering enhancer-gene wiring.

Operators

Operators are short DNA sequences located near the promoter regions of prokaryotic operons, serving as binding sites for or activator proteins that modulate the initiation of transcription. In bacteria such as , these sequences typically span 15–35 base pairs and exhibit dyad symmetry to facilitate dimerization of the bound regulatory proteins, enabling precise control over in response to environmental signals. A classic example is the lac in the lactose (lac) operon, which consists of the 35-base-pair sequence 5'-TGTGTGGAATTGTGAGCGGATAACAATTTCACACA-3' and is bound by the LacI protein to prevent access to the promoter. In the absence of , LacI binds tightly to the , repressing transcription of the lacZYA genes involved in ; however, or its non-metabolizable analog IPTG induces a conformational change in LacI, releasing it from the and allowing transcription to proceed. This inducible mechanism exemplifies negative , where binding blocks progression, ensuring efficient resource allocation in nutrient-variable environments. Variations in operator function include attenuator sequences, as seen in the tryptophan () operon of E. coli, where a leader region forms alternative RNA hairpins that act as a conditional downstream of the promoter, attenuating transcription when levels are high by mimicking -mediated control through ribosome stalling. In contrast, positive operators occur in the arabinose () operon, where the AraC protein binds to sites (araO and araI) in the presence of arabinose, looping DNA to facilitate polymerase recruitment and activate expression of araBAD genes for arabinose . Prokaryotic operators represent simpler regulatory architectures compared to eukaryotic cis-regulatory elements, relying on proximal binding and direct steric hindrance or facilitation of transcription machinery, which has provided foundational insights into the evolutionary conservation of gene control mechanisms across domains of life.

Mechanisms of Action

Transcription Factor Binding

Cis-regulatory elements (CREs) function primarily through the sequence-specific binding of transcription factors (TFs) to short DNA motifs, typically 6–20 base pairs long, which serve as recognition sites for TF DNA-binding domains. These motifs exhibit variability in sequence, reflecting the probabilistic nature of binding affinity. The position weight matrix (PWM) is a foundational model for quantifying this affinity, representing the motif as a matrix where each position i and base b has a probability P_{i,b} derived from aligned binding sites, compared against background frequencies f_b. The binding score for a sequence is calculated as: \text{Score} = \sum_{i=1}^{L} \log_2 \left( \frac{P_{i,b}}{f_b} \right) where L is the motif length; higher scores indicate stronger predicted binding. This model, originally developed for signal detection in nucleic acid sequences, enables computational prediction of TF binding sites across genomes. Cooperative binding enhances the stability and specificity of TF interactions with CREs, where multiple TFs bind adjacently or form multimers, often significantly increasing occupancy compared to independent binding. This multimerization can occur through protein-protein interactions between TF domains, stabilizing the complex on DNA and amplifying transcriptional output, as observed in yeast enhancers where paired motifs show biased spacing of 10–50 bp. In mammalian systems, such cooperativity is prevalent at active enhancers, with TF pairs like AP-1 and ETS factors exhibiting synergistic binding that dominates over additive effects. The kinetics of TF binding to CREs are characterized by rapid association and dissociation rates, typically in the range of 10^6–10^8 M^{-1} s^{-1} for association and 0.1–10 s^{-1} for dissociation, resulting in dwell times of seconds to minutes that directly influence transcriptional bursting. Short dwell times, often under 10 seconds, contribute to stochastic gene expression noise in bacteria, while longer residence times correlate with sustained activation and burst frequency in eukaryotes; for instance, varying TF concentration modulates burst size, but dwell time adjustments primarily affect burst frequency. These dynamics are measured via single-molecule tracking, revealing that TF search mechanisms, including 1D sliding along DNA, facilitate efficient target location. Upon binding, TFs often undergo allosteric conformational changes that expose or reshape interaction surfaces, facilitating recruitment of co-activators such as or acetyltransferases. For example, in nuclear receptors, DNA binding induces dimerization and alters the ligand-binding domain to enhance co-activator affinity through hydrophobic groove formation. Recent cryo-EM structures (as of 2024) have provided detailed insights into these allosteric mechanisms in TF-DNA complexes. In bacterial systems like CueR, metal ion binding triggers torsional DNA twisting and allosteric activation, propagating signals to recruitment without altering core protein-DNA contacts. These changes ensure from DNA recognition to modification. Binding specificity is disrupted by in motifs, leading to of TF affinity and associated diseases; for instance, single nucleotide variants in the TERT promoter abolish ETS factor binding, increasing melanoma risk through elevated expression. Similarly, GATA2 motif disruptions in enhancers cause MonoMAC syndrome, an , by reducing differentiation. Such variants comprise a significant portion of disease-associated non-coding SNPs, highlight the clinical impact of impaired TF-CRE interactions.

Chromatin Looping and Interactions

Cis-regulatory elements (), such as enhancers, interact with target gene promoters through looping, a process mediated by protein complexes that bring distant genomic regions into physical proximity within the three-dimensional () nuclear architecture. The Mediator complex plays a central role in facilitating enhancer-promoter interactions by bridging transcription factors bound to CREs and the basal transcription machinery at promoters, thereby stabilizing these contacts. Concurrently, , a ring-shaped SMC protein complex, drives loop extrusion by translocating along fibers, progressively enlarging loops until they are anchored by CTCF-bound sites, which act as architectural barriers to define loop boundaries and prevent aberrant spreading of regulatory signals. This mechanism ensures specific, long-range communication between enhancers and promoters, often spanning hundreds of kilobases, and is essential for precise gene activation in eukaryotic cells. Topologically associating domains (TADs) represent self-interacting regions that insulate CRE-gene interactions, confining regulatory elements to their appropriate targets and thereby maintaining transcriptional fidelity across types. TADs are typically delimited by convergent sites and occupancy, forming stable compartments where intra-TAD enhancer-promoter loops predominate over inter-TAD contacts. Disruptions to TAD boundaries, such as those induced by structural variants, can lead to misregulation by allowing ectopic enhancer access to unintended promoters, resulting in pathogenic changes. For instance, alterations in TAD integrity have been shown to rewire regulatory networks, promoting aberrant activation of nearby genes. The dynamics of looping are governed by , where multivalent interactions among proteins like and transcription factors drive the formation of biomolecular condensates that concentrate regulatory components and facilitate loop stabilization. Live-cell techniques reveal that these loops exhibit dynamic lifetimes on the order of 10–30 minutes, allowing for transient yet functional enhancer-promoter engagements that respond to cellular signals over timescales from minutes to hours. Such temporal flexibility enables rapid adjustments in without permanent structural reconfiguration. Experimental evidence for these interactions has been robustly established through chromatin conformation capture (3C) techniques, including 3C, , and , which cross-link and quantify physical contacts between genomic loci to map looping frequencies at population and single-cell resolutions. , in particular, provides genome-wide interaction profiles, revealing enhancer-anchored loops as enriched features in active landscapes. Recent advances in single-cell , such as droplet-based methods introduced in 2024, have enhanced resolution to dissect heterogeneous looping patterns across individual cells, uncovering cell-type-specific regulatory dynamics previously obscured in bulk analyses. In pathological contexts, disruptions to loops contribute to cancer progression by enabling enhancer hijacking, where structural variants reposition to aberrantly activate oncogenes. For example, such events can relocate enhancers to new promoters, driving overexpression of involved in and survival. These alterations underscore the therapeutic potential of targeting loop architecture to restore normal regulation in malignancies.

Combinatorial Logic

Combinatorial logic in cis-regulatory elements (CREs) refers to the integration of multiple (TF) inputs to produce precise patterns, often modeled using operations such as AND, OR, and NOT gates. In this framework, activation requires the simultaneous presence of specific activators (AND logic), while repression occurs in the absence of certain factors or presence of repressors (NOT logic). A classic example is the even-skipped (eve) stripe 2 enhancer in , where expression is activated by the maternal Bicoid (Bcd) and the gap protein Hunchback (Hb) acting in an AND configuration to define a broad anterior domain, but sharply repressed by the gap proteins Kruppel (Kr) and Giant (Gt) via NOT logic to restrict the pattern to a precise stripe. This Boolean-like wiring, hardwired into the DNA sequence through clustered binding sites, translates overlapping TF gradients into discrete spatial domains during embryogenesis. Quantitative aspects of this logic arise from concentration-dependent TF binding, which generates responses essential for sharp expression boundaries. TF gradients produce sigmoidal activation curves, where low yields gradual transitions, but higher-order interactions sharpen outputs. This is captured by the Hill equation for : \text{Activity} = \frac{[\text{TF}]^n}{K_d^n + [\text{TF}]^n} Here, [\text{TF}] is the TF concentration, K_d is the , and n (the Hill coefficient) measures ; values of n > 1 indicate synergistic binding, amplifying sensitivity to TF levels and enabling all-or-nothing responses in CREs. Such models explain how combinatorial inputs process noisy signals into robust patterns. Shadow enhancers, pairs or groups of quasi-redundant driving overlapping expression, further enhance robustness by buffering against perturbations in concentrations or mutations. These elements, often located nearby but independently, ensure consistent gene output; for instance, in neurogenic genes, primary and shadow enhancers compensate for each other's loss, maintaining developmental fidelity. Combinatorial control also reduces noise in by requiring multiple bindings, which averages fluctuations and suppresses cell-to-cell variability compared to single-input . Recent computational modeling has advanced understanding of this using single-cell data, simulating Boolean-like integrations in dynamic contexts. For example, logic-incorporated regulatory networks inferred from single-cell multi-omics reveal how combinatorial TF rules govern cell fate transitions, with simulations showing emergent robustness from AND/OR motifs in mammalian development. These approaches, validated against 2023–2025 datasets, highlight non-modular interactions and predict expression outcomes beyond simple Boolean approximations.

Functional Roles

In Gene Regulatory Networks

Cis-regulatory modules (CRMs), also known as enhancers or silencers clusters, function as integrated units within gene regulatory networks (GRNs), where multiple cis-regulatory elements (CREs) cooperate to process combinatorial inputs from transcription factors (TFs) and output precise gene expression patterns. These modules act as computational logic processors, employing Boolean-like operations such as AND, OR, or XOR gates to interpret TF signals and drive target gene activation or repression in a context-specific manner. For instance, a CRM may integrate signals from several TFs to ensure expression only when multiple conditions are met, thereby enabling modular control over developmental processes. This modularity allows GRNs to decompose complex regulatory logic into reusable components, facilitating the coordination of gene expression across cellular states. In GRNs, CRMs participate in recurrent network motifs that confer specific dynamical properties, such as feed-forward loops (FFLs) and auto-regulatory circuits. In an FFL motif, a primary (X) directly activates a secondary TF (Y) and both jointly regulate a gene (Z) through CREs at the Z promoter, which integrate the inputs via logic gates to accelerate or delay responses to stimuli; for example, incoherent FFLs act as sign-sensitive accelerators for rapid ON signals. Auto-regulation occurs when a gene's product binds back to its own CRM, often via loops that stabilize expression levels—negative auto-regulation speeds up response times and reduces cell-to-cell variability, while positive forms maintain bistable states. These motifs, enriched in GRNs compared to random networks, allow CRMs to embed temporal and logical control, enhancing network efficiency. The scalability of CREs within CRMs enables precise spatiotemporal gene regulation in complex developmental systems, as seen in the segmentation network, where hierarchical CRMs drive stripe-specific expression patterns along the embryo's anterior-posterior axis through layered TF interactions. This distributed architecture allows GRNs to scale from local modules to genome-wide coordination, achieving robustness against perturbations; multiple redundant within and across CRMs buffer mutations by compensating for losses in individual elements, preserving overall network function even after severe disruptions. For example, in conserved regulators like CLV3, synergistic interactions among upstream and downstream maintain expression despite deletions, demonstrating how CRE multiplicity evolves to enhance mutational resilience. Recent advances in single-cell RNA sequencing (scRNA-seq) have illuminated dynamics underlying transcriptional bursts in GRNs, revealing how enhancer clusters modulate burst frequency and amplitude to fine-tune expression and timing. In 2024 studies, temporally resolved scRNA-seq during stimuli like treatment showed that genes regulated by multiple CRM-bound TFs exhibit heightened burst variability and faster responses, with enhancer perturbations directly altering burst kinetics without affecting steady-state levels. These findings underscore CRMs' role in probabilistic transcription, providing a mechanistic link between regulatory architecture and in heterogeneous cell populations.

Evolutionary Dynamics

Cis-regulatory elements (CREs) exhibit varying degrees of evolutionary conservation depending on their type and position relative to target genes. Core promoter motifs, such as the , are highly conserved across eukaryotes, reflecting their essential role in initiating transcription factor assembly and basal transcription machinery recruitment. In contrast, distal CREs like enhancers and silencers display greater sequence variability, allowing for flexible regulation while maintaining functional output through compensatory mechanisms or redundant elements. This dichotomy enables core elements to preserve fundamental transcriptional processes over billions of years, whereas distal regions evolve more rapidly to accommodate lineage-specific needs. CRE turnover, involving the gain, loss, and shuffling of elements, is a pervasive feature of , often driven by events and local duplications. Enhancers, in particular, undergo frequent turnover, with studies revealing near-complete replacement of retinal-specific over 500 million years in bilaterians, yet preserved regulatory logic through novel sequence motifs. In mammalian lineages, this manifests as lineage-specific gains, such as human-specific neural enhancers that have fixed at an accelerated rate, contributing to unique cognitive traits. Such dynamics highlight how CRE shuffling via mobile elements facilitates adaptive rewiring without disrupting core functions. Mutations in CREs have played a pivotal role in adaptive evolution, enabling rapid trait changes in response to environmental pressures. A classic example is in humans, driven by a variant in the MCM6 gene enhancer that emerged approximately 10,000 years ago, allowing adult milk digestion and spreading under pastoralist selection. Similarly, CRE divergence underlies speciation and morphological adaptations, as seen in threespine stickleback fish, where cis-regulatory changes in the ectodysplasin (EDA) signaling pathway repeatedly reduce armor plate number in freshwater populations, enhancing predator evasion. These cases illustrate how CRE variations provide a substrate for , promoting without altering protein-coding sequences. Recent genomic comparisons have further illuminated CRE contributions to , including heterogeneity. A 2024 meta-analysis of (T2D) loci identified genetic drivers of pathophysiological variation, with non-coding variants in islet-specific explaining subgroup differences in insulin response and beta-cell function. These findings underscore CREs' role in evolutionary to modern metabolic challenges, linking ancient regulatory flexibility to contemporary disparities.

Detection and Analysis

Experimental Techniques

Classical methods for identifying and validating cis-regulatory elements () include assays, which measure the regulatory activity of candidate sequences by linking them to a such as and quantifying expression levels in transfected cells. These assays provide direct evidence of enhancer or silencer function but are limited to testing individual elements. Another foundational technique is DNase-seq, which maps regions of open chromatin hypersensitive to DNase I digestion, highlighting potential like promoters and enhancers across the genome. Introduced in , DNase-seq offers high-resolution identification of active regulatory sites in mammalian cells. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is a cornerstone for mapping (TF) binding sites and epigenetic marks associated with CREs, such as H3K4me1 enrichment at enhancers. Developed in 2007, ChIP-seq enables genome-wide profiling of protein-DNA interactions , revealing how TFs and histone modifications regulate . For instance, H3K4me1 marks distal enhancers, while H3K27ac indicates active states. Advanced techniques have enhanced functional testing and precision. CRISPR activation (CRISPRa) and interference (CRISPRi) use catalytically dead Cas9 (dCas9) fused to activators or repressors to modulate CRE activity without altering DNA sequence, allowing high-throughput validation of regulatory potential. These methods, pioneered in 2013, provide causal insights into CRE contributions to gene expression. CUT&Tag, introduced in 2019, improves upon ChIP-seq by using protein A-Tn5 transposase for targeted tagmentation, requiring less input material and reducing background; a 2025 study demonstrated it recovers up to 50% more histone modification peaks missed by traditional ChIP-seq in ENCODE datasets. To probe three-dimensional organization, and its variants capture long-range interactions indicative of looping between and promoters. The original method, established in 2009, generates genome-wide contact maps, while targeted variants like HiChIP enrich for TF-anchored loops to identify functional enhancer-promoter pairs. Single-cell extends chromatin accessibility profiling to individual cells, uncovering cell-type-specific ; 2024 advancements in multimodal integration with have improved resolution for dissecting regulatory heterogeneity in tissues. For large-scale validation, massively parallel reporter assays (MPRAs) test thousands of CRE candidates simultaneously by incorporating barcoded reporters into cells and quantifying expression via sequencing. Developed around and refined since, MPRAs reveal sequence features driving regulatory activity, often integrating with computational tools for prioritization.

Computational Prediction

Computational prediction of cis-regulatory elements (CREs) relies on bioinformatics algorithms that analyze genomic sequences to identify potential regulatory regions, such as transcription factor binding sites (TFBSs). A foundational approach involves motif scanning, where position weight matrices (PWMs) represent the binding preferences of s. Tools like JASPAR provide a curated database of PWMs derived from experimentally validated TF binding profiles, enabling users to scan genomic sequences for matches that indicate potential . Similarly, the MEME Suite facilitates motif discovery by identifying overrepresented sequence patterns in unaligned DNA regions, often applied to ChIP-seq peaks to uncover novel TFBSs within . These methods score sequences based on PWM logarithms, where higher scores suggest stronger binding affinity, though they require thresholds to filter predictions. Advancements in have enhanced CRE prediction by modeling complex sequence dependencies. Deep learning models, such as , use convolutional neural networks to predict cell-type-specific regulatory activity directly from DNA sequences, trained on large-scale epigenomic datasets to forecast accessibility and patterns. Originally developed in 2018 and refined in subsequent updates, captures long-range interactions and has been benchmarked for its ability to generalize across species, with applications in variant effect prediction. More recent models build on this foundation, incorporating architectures to interpret cis-regulatory grammars with higher resolution. Integration of multi-omics data improves prediction accuracy by combining sequence features with epigenomic signals. The project employs pipelines that aggregate ChIP-seq, DNase-seq, and data to annotate candidate CREs (cCREs) genome-wide, using classifiers to distinguish active enhancers and promoters based on histone modifications and accessibility profiles. These approaches leverage supervised models trained on validated elements to predict CRE functionality across tissues. In plants, recent tools like those using on model species (e.g., and ) integrate sequence and data to map CREs, as demonstrated in 2024 frameworks that predict expression from proximal regulatory sequences. Despite progress, computational prediction faces challenges, including high false positive rates in non-conserved genomic regions where motifs occur by chance without functional impact. Incorporating chromatin context remains difficult, but emerging graph neural networks model enhancer-promoter interactions by representing genomic contacts as graphs, reducing errors in long-range CRE identification. Prediction accuracy is evaluated using metrics like area under the curve (AUC-ROC), which measures discrimination between true and false CREs, often exceeding 0.85 in models benchmarked against experimental data. Validation against massively parallel reporter assays (MPRA) confirms predictive power, with top models achieving correlation coefficients above 0.7 for activity scores. These benchmarks highlight the utility of computational tools while underscoring the need for experimental corroboration.

Examples and Applications

Prokaryotic Systems

In prokaryotic systems, cis-regulatory elements (CREs) play a pivotal role in coordinating through operons, enabling rapid responses to environmental cues such as availability. The in serves as a foundational example, where the and promoter regions act as key CREs regulated by the LacI and the (CAP). The , located downstream of the promoter, binds the LacI in the absence of , preventing from transcribing the structural genes lacZ, lacY, and lacA that encode β-galactosidase, lactose permease, and transacetylase, respectively. Upon binding to LacI, the dissociates, allowing transcription ; however, full requires low glucose levels, where CAP-cAMP binds an upstream activator in the promoter to enhance recruitment. This dual regulation exemplifies inducible control, with the and CAP as modular CREs that integrate negative and positive signals for efficient . Another prominent CRE in prokaryotes is the attenuator in the of E. coli, which fine-tunes tryptophan biosynthesis by coupling transcription and translation. The trp leader sequence contains the attenuator, a rho-independent terminator structure formed in the nascent RNA transcript when tryptophan levels are high. Under these conditions, charged tRNA^Trp enables efficient translation of a leader coding , preventing formation of an antiterminator hairpin and allowing the terminator to halt transcription before the structural genes trpEDCBA. In tryptophan scarcity, ribosome stalling at tandem Trp codons disrupts the leader translation, favoring the antiterminator structure and permitting read-through to express biosynthetic enzymes. This mechanism, distinct from repressor-operator interactions, highlights RNA secondary structures as dynamic CREs responsive to availability. The ara operon in E. coli illustrates versatile CRE function through dual operators controlled by the AraC protein, enabling both and repression of L- metabolism genes araBAD. In the absence of arabinose, AraC dimers bind to distant sites O2 and I1, forming a DNA loop that sterically hinders access to the araBAD promoter (pBAD). binding induces a conformational change in AraC, redirecting it to adjacent half-sites I1 and near pBAD, where it recruits to initiate transcription while also activating the araC promoter for autoregulation. This looping-mediated control demonstrates how CREs like the ara operators integrate sensing for switch-like responses in catabolic pathways. Quorum sensing in Vibrio fischeri employs CREs in the lux operon to synchronize bioluminescence with population density. The luxI promoter contains a lux box, a 20-bp inverted repeat CRE that binds the LuxR-autoinducer complex at high cell densities, activating transcription of the luxICDABEG operon encoding luciferase and accessory proteins. Low-density conditions prevent LuxR activation, repressing the lux box and minimizing energy expenditure on light production. This CRE-mediated feedback loop exemplifies intercellular communication, where the lux box integrates diffusible signals for collective behavior in symbiotic environments. In , engineered CREs in expand natural designs for programmable circuits, such as modified promoters and riboswitches that respond to novel inputs. For instance, hybrid promoters combining lac operator variants with orthogonal sites enable layered control in E. coli, allowing logic gates for . These synthetic elements, often derived from natural CREs like the lac operator but with altered binding affinities, facilitate applications in biosensors and , demonstrating the modularity of prokaryotic regulatory architecture.

Eukaryotic Genomes

In eukaryotic genomes, cis-regulatory elements (CREs) often operate within complex environments, facilitating long-range interactions through looping to modulate , in contrast to the more direct promoter-proximal access seen in prokaryotes. These elements include enhancers, silencers, and upstream activating sequences (UASs) that integrate signals from multiple transcription factors (TFs) to achieve precise spatiotemporal control. Model organisms like , , and humans provide key insights into their diversity and function. In budding yeast (), a simple , the HIS3 gene exemplifies UAS-mediated regulation. The HIS3 UAS consists of multiple binding sites for the TF Gcn4, which activates transcription in response to starvation via the general control pathway. This UAS, located approximately 250 base pairs upstream of the transcription start site, enhances HIS3 expression by recruiting the mediator complex and promoting , ensuring biosynthesis under stress conditions. The fruit fly showcases combinatorial CRE logic in developmental patterning through the even-skipped () gene enhancers. The locus contains at least five cis-regulatory modules (CRMs), including the stripe 3+7 enhancer, which drives expression in specific parasegmental stripes along the embryo's anterior-posterior axis. These CRMs integrate inputs from combinatorial TFs such as Bicoid, Hunchback, and Giant, with binding site arrangements dictating stripe-specific activation and repression; for instance, the stripe 2 enhancer relies on activator sites for Bicoid and repressor sites for Kruppel to position expression precisely. This modular system, involving chromatin looping between enhancers and the promoter, underlies the seven-stripe pattern essential for segmentation. In humans, the beta-globin locus control region (LCR) serves as a prototypical super-enhancer, comprising five DNase I-hypersensitive sites (HS1–HS5) located ~10–50 kb upstream of the beta-globin genes. This LCR orchestrates high-level, stage-specific expression of beta-globin in erythroid cells by forming loops that contact promoters via interactions with TFs like and EKLF, thereby opening the locus for transcription and insulating it from surrounding . The integrated action of these sites amplifies expression up to 100-fold compared to individual enhancers. Hox gene clusters in eukaryotes, such as the four paralogous clusters in mammals, rely on to enforce colinearity—the spatial and temporal correspondence between gene order and expression domains along the body axis. Intergenic enhancers and global control elements, like the Hoxb1 autoregulatory enhancer, drive sequential activation from 3' to 5' genes during embryogenesis, often through chromatin looping mediated by Polycomb and Trithorax group proteins. This ensures proper anterior-posterior patterning, with disruptions altering segment identity. Beyond model systems, in non-model eukaryotes adapt to environmental challenges, as seen in apple (Malus domestica) genes. A analysis identified multiple stress-responsive in the promoters of 56 Md genes, including ABA-responsive elements (ABREs) and drought-responsive motifs, which activate expression under abiotic stresses such as , , and salinity. These elements enable rapid chaperone production for and stress tolerance in apple trees.

Disease Associations

Mutations in cis-regulatory elements (CREs), such as single nucleotide polymorphisms (SNPs) within enhancers and promoters, have been implicated in various human diseases by disrupting regulation. A prominent example is the recurrent mutations in the promoter of the (TERT) gene, first identified in melanomas and subsequently in over 50 cancer types, where they create binding sites for transcription factors like , leading to TERT upregulation and reactivation that promotes cellular immortality. These TERT promoter mutations, occurring at high frequencies in gliomas (up to 80%) and thyroid cancers, are among the earliest genetic events in tumorigenesis and correlate with aggressive disease phenotypes. In cancer, CRE dysregulation often manifests through enhancer hijacking, where chromosomal rearrangements reposition s near potent enhancers, driving aberrant expression. For instance, in (T-ALL), the TAL1 is activated in approximately 25% of cases via translocation-mediated hijacking of enhancers, resulting in TAL1 overexpression that disrupts normal hematopoietic differentiation. Additionally, cancer cells exhibit "super-enhancer addiction," where amplified clusters of enhancers sustain high-level transcription of key s, rendering cells vulnerable to inhibitors targeting super-enhancer components like or CDK7, as demonstrated in and other malignancies.31727-5) CRE variants also contribute to complex traits and neurological disorders. Genome-wide association studies (GWAS) in 2024 have identified over 500 loci associated with , with the majority mapping to non-coding CREs in that modulate effector in metabolic tissues. In autism spectrum disorder (), disruptions to CREs bound by chromodomain DNA-binding protein 8 (CHD8), a high-confidence ASD risk factor, alter accessibility and expression of neurodevelopmental genes, leading to impaired neuronal function and behavioral deficits in model systems. Therapeutic strategies targeting CREs hold promise for treating genetic diseases. CRISPR-based editing of the BCL11A erythroid enhancer has reactivated production in , with phase 3 trials (e.g., CTX001/Casgevy) showing durable clinical remissions in over 90% of patients as of 2024, and ongoing expansions into 2025 evaluating long-term efficacy. Recent 2024 GWAS on substance use disorders (SUDs) highlight shared CRE variants across , , , and dependencies, enriching in brain cell-type-specific regulatory elements and informing precision interventions.

References

  1. [1]
    cis-regulatory element / cis regulatory element - Nature
    A noncoding DNA sequence in or near a gene required for proper spatiotemporal expression of that gene, often containing binding sites for transcription factors.
  2. [2]
    in the DNA, ... - As we know c... | Learn Science at Scitable - Nature
    Oct 14, 2010 · A cis-acting (or cis-regulatory) element refers to a region of the chromosomal DNA that regulates the transcription or expression of a gene that is on the same ...
  3. [3]
    Understanding how cis-regulatory function is encoded in DNA ... - NIH
    A major function of the genome is to specify patterns of gene expression through the action of cis-regulatory elements (CREs), but how that function is encoded ...
  4. [4]
    cis-Regulatory Elements in Plant Development, Adaptation, and ...
    cis-Regulatory elements encode the genomic blueprints that ensure the proper spatiotemporal patterning of gene expression necessary for appropriate development.
  5. [5]
    Cis-regulatory programs in the development and evolution of ...
    Jan 16, 2016 · Cis-regulatory elements (CREs), like enhancers, control gene expression in vertebrate appendage development. These elements are crucial for ...
  6. [6]
    Identification of cis-regulatory sequence variations in individual ...
    Oct 10, 2011 · Here we explore the existing methods and software for prediction of functional variation situated in the cis-regulatory sequences governing gene transcription ...Missing: seminal | Show results with:seminal
  7. [7]
    Developmental and evolutionary dynamics of cis-regulatory ...
    Jul 29, 2021 · Cis-regulatory elements (CREs), such as enhancers and promoters, and the transcription factors that bind to them are central to cell fate ...<|separator|>
  8. [8]
    6.1: Prokaryotic gene regulation - Biology LibreTexts
    Oct 22, 2021 · On the other hand, the proteins that bind to these cis-elements are called trans-regulators because (as diffusible molecules) they do not ...<|separator|>
  9. [9]
    Core promoter-specific gene regulation: TATA box selectivity and ...
    TBP binding to the TATA box is the first step in core promoter recognition and leads to a sharp DNA bend that is important for TFIIB recruitment and, hence, for ...
  10. [10]
    Five decades of eukaryotic transcription - Nature
    Aug 26, 2019 · His initial 1969 discovery, as a graduate student, of three nuclear RNA polymerase activities from multiple eukaryotes, distinguished by their ...
  11. [11]
    50+ years of eukaryotic transcription: an expanding universe of ...
    Aug 22, 2019 · The landmark 1969 discovery of nuclear RNA polymerases I, II and III in diverse eukaryotes represented a major turning point in the field.
  12. [12]
    Susumu Tonegawa – Biographical - NobelPrize.org
    I had very little knowledge of immunology, but decided to take Dr. Dulbecco's advice and sent an application letter to the Director of the Institute, Professor ...
  13. [13]
    Overview Article: Identifying transcriptional cis-regulatory modules in ...
    In animals, a significant portion of gene regulation results from the interaction of transcription factors (TFs) with specific cis-regulatory DNA sequences.Crm Discovery · Chromatin ``signatures'' As... · Comparative Genomics...Missing: paper | Show results with:paper
  14. [14]
    Insulators: many functions, many mechanisms - Genes & Development
    Insulators protect genes from outside enhancers or inactivating chromatin, acting as enhancer blockers and barriers, and are a phenotype, not a single element.Missing: history | Show results with:history
  15. [15]
    A conceptual history of the “regulatory genome” - ScienceDirect.com
    The models of eukaryotic gene regulation emerged at the end of a decade of the 60's, when another very important contribution to the study of gene regulation ...
  16. [16]
    The RNA polymerase II core promoter - Genes & Development
    The core promoter is the ultimate target of action of all of the factors that are involved in the regulation of transcription by RNA polymerase II.
  17. [17]
    Functional Characterization of Core Promoter Elements - NIH
    To define an interval (window) for a functional position of the TATA box, Inr, and DPE core elements, we considered the distribution of the element's occurrence ...
  18. [18]
    Molecular mechanisms that distinguish TFIID housekeeping from ...
    Dec 15, 2016 · For example, mammalian genes without a well‐defined TATA‐box but with a CpG island promoter are generally associated with housekeeping function ...
  19. [19]
    Tissue‐specific regulatory elements in mammalian promoters
    Jan 16, 2007 · We report on a systematic analysis of promoters controlling tissue‐specific expression in heart, kidney, liver, pancreas, skeletal muscle, testis and CD4 T ...
  20. [20]
    Cis-Regulatory Elements in Mammals - MDPI
    Enhancers are short DNA sequences within the genome that, by binding with various TF and cofactors, act upon promoters to regulate gene transcription. Enhancers ...
  21. [21]
    Mechanisms of enhancer action: the known and the unknown
    Apr 15, 2021 · The regulators consist of both the cis-regulatory elements (CRE) in the genome and relevant gene products; the latter includes proteins (as ...
  22. [22]
    Chromatin stretch enhancer states drive cell-specific gene ... - NIH
    Oct 14, 2013 · We propose that stretch enhancers form as pluripotent cells differentiate into committed lineages, to program important cell-specific gene ...
  23. [23]
    Enhancer–promoter specificity in gene transcription - Nature
    Apr 25, 2024 · Enhancers can increase bursting frequency to augment cognate gene expression via increased enhancer–promoter contact. Enhancer bursting, unlike ...
  24. [24]
    Histone H3K27ac separates active from poised enhancers ... - PNAS
    We find that histone H3K27ac distinguishes active enhancers from inactive/poised enhancer elements containing H3K4me1 alone.
  25. [25]
    Genome-wide Identification and Characterization of Enhancers ...
    Active enhancers are always associated with DNase I hypersensitive sites (DHSs), a hallmark of chromatin regions sensitive to the binding of transcription ...
  26. [26]
    Review Regulation of the l-arabinose operon of Escherichia coli
    Dec 1, 2000 · At pBAD, the AraC protein not only acts positively to stimulate transcription in the presence of arabinose, but also acts negatively in the ...
  27. [27]
    Widespread evidence of cooperative DNA binding by transcription ...
    Cooperative TF binding should be manifest in genomic sequences as biased arrangements of TF-binding sites. Here, we explore the extent and diversity of such ...Missing: seminal | Show results with:seminal
  28. [28]
    Structural basis for a molecular allosteric control mechanism ... - PNAS
    Our study reveals an allosteric mechanism whereby binding of a coactivator promotes formation of nonsymmetrical RAR homodimers with a 2∶1 stoichiometry.
  29. [29]
    Allosteric Transcriptional Regulation via changes in the Overall ...
    Upon binding a metal ion in the allosteric site, CueR switches into an activator conformation. It maintains all protein-DNA contacts but introduces torsional ...
  30. [30]
    The Mediator complex regulates enhancer-promoter interactions
    Jul 10, 2023 · In this process, ring-shaped Cohesin complexes translocate along chromatin and extrude progressively larger loops, until they are halted at CTCF ...
  31. [31]
    CTCF is a DNA-tension-dependent barrier to cohesin-mediated loop ...
    Apr 19, 2023 · Biochemically distinct cohesin complexes mediate positioned loops between CTCF sites and dynamic loops within chromatin domains. Preprint at ...
  32. [32]
    Landscape of cohesin-mediated chromatin loops in the ... - Nature
    Jul 29, 2020 · Approximately one-quarter of cohesin-mediated loops are anchored by enhancers across diverse cell types, representing the most enriched loop- ...
  33. [33]
    Topologically Associating Domains and Regulatory Landscapes in ...
    Jul 6, 2021 · Topologically associating domains are believed to facilitate interactions between cis-regulatory elements (CREs) and their target promoters, ...
  34. [34]
    Disruptions of Topological Chromatin Domains Cause Pathogenic ...
    May 7, 2015 · Disruptions of topological chromatin domains cause pathogenic rewiring of gene-enhancer interactions.
  35. [35]
    Principles of genome folding into topologically associating domains
    Apr 10, 2019 · Disruption of TAD structures by altering their boundaries can lead to ectopic contacts between cis-regulating elements and gene promoters, and ...Missing: misregulation | Show results with:misregulation
  36. [36]
    Phase Separation-Mediated Chromatin Organization and Dynamics
    We survey key recent findings on the mechanistic roles of phase separation in regulating the organization and dynamics of chromatin-based molecular processes.
  37. [37]
    Chromatin domains in the cell: Phase separation and condensation
    Chromatin domains are highly dynamic, with a loop lifetime of 10–30 min, as observed through live-cell imaging [16,17]. Understanding the properties of these ...
  38. [38]
    Live-cell imaging of chromatin contacts opens a new window into ...
    Jun 23, 2023 · The loops provide a dynamic structural scaffold to chromatin and impact gene expression, by directly recruiting enhancers to promoters or ...
  39. [39]
    Guide to Hi-C & Related Methods - Active Motif
    Apr 20, 2020 · There are many different ways to investigate the 3-D structure of chromatin, such as 3C, 4C, 5C, and Hi-C. This article discusses the basic ...
  40. [40]
    Droplet Hi-C enables scalable, single-cell profiling of chromatin ...
    Oct 18, 2024 · Here we introduce Droplet Hi-C, which uses a commercial microfluidic device for high-throughput, single-cell chromatin conformation profiling in droplets.
  41. [41]
    Hijacked enhancer-promoter and silencer-promoter loops in cancer
    At the chromatin-loop level, SVs can relocate enhancers or silencers from their original genomic loci to activate oncogenes or repress tumor suppressor genes.Missing: disruptions TALDO1
  42. [42]
    Enhancers dysfunction in the 3D genome of cancer cells - PMC
    Nov 6, 2023 · In this review, we summarize the current knowledge on how 3D genome architecture is dysregulated in cancer, with a focus on enhancer highjacking events.Missing: TALDO1 | Show results with:TALDO1
  43. [43]
    Shadow enhancers: Frequently asked questions about distributed ...
    Nov 15, 2011 · This review, in the form of a FAQ, addresses outstanding questions about “shadow enhancers”, quasi-redundant cis-regulatory elements, and their proposed roles ...
  44. [44]
    A logic-incorporated gene regulatory network deciphers principles in ...
    Apr 23, 2024 · A top-down model-guided framework unveils the role of regulatory logics underlying gene regulatory networks in cell fate decisions.<|control11|><|separator|>
  45. [45]
    Modeling combinatorial regulation from single-cell multi-omics ...
    Jul 24, 2025 · We introduce cRegulon to infer regulatory modules by modeling combinatorial regulation of transcription factors based on diverse GRNs from single-cell multi- ...
  46. [46]
    Structure and function of the feed-forward loop network motif - PubMed
    The FFL, a three-gene pattern, is composed of two input transcription factors, one of which regulates the other, both jointly regulating a target gene.Missing: auto- cis- GRNs
  47. [47]
    Functional cis-regulatory genomics for systems biology - PNAS
    The primary mechanism in gene regulation is interaction of transcription factors with cis-regulatory modules (CRM). For this basic reason, experimental ...
  48. [48]
    Quantitative Analysis of the Drosophila Segmentation Regulatory ...
    Cis-regulatory modules that drive precise spatial-temporal patterns of gene expression are central to the process of metazoan development. We describe a new ...
  49. [49]
    Extreme restructuring of cis-regulatory regions controlling a deeply ...
    At least a portion of this robustness stems from the extensive modularity of cis-regulatory regions–namely, multiple CREs and their higher order, and often ...
  50. [50]
    Cis-regulatory control of transcriptional timing and noise in response ...
    Apr 24, 2024 · Here, we used single-cell RNA-seq during an estrogen treatment time course and machine learning to identify predictors of expression timing and ...
  51. [51]
    Single-cell new RNA sequencing reveals principles of transcription ...
    Aug 28, 2024 · In this study, we investigated transcriptional bursting kinetics transcriptome-wide in primary fibroblasts through temporally resolved scRNA-seq ...
  52. [52]
    Identification and Distinct Regulation of Yeast TATA Box-Containing ...
    Jun 9, 2004 · Paradoxically, while the use of a TATA box is conserved among all eukaryotes, a wide variety of sequences can functionally replace a TATA box ...
  53. [53]
    Evolutionary Innovations in Conserved Regulatory Elements ...
    While enhancer turnover appears to be a common pattern, many enhancers are evolutionarily conserved over long time scales. In fact, even the majority of ...
  54. [54]
    Conservation and Evolution of Cis-Regulatory Systems in ... - NIH
    We systematically explored the conservation of regulatory networks in fungi by examining the cis-regulatory elements that govern the expression of coregulated ...
  55. [55]
    [PDF] Conservation of cis-regulatory codes over half a billion years of ...
    Nov 14, 2024 · nearly complete (i.e., ~99.5%) evolutionary turnover of retinal class-specific cis-regulatory elements beyond 500 million years. Deep ...
  56. [56]
    Fast evolving human-specific neural enhancers are associated with ...
    We report a 5-fold increased evolutionary rate of enhancers that are activated in neural tissues, leading to fixation of ~100 human-specific enhancers ...
  57. [57]
    Massively parallel discovery of human-specific substitutions ... - PNAS
    Our findings reveal mechanisms of enhancer evolution in humans and provide an entry point toward a functional understanding of the genetic changes underlying ...
  58. [58]
    Neanderthal adaptive introgression shaped LCT enhancer region ...
    Positive selection at the 2q21.3 locus in Europeans and Africans has long been tied to lactase persistence (LP) and milk-drinking and is a textbook example ...
  59. [59]
    Evolving New Skeletal Traits by cis-Regulatory Changes in Bone ...
    Our studies show that repeated evolution of armor plate size in freshwater sticklebacks has occurred in part by a cis-acting increase in expression of a ...
  60. [60]
    A recurrent regulatory change underlying altered expression ... - eLife
    Jan 28, 2015 · Armor plate changes in sticklebacks are a classic example of repeated adaptive evolution. Previous studies identified ectodysplasin (EDA) ...
  61. [61]
    Genetic drivers of heterogeneity in type 2 diabetes pathophysiology
    Feb 19, 2024 · Through the newly established Type 2 Diabetes Global Genomics Initiative, we present findings from a very large meta-analysis of T2D GWAS data, ...
  62. [62]
    Multi-omic human pancreatic islet endoplasmic reticulum and ...
    Oct 8, 2024 · Genome-wide comparative human islet ER/cytokine stress response maps. T2D variants overlap 86 stress-responsive cis-regulatory elements (CREs).
  63. [63]
    Using Reporter Gene Assays to Identify cis Regulatory Differences ...
    Transcriptional initiation is regulated by a combination of trans elements binding to cis regulatory sequences. The relative contribution of changes in cis and ...
  64. [64]
    Protocol to study cis-regulatory activity of GWAS loci for specific ...
    Sep 25, 2024 · Luciferase reporter assays can be used to characterize cis-regulatory activity of non-coding DNA sequences. A putative cis-regulatory element ( ...
  65. [65]
    DNase-seq: a high-resolution technique for mapping active gene ...
    DNase-seq: a high-resolution technique for mapping active gene regulatory elements across the genome from mammalian cells.Missing: cis- seminal paper
  66. [66]
    Role of ChIP-seq in the discovery of transcription factor binding sites ...
    This review addresses the important applications of ChIP-seq with an emphasis on its role in genome-wide mapping of transcription factor binding sites.
  67. [67]
    CRISPR activation and interference as investigative tools in ... - NIH
    CRISPR activation and interference (CRISPRa/i) technology offers the unprecedented possibility of achieving regulated gene expression both in vitro and in vivo.
  68. [68]
    CUT&Tag recovers up to half of ENCODE ChIP-seq histone ... - Nature
    Mar 27, 2025 · Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities.
  69. [69]
    Advances in single-cell omics and multiomics for high-resolution ...
    Mar 5, 2024 · Here, we present an overview of single-cell multimodal omics methods that coprofile the genome and transcriptome. a gDNA-mRNA sequencing (DR-seq) ...
  70. [70]
    Massively Parallel Reporter Assays for High-Throughput In Vivo ...
    Mar 29, 2023 · We review one such method, massively parallel reporter assays (MPRAs), in which the activities of thousands of candidate genomic regulatory elements are ...
  71. [71]
    MEME Suite: Introduction
    The MEME Suite allows you to discover novel motifs in collections of unaligned nucleotide or protein sequences, and to perform a wide variety of other ...MEME · Motif Databases · MEME Motif Formats · MEME-ChIP
  72. [72]
    A survey of motif finding Web tools for detecting binding site motifs in ...
    MEME requires specifying how the user believes the occurrences of the motifs are distributed among the sequences, for example, zero or one per sequence. MEME ...
  73. [73]
    calico/basenji: Sequential regulatory activity predictions ... - GitHub
    Basenji provides researchers with tools to: Train deep convolutional neural networks to predict regulatory activity along very long chromosome-scale DNA ...Missing: 2020 2024
  74. [74]
    Expanded encyclopaedias of DNA elements in the human ... - Nature
    Jul 29, 2020 · The comprehensive discovery and annotation of cis-regulatory elements encoded within the human and mouse genomes are major goals of ENCODE. The ...
  75. [75]
    Deep learning the cis-regulatory code for gene expression ... - Nature
    Apr 25, 2024 · We explored this link with the training of interpretable deep learning models predicting gene expression profiles from gene flanking regions of the plant ...
  76. [76]
    Advances in computational and experimental approaches for ...
    May 8, 2024 · DIRECT-NET: An efficient method to discover cis-regulatory elements and construct regulatory networks from single-cell multiomics data.Missing: simulations | Show results with:simulations
  77. [77]
    Understanding the roles of cis-regulatory elements is essential ... - NIH
    This review discusses the role of gene regulatory networks in enhancing understanding of transcriptional regulation and construction methods.
  78. [78]
    DIRECT-NET: An efficient method to discover cis-regulatory ...
    Jun 1, 2022 · We present DIRECT-NET, a machine-learning method based on gradient boosting, to identify genome-wide CREs and their relationship to target genes.
  79. [79]
    Genetic regulatory mechanisms in the synthesis of proteins - PubMed
    Genetic regulatory mechanisms in the synthesis of proteins. J Mol Biol. 1961 Jun:3:318-56. doi: 10.1016/s0022-2836(61)80072-7.Missing: operator | Show results with:operator
  80. [80]
    Mechanism of Activation of Catabolite-Sensitive Genes - PNAS
    In all cases the DNA used to stim- ulate synthesis was derived from c80dlac virus containing a normal lac operon. Page 4. VOL. 66, 1970 BIOCHEMISTRY: ZUBAY, ...
  81. [81]
    ISOLATION OF THE LAC REPRESSOR - PMC - NIH
    Proc Natl Acad Sci USA. 1966 Dec;56(6):1891–1898. doi: 10.1073/pnas.56.6.1891 ISOLATION OF THE LAC REPRESSOR Walter GilbertMissing: discovery | Show results with:discovery
  82. [82]
    Transcription termination at the trp operon attenuators of - PNAS
    Contributed by Charles Yanofsky, August 8,1977. ABSTRACT. Transcription termination at the attenuators of the trp operons of Escherichia coli and Salmonella ...
  83. [83]
    Transcription Attenuation: Once Viewed as a Novel Regulatory ... - NIH
    Transcription attenuation: once viewed as a novel regulatory strategy. Charles Yanofsky. Charles Yanofsky. 1. Department of Biological Sciences, Stanford ...Missing: original paper
  84. [84]
    THE L-ARABINOSE OPERON IN ESCHERICHIA COLI B/r - PNAS
    Abstract.-The product of the regulator gene araC in the L-arabinose gene complex exists in two functional states: P1, the repressor, and P2, theactivator,.
  85. [85]
    Arabinose-induced binding of AraC protein to araI2 ... - PubMed - NIH
    We propose that araI2 occupancy by AraC protein leads to RNA polymerase recognition of the araBAD promoter and that araI1 acts as a switch mechanism.
  86. [86]
    isolation and genetic analysis of functions from Vibrio fischeri
    The direction of transcription of lux operons was deduced from the orientation of mini-Mu insertions in the fusion plasmids. Induction of transcription of one ...Missing: discovery paper
  87. [87]
    Quorum sensing in Vibrio fischeri: elements of the luxl promoter
    The lux box, a 20 bp inverted repeat, is located in the luxl promoter region and is required for LuxR-dependent induction of the luminescence genes.Missing: CREs original paper
  88. [88]
    [PDF] Synthetic Biology Within the Operon Model and Beyond
    Mar 30, 2021 · Abstract. In 1961, Jacob and Monod proposed the operon model of gene regulation. At the model's core was the modular assembly of regulators, ...<|control11|><|separator|>
  89. [89]
    Engineering Synthetic cis-Regulatory Elements for Simultaneous ...
    Dec 18, 2015 · Recognition of cis-regulatory elements by transcription factors (TF) at target promoters is crucial to gene regulation in bacteria.Missing: seminal paper
  90. [90]
    GCN4 protein, synthesized in vitro, binds HIS3 regulatory sequences
    GCN4 protein binds specifically to the 20 bp region of the HIS3 gene that is critical for transcriptional regulation in vivo and contains the TGACTC sequence ...Missing: seminal paper UAS
  91. [91]
    Combinatorial activation and concentration-dependent repression of ...
    Here, we analyze an enhancer element from the even skipped (eve) gene, which activates and positions two stripes of expression (stripes 3 and 7) in blastoderm ...Missing: Boolean | Show results with:Boolean
  92. [92]
    Super-enhancer mediated regulation of adult β-globin gene ...
    Locus control regions (LCRs) are SEs that have been demonstrated to provide position-independent expression of linked genes in transgenic mice through dominant ...
  93. [93]
    Hox genes regulation in vertebrates - Soshnikova - 2014
    Jul 5, 2013 · CIS-REGULATORY ELEMENTS. Collinear patterns of Hox genes expression are controlled by multiple proximal regulatory elements scattered ...
  94. [94]
    Expression analysis of the apple HSP70 gene family in abiotic stress ...
    Oct 14, 2024 · The cis-regulatory factors related to abiotic stress include low-temperature response elements, drought-induced response elements and stress- ...
  95. [95]
    Implicating type 2 diabetes effector genes in relevant metabolic ...
    Sep 6, 2024 · Genome-wide association studies (GWAS) have identified hundreds of type 2 diabetes loci, with the vast majority of signals located in non-coding ...
  96. [96]
    TERT promoter mutations occur frequently in gliomas and a subset ...
    We found that tumors could be divided into types with low (<15%) and high (≥15%) frequencies of TERT promoter mutations.
  97. [97]
    TAL1 activation in T-cell acute lymphoblastic leukemia - NIH
    May 1, 2023 · Such deregulation mechanisms occur in cis and involve the 'hijacking' of enhancer elements leading to oncogene activation. The SIL-TAL1 ...
  98. [98]
    CHD8 regulates neurodevelopmental pathways associated ... - PNAS
    Truncating mutation of chromodomain helicase DNA-binding protein 8 (CHD8) represents one of the strongest known risk factors for autism spectrum disorder (ASD).
  99. [99]
    Interview: Multiplex Base Editing Safely Enhances Sickle Therapy
    Sep 29, 2025 · The FDA-approved therapy Casgevy, targeting the +58-kb BCL11A enhancer, shows promising but inconsistent results, with patients retaining ...
  100. [100]
    Shared and unique 3D genomic features of substance use disorders ...
    Jul 19, 2024 · Recent genome-wide association studies (GWAS) revealed shared genetic components among substance use disorders (SUDs).