Fact-checked by Grok 2 weeks ago

Eukaryotic transcription

Eukaryotic transcription is the enzymatic process by which is synthesized from within the of eukaryotic cells, enabling the expression of genetic information through the production of various types essential for cellular function. Unlike prokaryotic transcription, which occurs in the with a single , eukaryotic transcription involves three distinct polymerases: (Pol I), which synthesizes most ribosomal RNAs (rRNAs); (Pol II), responsible for transcribing all protein-coding messenger RNAs (mRNAs) and many non-coding RNAs; and (Pol III), which produces transfer RNAs (tRNAs), 5S rRNA, and other small RNAs. This compartmentalization and specialization allow for precise control over in complex multicellular organisms. The process unfolds in three main stages: , , and termination. During , RNA polymerases assemble with general transcription factors at promoter regions to form preinitiation complexes (PICs), often facilitated by sequence-specific DNA-binding factors and coactivators like the Mediator complex. follows promoter clearance, where the polymerase travels along the DNA, synthesizing RNA at rates of approximately 20–80 per second for Pol II, while overcoming nucleosomal barriers through and histone modifications. Termination occurs upon recognition of specific signals, such as sites for Pol II transcripts, leading to polymerase release and RNA processing events like 5'-capping, splicing, and 3'- that are often co-transcriptionally coupled. Regulation of eukaryotic transcription is multifaceted and primarily occurs at the stage, involving enhancers, silencers, and a vast array of transcription factors that respond to cellular signals, developmental cues, and environmental stimuli. The C-terminal domain (CTD) of Pol II, composed of heptapeptide repeats, plays a crucial role by undergoing dynamic —such as at serine 5 during and serine 2 during —to recruit and regulatory factors. structure further modulates accessibility, with variants, covalent modifications (e.g., and ), and ATP-dependent remodelers like ensuring that transcription is both activated and repressed as needed for processes like cell differentiation and stress response. This elaborate regulatory network underscores the evolutionary adaptation of eukaryotic transcription to support the complexity of higher organisms.

Core Transcription Machinery

RNA polymerases

Eukaryotic cells contain three distinct nuclear RNA polymerases, each specialized for transcribing specific classes of RNA essential for cellular function. (Pol I) primarily synthesizes the majority of (rRNA) precursors, excluding 5S rRNA, accounting for up to 60% of total cellular transcription in rapidly dividing cells. This polymerase is predominantly localized in the , where it associates with (rDNA) clusters to facilitate . Structurally, human Pol I comprises 13 subunits, including a large catalytic subunit and accessory components that enable high processivity on rDNA templates. RNA polymerase II (Pol II) is responsible for transcribing all messenger RNAs (mRNAs) and most small nuclear RNAs (snRNAs), which are critical for protein coding and RNA splicing, respectively. As the largest of the eukaryotic polymerases, Pol II consists of 12 subunits, with the largest subunit featuring a unique C-terminal domain (CTD) composed of tandem heptapeptide repeats with the consensus sequence YSPTSPS. The CTD's phosphorylation states, particularly on serine residues (e.g., Ser2 and Ser5), dynamically regulate transcription stages from initiation to termination by serving as a binding platform for RNA processing factors. RNA polymerase III (Pol III) transcribes transfer RNAs (tRNAs), 5S rRNA, and various small non-coding RNAs involved and RNA processing. This polymerase is composed of 17 subunits in humans, including core elements shared with other polymerases and unique lobes for promoter recognition on internal control regions of its target genes. Pol III exhibits distinct promoter specificity compared to Pol I and Pol II, relying on III C (TFIIIC) for recruitment to A- and B-box elements in tRNA and 5S rRNA genes. All three polymerases are multi-subunit enzymes with conserved core architectures derived from bacterial RNA polymerase, featuring a claw-like structure with a clamp domain that grips DNA to stabilize the transcription bubble, as exemplified in high-resolution structures of Pol II. Evolutionary conservation is evident in the shared catalytic subunits (e.g., RPB1 homologs) across Pol I, II, and III, reflecting a common ancestry while allowing specialization through unique accessory subunits. These polymerases differ markedly in sensitivity to the fungal toxin α-amanitin: Pol II is highly sensitive (inhibited at nanomolar concentrations), Pol III shows intermediate resistance (requiring micromolar levels), and Pol I is largely resistant even at high doses. This differential inhibition has been instrumental in distinguishing their activities in biochemical assays.

General transcription factors

In eukaryotic cells, general transcription factors (GTFs) are a set of basal proteins that enable polymerases to accurately recognize promoters and initiate transcription, distinct from sequence-specific activators. For (Pol II), which transcribes protein-coding genes, the GTFs include TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIIH, which collectively form the preinitiation complex (PIC) in a stepwise manner. These factors ensure promoter-specific initiation by interacting with core promoter elements like the and facilitating DNA unwinding. TFIID initiates PIC assembly by binding the TATA box via its TATA-binding protein (TBP) subunit, which bends the DNA, while TBP-associated factors (TAFs) contact other promoter motifs and stabilize the interaction; TFIIA enhances this binding by preventing non-specific associations. TFIIB then docks adjacent to TBP, recognizing the BRE element and bridging to Pol II to define the start site. Pol II arrives pre-bound to TFIIF, a heterotetramer that positions the enzyme over the promoter and suppresses non-productive binding. TFIIE recruits TFIIH and aids in open complex formation, while TFIIH's XPB and XPD helicase subunits unwind ~15 bp of DNA and its cdk7 kinase phosphorylates the Pol II C-terminal domain (CTD) heptapeptide repeats to promote escape into elongation. The assembly order is TFIID → TFIIA/TFIIB → Pol II-TFIIF → TFIIE → TFIIH. The Mediator complex, comprising over 20 subunits, serves a basal co-activator role by stabilizing the PIC through interactions with Pol II's CTD and GTFs. For RNA polymerase I (Pol I), which synthesizes from rDNA promoters, the primary GTFs are SL1 and UBF. SL1, a TBP-containing with TAF subunits (TAF1A, TAF1B, TAF1C, TAF1D, TAF12), binds the core promoter (roughly -45 to +20 relative to the start site) to specify Pol I recruitment. UBF, an HMG-box protein, binds upstream control elements (UCE and proximal control element), enhancing DNA flexibility and stabilizing SL1 binding to facilitate PIC formation. TIF-IA (mammalian Rrn3 homolog) associates with Pol I's stalk and bridges it to the SL1-promoter , enabling closed-to-open transition. Assembly proceeds with UBF and SL1 binding first, followed by Pol I-TIF-IA recruitment. RNA polymerase III (Pol III) transcribes short non-coding RNAs like tRNAs and 5S rRNA using TFIIIA, TFIIIB, and TFIIIC, which recognize internal promoters. TFIIIA, featuring nine zinc fingers, specifically binds the internal control region of 5S rRNA genes to initiate assembly. TFIIIC, a large complex, recognizes A-box and B-box elements in internal promoters of tRNA and other genes, then recruits TFIIIB—a complex of TBP, TFIIB-related factor 1 (Brf1), and Bdp1—which positions Pol III via its TFIIF-like and TFIIE-like subunits. For type 1 promoters (e.g., 5S rRNA), the order is TFIIIA → TFIIIC → TFIIIB → Pol III; for type 2 (tRNA), it is TFIIIC → TFIIIB → Pol III; type 3 (U6 snRNA) uses SNAPc instead of TFIIIC. Across Pol I, II, and III, share conserved modules like TBP for promoter anchoring and TFIIB/Brf homologs for polymerase docking, underscoring mechanistic similarities.

Initiation of Transcription

Eukaryotic promoters and enhancers

In eukaryotic transcription, promoters are cis-regulatory DNA sequences that specify the site of transcription initiation by RNA polymerases. For RNA polymerase II (Pol II), which transcribes mRNA-encoding genes, the core promoter is a modular region typically spanning about 40 base pairs upstream to 40 base pairs downstream of the transcription start site (TSS). This region contains sequence motifs that collectively determine the basal level of transcription and facilitate recognition by general transcription factors. One prominent core promoter element is the TATA box, with the consensus sequence TATAAA, positioned approximately 25 to 35 base pairs upstream of the TSS. Found in roughly 10-30% of human Pol II promoters, the TATA box is more prevalent in genes with regulated, tissue-specific expression, such as those involved in development or stress responses, compared to housekeeping genes that often lack it. The Initiator (Inr) element, encompassing the TSS, has the consensus sequence YYANWYY (Y = pyrimidine, N = any nucleotide, W = A or T) and is present in nearly all Pol II-transcribed genes, aiding in precise start site selection. Additional Pol II core elements include the TFIIB recognition element (BRE), located upstream of the with consensus sequences like SSRCGCC (BREu) or RKYACCC (BREd), which modulates initiation efficiency. Downstream elements such as the downstream promoter element (DPE), with consensus RGWYVT at +28 to +32 bp relative to the TSS, and the motif ten element (MTE), consensus CSRSC at +20 to +27 bp, cooperate with Inr to enhance promoter strength, particularly in TATA-less promoters common in and mammals. These elements exhibit variability; for instance, housekeeping genes like those encoding or GAPDH often feature CpG-rich, TATA-less promoters with dispersed Inr and DPE usage, while tissue-specific genes, such as β-globin, more frequently incorporate TATA boxes for sharp TSS selection. RNA polymerase I (Pol I) promoters, responsible for ribosomal RNA (rRNA) synthesis from rDNA repeats, consist of a core element spanning -45 to +20 bp around the TSS and an upstream control element (UCE) from -156 to -107 bp. The core element directs basal initiation, while the UCE boosts transcription efficiency up to 10-fold, with both elements showing species-specific sequence conservation but high overall homology in vertebrates. RNA polymerase III (Pol III) promoters, which drive small non-coding RNAs like tRNAs and 5S rRNA, are classified into three types based on element positioning. Type 1 promoters, as in 5S rRNA genes, feature internal A box (consensus TRGCNNGNG) and C box (GWTCRNNAGC) sequences within the transcribed region, separated by an intermediate element. Type 2 promoters, typical for tRNA genes, contain internal A box (TGGCNNAGTG) and B box (GGTTCGATTCC) elements downstream of the TSS. Type 3 promoters, found in U6 snRNA genes, resemble Pol II promoters with an upstream proximal sequence element (PSE) at -60 to -40 bp and a distal sequence element (DSE) further upstream, often including a . Enhancers are distal cis-regulatory elements that increase transcription rates from promoters, often located up to 100 kilobases away, either upstream or downstream, and function independently of orientation relative to the . First identified in the viral , enhancers loop to contact promoters via interactions, enabling tissue-specific regulation; for example, the enhancer in B cells drives lymphoid-specific expression through combinatorial binding motifs. Unlike core promoters, enhancers show greater sequence diversity and are enriched in housekeeping genes for broad activity, whereas tissue-specific enhancers, like those in muscle genes, exhibit precise motif arrangements for developmental control.

Preinitiation complex assembly

The assembly of the preinitiation complex () in eukaryotic transcription initiation for (Pol II) occurs through a highly ordered, sequential of general transcription factors (GTFs) to the core promoter, typically involving recognition of elements such as the . The process begins with the binding of TFIID, a multi-subunit complex containing the (TBP), which recognizes and sharply bends the DNA at the , facilitating subsequent interactions. TFIIA then associates with the TFIID-DNA complex, stabilizing the TBP-DNA interaction and preventing non-specific binding. Following this, TFIIB binds to the TBP-DNA-TFIIA platform, bridging the promoter to the arriving Pol II core enzyme, which is escorted by TFIIF; TFIIF helps position Pol II over the start site and stabilizes the downstream DNA duplex. The concludes with TFIIE, which binds to the partial complex and recruits TFIIH, a multi-functional / complex essential for completing formation. The resulting closed complex maintains the promoter DNA in a double-stranded configuration, with Pol II positioned upstream of the transcription start site (+1). TFIIH's XPB subunit harbors activity that supports the structural integrity of this closed state, although is poised for subsequent steps. The fully assembled Pol II PIC is a massive macromolecular assembly, approximately 2 MDa in size, comprising over 50 polypeptides that collectively ensure precise positioning of the catalytic at the +1 for accurate initiation. This stability is critical for the complex's role in scanning the promoter to identify the start site, enhancing transcriptional fidelity. In contrast, PIC assembly for (Pol I), which transcribes genes, involves the upstream binding factor (UBF) and the selectivity factor (a TBP-TAF complex analogous to TFIID). UBF binds to upstream control elements and nucleates , while SL1 recognizes the core promoter and recruits Pol I alongside the core factor CF or Rrn3. For (Pol III), which handles small non-coding RNAs, assembly starts with TFIIIC binding to internal promoter elements (A and B boxes), followed by recruitment of TFIIIB (comprising TBP, Brf1, and Bdp1), which then positions Pol III at the start site. These Pol I and Pol III mechanisms highlight specialized adaptations from the Pol II paradigm, tailored to their distinct promoter architectures.

Promoter melting and open complex formation

Following the assembly of the preinitiation complex () at the promoter, promoter melting marks the transition to the open complex, where the DNA duplex unwinds to expose single-stranded template DNA for initial synthesis. This critical step is driven by the general TFIIH, whose XPB subunit (Ssl2 in ) exhibits ATP-dependent activity that rotates and threads downstream DNA into the () active site cleft, generating torsional stress that facilitates DNA unwinding. The resulting transcription bubble typically spans approximately 12-15 base pairs, centered around the transcription start site, with an initial ~6 bp opening that expands upon by XPB. In the open complex, the unwound promoter DNA separates into template and non-template strands, positioning the single-stranded template within the Pol II while Pol II adopts a post-translocation state, ready for the first formation without an initial RNA nucleotide. This ATP-dependent process requires hydrolysis by XPB to power the helicase-like translocation, ensuring stable bubble formation despite the energy barrier of DNA duplex separation. TFIIE plays a key role in stabilizing the open complex by interacting with Pol II and TFIIH, buttressing the Pol II clamp to prevent collapse of the unwound DNA and enhancing XPB activity through conformational regulation. High-resolution cryo-EM structures of the eukaryotic open complex reveal the precise separation of and non-template strands, with TFIIH in expanded conformations that accommodate the and highlight XPB's arched insertion into the downstream DNA duplex. These models demonstrate how the non-template strand is expelled from the Pol II cleft, while the strand threads through the , underscoring the coordinated dynamics of PIC components during melting.

Abortive initiation and promoter escape

Following the formation of the open complex, (Pol II) engages in , a phase characterized by the repeated synthesis and release of short transcripts typically ranging from 2 to 10 in length without polymerase clearance from the promoter. This process involves multiple cycles where Pol II initiates chain extension but dissociates prematurely due to the instability of the early transcription complex and high off-rates, resulting in non-productive transcripts that do not progress to . In eukaryotic systems, is less pronounced than in , with fewer cycling events, as evidenced by assays showing reduced abortive products beyond 4-10 nt when NTP concentrations are physiological (around 200 µM). Promoter escape marks the critical transition from this inefficient initiation to stable , where Pol clears the promoter and forms a processive elongation complex. A key regulatory event is the of serine 5 (Ser5) on the C-terminal (CTD) heptapeptide repeats of Pol 's largest subunit by the subunit of TFIIH, specifically Kin28 in or Cdk7 in metazoans. This disrupts stable interactions between the unphosphorylated CTD and preinitiation complex (PIC) components, such as and general transcription factors, thereby releasing Pol from promoter-proximal contacts and facilitating escape around 15-20 nt of synthesis. Structural studies reveal that escape proceeds in three major steps: displacement of the TFIIB B-reader element by the growing , collapse of the transcription bubble at approximately 17-18 bases, and dissociation of initiation factors like TFIIB and TFIIH, enabling the polymerase to advance beyond +30 nt. The scaffold complex model describes how the complex and the Pol II holoenzyme contribute to this transition, with initially stabilizing the and then dissociating upon CTD to allow promoter escape while leaving a residual "scaffold" at the promoter for subsequent initiation events.00257-3) In this framework, the holoenzyme—comprising Pol II, , and associated factors—undergoes a compositional shift, where TFIIH-mediated CTD Ser5 promotes release, enhancing the efficiency of Pol II clearance without fully committing to elongation.00257-3) This model is supported by and evidence showing that remains partially bound post-escape, coordinating reinitiation in a "transcription factory" manner. The efficiency of promoter escape is modulated by promoter strength, with stronger promoters—such as those featuring optimal to transcription start site spacing (30-31 )—exhibiting faster escape rates and reduced abortive cycling due to more stable initial complexes. For instance, canonical elements promote bubble collapse at 9-10 nt, accelerating clearance compared to weaker or non-consensus promoters, which prolong and lower productive transcription yields. This promoter-dependent variation underscores how sequence context influences the energetic barriers to escape, ensuring regulated .

Elongation of Transcription

Elongation factors and processivity

During the elongation phase of eukaryotic transcription, (Pol II) is assisted by various accessory proteins known as factors, which enhance its processivity—the ability to synthesize long RNA transcripts without dissociating from the DNA template. These factors counteract barriers such as nucleosomes and secondary DNA structures, ensuring efficient progression at rates sufficient for cellular demands. A key positive elongation factor is P-TEFb, composed of cyclin-dependent kinase 9 (CDK9) and cyclin T, which promotes productive elongation by phosphorylating serine 2 (Ser2) residues on the C-terminal domain (CTD) heptapeptide repeats of Pol II. This phosphorylation releases Pol II from promoter-proximal pausing and facilitates the transition to sustained elongation by modifying interactions with other factors. P-TEFb is recruited to gene bodies in a transcription-dependent manner, amplifying its impact on global mRNA synthesis. The negative elongation factors DSIF (Spt4/Spt5 heterodimer) and NELF initially contribute to promoter-proximal pausing but are repurposed for upon phosphorylation by P-TEFb. Phosphorylation of DSIF and NELF dissociates NELF from the elongation complex, converting DSIF into a positive factor that stabilizes Pol II on the template and suppresses premature termination. This transition is essential for efficient escape from initiation into , enabling processive transcription. DSIF (Spt4/Spt5) and the histone chaperone Spt6 further support elongation by maintaining integrity during Pol II passage. Spt4/Spt5 binds to Pol II and nascent , promoting forward translocation and preventing , while Spt6 associates with the to facilitate disassembly ahead of the transcription bubble and reassembly behind it through H3-H4 binding and recycling. These actions preserve epigenetic marks and prevent chromatin disruption, allowing Pol II to traverse nucleosomal arrays without stalling. In eukaryotes, these factors collectively enable Pol II to elongate at speeds of approximately 20-60 nucleotides per second in vivo, varying by gene context and cell type, which is critical for timely gene expression. DSIF and Spt6, in particular, play anti-backtracking roles by tethering Pol II to DNA, reducing dissociation events and enhancing overall processivity. The super-elongation complex (SEC), incorporating AFF4, ELL proteins, ENL/AF9, and P-TEFb, further boosts processivity for rapid transcription of specific genes, such as immediate early genes and those involved in development. AFF4 scaffolds the complex, while ELL prevents Pol II pausing by suppressing transient termination, enabling burst-like transcription rates that exceed standard elongation. SEC is particularly vital in contexts requiring high transcriptional output, like cellular stress responses.

Transcription fidelity and proofreading

During eukaryotic transcription elongation by RNA polymerase II (Pol II), fidelity is maintained through multiple checkpoints to ensure accurate nucleotide incorporation, with an overall error rate of approximately 1 in 10^6 nucleotides. The primary mechanism begins with base selection in the active site, where the trigger loop of the RPB1 subunit collapses to position the incoming nucleoside triphosphate (NTP) for correct Watson-Crick base pairing with the template DNA, discriminating against mismatches with kinetic rates that are 10^3 to 10^4 times slower than for correct NTPs. This selectivity arises from the precise geometry of the active site, which stabilizes matched base pairs while imposing energetic penalties on mismatches, resulting in an initial misincorporation error rate of about 1 in 10^4 to 10^5. Post-incorporation fidelity is enhanced by kinetic barriers that hinder the extension of mismatched , preventing their propagation into the chain. Mismatch-specific strategies vary: purine-purine mismatches (e.g., G·A) induce structural distortions that slow translocation and extension by over 4,000-fold relative to correct pairs, while pyrimidine-pyrimidine mismatches (e.g., T·U) form wobble pairs that fray the 3' end, displacing it by more than 2 Å from the catalytic site and disrupting metal ion coordination essential for formation. These barriers provide a window for detection before further elongation. Proofreading occurs when a mismatch triggers backtracking of the transcription complex, extruding the erroneous 3' RNA end into the secondary channel of Pol II. The rudder and lid domains of RPB1 play critical roles here: the rudder separates the RNA from the DNA template downstream of the active site, while the lid grips the RNA upstream, stabilizing the RNA:DNA hybrid and facilitating backtracking upon mismatch detection by altering hybrid stability. This backtracked state positions the mismatched RNA for cleavage, where Pol II's intrinsic endonucleolytic activity hydrolyzes the phosphodiester bond 2-9 nucleotides upstream, excising the error as a dinucleotide or oligonucleotide. The proofreading step further reduces the error rate by 10- to 100-fold, depending on the mismatch type. The factor TFIIS ( IIS) significantly amplifies efficiency by binding to the secondary channel of backtracked Pol II, opening the to promote of mismatched up to 10^5 times faster than the intrinsic rate. TFIIS stabilizes the backtracked conformation and coordinates a second catalytic metal ion, enabling precise endonucleolytic at the mismatch site, which is particularly effective for single-nucleotide errors. This stimulated ensures during , with TFIIS-dependent accounting for much of the overall accuracy . factors like Spt4/5 can indirectly support by modulating dynamics, though their primary role is processivity.

Pausing, poising, and backtracking

In eukaryotic transcription , promoter-proximal pausing occurs shortly after , where (Pol II) stalls approximately 20-60 nucleotides downstream of the transcription start site, preventing premature progression into productive . This pause is mediated by the negative (NELF) complex and the DRB sensitivity-inducing factor (DSIF), which form a stable complex with the initiating form of Pol II (Pol IIa) and the nascent transcript. NELF binds directly to the emerging chain, while DSIF associates with Pol II to restrict its forward movement, thereby poising the polymerase for rapid activation upon receiving appropriate regulatory signals. A classic example of promoter-proximal pausing and poising is observed in the heat shock genes, such as , where Pol II accumulates in a paused state under non-stress conditions, enabling swift transcriptional activation in response to heat shock. This poised configuration, established by NELF and DSIF, allows for the near-instantaneous recruitment of additional Pol II molecules and the transition to the elongating form (Pol IIo) upon induction, facilitating a robust protective response against cellular stress. In uninduced cells, the paused Pol II at is transcriptionally engaged but stalled, ensuring that gene expression can be rapidly amplified without the delay of initiation complex assembly. Backtracking represents another regulatory mechanism during , wherein Pol II undergoes reverse translocation due to slippage in the RNA:DNA hybrid, displacing the 3' end of the nascent into the polymerase's secondary by 8-14 nucleotides.00598-1) This backward movement stabilizes misaligned complexes, leading to transcriptional pausing or , particularly at obstacles or during error correction.00142-1) Resolution of backtracked states requires cleavage of the extruded RNA segment to realign the 3' end with the , restoring elongation competence.00142-1) In eukaryotes, the GreB homolog transcription factor IIS (TFIIS) enhances Pol II's intrinsic endonucleolytic activity to cleave backtracked transcripts, typically producing fragments of 8-14 that match the extent of reversal.00598-1) TFIIS inserts into the polymerase's secondary channel, donating acidic residues to stabilize the catalytic magnesium ions and promote . This cleavage not only rescues stalled complexes but also contributes to transcriptional by removing mismatched or aberrant segments, as backtracking often occurs following incorporation errors.00142-1) In the context of promoter-proximal pausing, such as at , TFIIS is essential for efficient release from stall sites, further underscoring its role in poising and activation.00720-8)

Co-transcriptional RNA processing

Co-transcriptional RNA processing encompasses the modification events that mature the nascent pre-mRNA during transcription elongation by (Pol II), ensuring efficient coupling between synthesis and quality control of the transcript. These processes are orchestrated primarily through the C-terminal domain (CTD) of Pol II's largest subunit (RPB1), a tail of 52 heptapeptide repeats (consensus YSPTSPS) that undergoes dynamic . Phosphorylation patterns on CTD residues, particularly serines 2 and 5, serve as a platform for recruiting distinct machineries, preventing uncapped or improperly spliced RNAs from accumulating and linking processing fidelity to transcriptional progression. The initial modification, 5' capping, occurs co-transcriptionally shortly after transcription , when the nascent RNA chain reaches 20-30 in length. This step involves the addition of a 7-methylguanosine (m7GpppN) to the 5' triphosphate end by a tripartite complex: RNA triphosphatase (removes the γ-phosphate), guanylyltransferase (adds GMP), and methyltransferase (adds the ). Recruitment of the to Pol II is mediated by direct binding of its guanylyltransferase subunit to the CTD phosphorylated at serine 5 (Ser5-P), a mark established by the Kin28/Cdk7 during promoter clearance; this interaction allosterically activates the and restricts capping to Pol II transcripts.00446-3) Splicing, the removal of introns and ligation of exons, also proceeds co-transcriptionally for most s, with assembly initiated as soon as splice sites emerge from the exit channel. The U1 small nuclear ribonucleoprotein () is the first component recruited, binding the 5' splice site at exon- boundaries via base-pairing with its U1 snRNA; this commitment complex forms within seconds of transcription across the site in and mammalian systems. Subsequent maturation involves U2 binding the and recruitment of tri-snRNP particles, facilitated by interactions with the CTD; at serine 2 (Ser2-P), catalyzed by kinases like P-TEFb (Ctk1 in ), promotes association of splicing factors such as U2AF and , enhancing splice site recognition and enabling sequential removal during elongation. Preparation for 3' end formation begins during , with and factors scanning the nascent for processing signals. The and specificity factor (CPSF) complex, comprising subunits like CPSF-160 and CPSF-73, recognizes the signal (PAS) AAUAAA (or variants like AUUAAA) located 10-30 upstream of the site; this hexameric is bound by the recognition (RRM) domains of CPSF-160, often in concert with downstream GU/U-rich elements recognized by the stimulation factor (CstF). CPSF recruitment to the elongating Pol II occurs via interactions with the Ser2-P CTD and associated factors, positioning the endonuclease CPSF-73 for precise and subsequent poly(A) polymerase addition of the upon signal .

Termination of Transcription

Pol II-dependent termination mechanisms

In eukaryotic cells, transcription termination by RNA polymerase II (Pol II) primarily occurs through the torpedo model, where cleavage and polyadenylation at the poly(A) site generate a free 5' end on the downstream RNA fragment that is rapidly degraded by the 5'-3' exonuclease Rat1 (in yeast) or Xrn2 (in humans), acting as a "torpedo" to catch up with and displace the elongating Pol II complex from the DNA template. This process is tightly coupled to 3' end formation, ensuring efficient release of the mature mRNA and recycling of Pol II for subsequent rounds of transcription. The model integrates elements of an older allosteric mechanism, where conformational changes in Pol II triggered by processing factors contribute to weakening polymerase-DNA interactions, but the nuclease chase is the key effector for termination. The cleavage at the poly(A) site is mediated by the and specificity () , which recognizes the AAUAAA signal and associated downstream elements, producing the upstream for and the downstream fragment for degradation. Rat1/Xrn2 loads onto this 5' end and processively degrades the nascent , requiring association with the Rai1 cofactor for efficient activity near Pol II; upon reaching the polymerase, the displaces Pol II by disrupting its interactions with the RNA-DNA hybrid and template DNA. loading and function depend on the including Pcf11 and Ctf20 (in ), which bridge the processing machinery to Pol II via interactions with the phosphorylated C-terminal domain (CTD) of the largest subunit, Rpb1. Specifically, of serine 2 (Ser2-P) in the CTD heptapeptide repeats (YSPTSPS) during recruits termination factors such as Pcf11 and Rtt103, which bind cooperatively to Ser2-P marks to stabilize the subcomplex at the 3' end of genes and facilitate access. As a mechanism, Pol II employs intrinsic pausing and , where the stalls and extrudes a 3' flap that can be cleaved endonucleolytically by the of Pol II itself, generating entry points for exonucleases if the primary pathway is impaired. This backtracked cleavage, enhanced by factors like TFIIS, prevents persistent stalling and ensures termination even in the absence of poly(A)-dependent processing, as observed in some non-coding transcripts or under mutagenic conditions. These redundant pathways highlight the robustness of Pol II termination, minimizing read-through transcription that could interfere with downstream .

Pol I- and Pol III-specific termination

In eukaryotic cells, termination of RNA polymerase I (Pol I) transcription, which synthesizes the majority of ribosomal RNA (rRNA), relies on sequence-specific roadblock factors that pause the elongating polymerase. In mammals, the transcription termination factor TTF-I binds to Sal box sequences—short DNA motifs (11–18 bp) located in the intergenic spacer downstream of the pre-rRNA coding region—inducing Pol I pausing and subsequent dissociation of the transcription complex. Multiple Sal box elements (T1–T10) enhance termination efficiency and may facilitate gene looping, bringing terminators into proximity with upstream promoters to promote Pol I recycling and reinitiation. In yeast, the analogous mechanism involves Reb1 (or its homolog Nsi1) binding to terminator sites in the ribosomal DNA intergenic spacer, such as the T1 site ~93 nucleotides downstream of the 25S rRNA gene, which pauses Pol I. Efficient termination requires the 5′–3′ exonuclease Rat1 (Xrn2 in mammals), recruited following endonucleolytic cleavage of the nascent pre-rRNA by Rnt1 at sites in the 3′ external transcribed spacer; Rat1 then degrades the RNA from the 5′ end, "torpedoing" the paused polymerase and promoting its release. Without Rat1 activity, Pol I often reads through the primary terminator, relying on secondary sites like T2 or the replication fork barrier for eventual dissociation. RNA polymerase III (Pol III), responsible for transcribing small non-coding RNAs such as tRNAs and 5S rRNA, employs an mechanism independent of additional protein factors in many cases. Termination occurs at stretches of 4–6 thymidines (oligo-dT) in the non-template strand, corresponding to oligo-dA in the template, which produce a run of uridines (oligo-U) at the RNA 3′ end; the weak rU:dA base pairs destabilize the RNA: hybrid, causing Pol III to stall and dissociate without precise cleavage. Efficiency depends on the length and flanking sequences, with at least 4 Ts sufficient in vertebrates and 5 or more in , often resulting in heterogeneous oligo-U tracts of variable length. In , the ATP-dependent Sen1 enhances Pol III termination by binding to the stalled complex and unwinding the RNA:DNA hybrid, facilitating polymerase displacement and transcript release, particularly for genes with suboptimal oligo-dT signals; this acts as a mechanism to prevent transcription. Sen1's role underscores a conserved function across polymerases, though Pol III termination remains more direct than the coupled cleavage-exonuclease processes in Pol I and Pol II.

Transcription Regulation

Chromatin remodeling and epigenetic control

Chromatin remodeling and epigenetic modifications play crucial roles in regulating access to DNA for transcription in eukaryotes by dynamically altering nucleosome structure and stability. These processes enable the transition between condensed, transcriptionally repressive chromatin (heterochromatin) and open, accessible states (euchromatin), thereby controlling the recruitment of RNA polymerase II and associated factors. Epigenetic marks, including covalent modifications on histones and DNA, provide heritable information that influences transcriptional states without altering the underlying genetic sequence. Histone acetylation, catalyzed by histone acetyltransferases (HATs) such as p300 and CBP, neutralizes the positive charge on residues, reducing the affinity between histones and negatively charged DNA to promote an open conformation conducive to transcription activation. For instance, acetylation of and H4 tails at promoter regions facilitates the binding of bromodomain-containing proteins that further stabilize the transcriptional machinery. In contrast, histone methylation exhibits context-dependent effects: trimethylation of at 4 (H3K4me3), deposited by SET1/MLL complexes, marks active promoters and enhancers, correlating with high levels of transcription by recruiting readers like TAF3 in the preinitiation complex. Conversely, trimethylation of H3 at 27 (H3K27me3), mediated by the Polycomb Repressive Complex 2 (PRC2) containing , enforces gene repression by compacting and inhibiting polymerase progression, particularly at developmental loci. ATP-dependent complexes, such as the family, utilize the energy from to slide, eject, or restructure , thereby exposing promoter regions for transcriptional initiation. The complex, featuring the BRG1 or BRM subunits, disrupts -DNA contacts to facilitate activator binding and recruitment at target genes. Complementarily, the ISWI family remodelers, including complexes like ACF and CHRAC, promote regular spacing and array assembly, which supports both activation and repression by maintaining ordered architecture that influences access. These remodelers often cooperate with modifications to fine-tune dynamics during . DNA methylation, primarily at cytosine residues in CpG dinucleotides, contributes to transcriptional silencing by recruiting methyl-CpG-binding domain proteins that compact and exclude transcription factors, with dense methylation at promoter-proximal CpG islands strongly correlating with gene repression. Demethylation is facilitated by ten-eleven translocation () enzymes, which oxidize to and further intermediates, enabling active removal of repressive marks and potential reactivation of silenced loci. This balance ensures stable yet reversible control over transcriptional accessibility. Epigenetic inheritance of these modifications occurs during , where parental with marks like or are randomly segregated to daughter strands, and histone chaperones such as CAF-1 facilitate the propagation of these patterns through read-write mechanisms involving methyltransferases that restore marks on newly synthesized . This semi-conservative process maintains transcriptional memory across cell divisions, ensuring lineage-specific profiles. These mechanisms collectively influence preinitiation complex assembly by modulating accessibility at promoters.

Gene-specific activation and enhancers

Gene-specific activation in eukaryotic transcription is primarily mediated by sequence-specific transcription activators that bind to enhancer regions, distal DNA sequences that boost the activity of target promoters. These activators recruit the transcriptional machinery to specific genes, enabling precise control over expression in response to cellular signals. Enhancers can function over long distances, often tens to hundreds of kilobases away from their target promoters, and their activity is orientation- and position-independent. Transcription activators typically consist of a (DBD) and an activation domain (AD). Common DBD motifs include , which coordinate ions to stabilize interactions with DNA bases, as seen in the , and motifs, where an recognizes the major groove of DNA, exemplified by homeodomain proteins. Notable examples include , a Rel family member with a Rel domain featuring a variant structure that binds κB sites to activate genes, and , which uses a -based DBD to recognize response elements in genes involved in arrest and . Co-activators bridge activators to the core transcriptional machinery, enhancing recruitment and activity. The complex, a large multi-subunit co-activator, interacts with activator ADs via its head, middle, and tail modules and binds the C-terminal domain (CTD) of (Pol II) to stabilize pre-initiation complex formation. p300, a (HAT), serves as another key co-activator by acetylating histones to loosen and directly interacting with activators like to facilitate Pol II recruitment; its HAT activity is essential for enhancer function. Enhancer-promoter communication often involves chromatin looping, where enhancers are brought into physical proximity with promoters. The Mediator complex, in conjunction with cohesin, facilitates these loops by tethering enhancer-bound activators to promoter-bound Mediator, while CTCF proteins at loop boundaries insulate domains to prevent inappropriate interactions. This looping mechanism enables enhancers to contact promoters across large genomic distances, as demonstrated in studies of developmental genes where cohesin extrusion along DNA fibers positions enhancers near targets. Super-enhancers represent clusters of multiple enhancers bound by high densities of activators, , and Pol II, driving robust expression of genes critical for cell identity, such as those encoding master transcription factors in embryonic stem cells. These regions span larger genomic areas (typically >12.5 kb) with elevated levels of histone acetylation and protein occupancy compared to typical enhancers, leading to heightened transcriptional output and sensitivity to perturbations. However, the concept of super-enhancers has been subject to debate, with some researchers arguing they represent a quantitative rather than qualitative difference from typical enhancers.

Gene-specific repression and silencers

Gene-specific repression in eukaryotic transcription involves targeted mechanisms that inhibit the expression of particular genes, often through sequence-specific DNA-binding proteins known as . These directly bind to regulatory elements near target genes, preventing the assembly or activity of the transcription initiation complex. A prominent example is the element-1 silencing (REST), also known as neuron-restrictive silencer factor (NRSF), which binds to repressor element-1 (RE1) sites in the promoters of neuronal genes, thereby blocking their transcription in non-neuronal cells to maintain cell-type specificity during development. Another mode of repression, termed quenching, occurs when repressors compete with activators for binding to overlapping or adjacent DNA sites, thereby reducing activator occupancy and transcriptional activation without directly altering chromatin structure. This competitive quenching is particularly evident in short-range regulatory contexts, such as in Drosophila loci where repressors like those in the even-skipped stripe 2 enhancer displace activators to fine-tune spatial gene expression patterns. Co-repressors enhance the repressive activity of DNA-binding repressors by recruiting chromatin-modifying enzymes that compact nucleosomes and inhibit access. deacetylases (HDACs), such as and HDAC2, are key co-repressors that remove acetyl groups from , promoting a closed conformation that silences transcription. The Sin3 complex, a multi-subunit assembly containing HDACs, acts as a scaffold for these interactions; for instance, recruits the Sin3-HDAC complex to RE1 sites, amplifying repression of neuronal genes through deacetylation of H3 and H4. Silencers are cis-regulatory DNA elements that mediate long-range repression, distinct from promoters and enhancers, by insulating genes from activating signals or imposing heritable silencing. Insulator elements, such as those bound by , block enhancer-promoter interactions by forming loops that physically separate regulatory domains, thereby preventing inappropriate activation in eukaryotic genomes. Polycomb repressive complexes (PRCs) associate with silencers to enforce stable repression; PRC2 catalyzes trimethylation of at lysine 27 (), which recruits PRC1 to monoubiquitinate at lysine 119 (H2AK119ub), compacting and inhibiting Pol II elongation at developmental genes like Hox clusters. A major class of eukaryotic repressors featuring silencer-like activity includes KRAB-containing proteins (KRAB-ZFPs), the largest family of transcription factors in vertebrates, which bind specific DNA motifs via their C2H2 domains. The KRAB domain recruits co-repressors like KAP1 (KRAB-associated protein 1), which in turn assembles HDACs and (HP1) to silence endogenous retroviruses and developmental genes, often over long distances through spreading. Unlike bacterial , which coordinates population-level responses via diffusible signals, KRAB-ZFPs provide precise, cell-intrinsic repression tailored to eukaryotic complexity, such as in transposable elements to maintain genomic .

Post-initiation control of elongation and termination

Post-initiation control in eukaryotic transcription primarily modulates the progression of (Pol II) through elongation and ensures precise termination, thereby fine-tuning levels and preventing aberrant transcription. After promoter escape, Pol II encounters regulatory checkpoints that influence its processivity, such as inhibitory complexes that maintain a paused state or promote premature termination in non-productive regions. These mechanisms integrate signals from cellular conditions to activate productive elongation or enforce termination fidelity, often coupling transcription with downstream processing events. A key regulator of elongation is the positive transcription elongation factor b (P-TEFb), a complex comprising CDK9 and T that phosphorylates the C-terminal domain (CTD) of Pol II at serine 2, as well as negative elongation factors DSIF and NELF, to facilitate pause release and productive . P-TEFb activity is tightly controlled through sequestration in an inhibitory complex with HEXIM1 or HEXIM2 proteins and the 7SK small nuclear ribonucleoprotein (), where HEXIM binds 7SK to expose domains that inhibit CDK9 activity, thereby preventing premature activation. Release of P-TEFb from this complex occurs upon recruitment by activators like or Tat in viral contexts, allowing and progression; this dynamic inhibition-activation cycle ensures responds to developmental or signals. Attenuation mechanisms involve premature termination of Pol II shortly after , particularly in promoter-proximal or non-coding regions, to silence spurious transcription and maintain genomic stability. For instance, promoter upstream transcripts (PROMPTs), short non-coding transcribed bidirectionally from sequences ~1-2 kb upstream of protein-coding gene start sites, are rapidly terminated via the torpedo pathway, where the XRN2 degrades the RNA and displaces Pol II after cleavage by the cleavage and specificity factor (CPSF). This process is widespread, affecting over 50% of genes, and helps prevent antisense or unproductive transcripts from accumulating. Termination fidelity is enhanced by coupling with 3' end processing, where the complex plays a pivotal role in snRNA genes by recognizing the snRNA promoter and cleaving nascent transcripts ~20-40 nucleotides downstream of the mature 3' end, triggering Pol II release through recruitment of termination factors like XRN2. For protein-coding genes, termination often involves alternative (polyA) site choice, where Pol II processivity and CTD influence selection among tandem polyA signals; high rates favor distal sites for longer 3' UTRs, while pausing or promotes proximal cleavage, linking termination efficiency to mRNA isoform diversity and stability. Antisense transcription can interfere with sense strand elongation and termination by converging Pol II complexes that collide, leading to backtracking, stalling, or premature dissociation of the sense polymerase, particularly in head-on orientations that exacerbate roadblock effects. This interference is condition-specific, as seen in where antisense RNAs repress sense genes by extending into promoters and altering , thereby modulating elongation rates without direct sequence complementarity.

Specialized Transcription Processes

Transcription-coupled DNA repair

Transcription-coupled nucleotide excision repair (TC-NER) is a specialized subpathway of (NER) that preferentially repairs lesions on the transcribed strand of active genes in eukaryotes. When (Pol II) encounters a bulky lesion, such as a cyclobutane induced by (UV) light, transcription elongation stalls, triggering the recruitment of repair factors to resolve the blockage and restore transcription. This process ensures that actively transcribed genes are repaired more rapidly than non-transcribed regions, prioritizing the maintenance of fidelity. Central to TC-NER initiation are the Cockayne syndrome proteins B (CSB, encoded by ERCC6) and A (CSA, encoded by ERCC8), which form a complex that recognizes the stalled Pol II and facilitates the assembly of the repair machinery. CSB, an SNF2 family ATPase, binds directly to the stalled polymerase and promotes its , a where Pol II reverses along the DNA template to extrude the 3' end of the nascent RNA and expose the DNA lesion for verification and incision. CSA, a WD40-repeat protein, acts as an adaptor that ubiquitinates and recruits additional factors, including the TFIIH and core NER proteins like XPA, to the site, enabling dual incision of the damaged strand approximately 20-30 nucleotides away from the lesion. This step is crucial, as it provides physical access to the lesion without dissociating Pol II from the DNA, and in some contexts, factors like Rad51 may stabilize the reversed complex to prevent collapse into double-strand breaks. Recent advances as of 2024 have elucidated transcription-coupled repair of DNA-protein crosslinks (DPCs), where stalled Pol II recruits factors for DPC , and coordination with repair-independent eviction via the p97-proteasome pathway to maintain genome stability. Mutations in CSA or CSB genes disrupt TC-NER, leading to , a rare autosomal recessive disorder characterized by severe neurological dysfunction, premature aging, and extreme sensitivity to UV-induced DNA damage due to inefficient repair of transcription-blocking lesions. Patients with exhibit hyperphotosensitivity and defective recovery of RNA synthesis following UV exposure, underscoring the pathway's role in cellular viability. Beyond canonical TC-NER, non-canonical transcription-coupled (TC-BER) operates for oxidative lesions, where Pol II stalling recruits DNA glycosylases such as NEIL1 and NEIL2 to initiate repair of base modifications like on the transcribed strand. This coupling enhances the efficiency of repairing transcription-impeding oxidative damage in eukaryotes.

Dysregulation of transcription in cancer

Dysregulation of eukaryotic transcription plays a central role in oncogenesis by altering patterns that promote uncontrolled , survival, and . and aberrant activity in (Pol II) components, including its C-terminal domain (CTD), contribute to these changes. In cancer, Pol II factors such as POLR2A (encoding the RPB1 subunit with the CTD) are frequently overexpressed, driving tumor progression in gastric, ovarian, and cells through enhanced transcriptional output and cell cycle advancement. Additionally, oncogenic super-enhancers hijack Pol II machinery to amplify proto-oncogenes like , forming large clusters of enhancers that sustain high MYC expression in and other malignancies, thereby promoting transcriptional addiction and tumor maintenance. Epigenetic alterations further dysregulate transcription in cancer, often manifesting as global DNA hypomethylation that reactivates silenced oncogenes and transposons, alongside focal hypermethylation of tumor suppressor promoters. This imbalance disrupts chromatin accessibility and Pol II recruitment, fostering a pro-tumorigenic transcriptional landscape common across various cancer types. Histone deacetylase (HDAC) inhibitors counteract these changes by increasing histone acetylation to restore normal gene expression; vorinostat (Zolinza), approved by the FDA in 2006 for cutaneous T-cell lymphoma, exemplifies this therapeutic approach by inducing cell cycle arrest and apoptosis in transformed cells. Fusion proteins from chromosomal translocations exemplify direct interference with transcriptional activation. In (), the aberrantly recruits corepressors to response elements, blocking Pol II-mediated activation of genes and maintaining the leukemic state; all-trans reverses this by relieving repression and promoting of the . Recent advances, particularly post-2020, highlight vulnerabilities in transcriptional elongation as therapeutic targets. Genome-wide / screens in patient-derived stem cells have identified dependencies on Pol II elongation factors, such as those in the super elongation complex, revealing that inhibiting YY1-driven elongation machinery not only impairs tumor growth but also activates responses to potentiate anti-PD-1 . Similarly, screens have uncovered roles for Pol II pausing factors like NELF in enhancing + T cell anti-tumor immunity, where NELF suppression boosts effector functions and tumor infiltration, suggesting pausing modulation as a to improve efficacy in solid tumors. Post-2022 developments include enhancer reprogramming as a driver of transcriptional dysregulation in diverse cancers and transcription-replication conflicts leading to oncogenic mutations, opening avenues for targeted therapies like inhibiting ubiquitinated oncogenic transcription factors (as of 2025).

Comparisons with Prokaryotic Transcription

Differences in machinery and initiation

Eukaryotic transcription employs three distinct s— (Pol I), II (Pol II), and III (Pol III)—each specialized for transcribing specific classes of genes, such as ribosomal RNAs by Pol I, protein-coding messenger RNAs by Pol II, and transfer RNAs and by Pol III. In contrast, prokaryotes utilize a single (RNAP) enzyme, primarily the housekeeping σ70-RNAP holoenzyme, which handles all transcription needs. This multiplicity in eukaryotes reflects the compartmentalization of transcription within the and the need to coordinate diverse programs, whereas the singular prokaryotic RNAP enables rapid, unified responses to environmental cues. A key distinction lies in the auxiliary factors required for promoter recognition and assembly. Eukaryotic Pol II transcription depends on general transcription factors (), including TFIID, TFIIA, TFIIB, TFIIF, TFIIE, and TFIIH, which collectively form the preinitiation complex (PIC) to recruit and position the at promoters. Prokaryotes, however, rely on (σ) factors that associate transiently with the RNAP core to form the holoenzyme, with σ70 being the primary factor for most housekeeping genes in bacteria like Escherichia coli. These σ factors directly recognize promoter sequences without the need for an extensive multi-component assembly, allowing for simpler and faster initiation compared to the GTF-mediated process in eukaryotes. Initiation in eukaryotes involves a multi-step, ordered of the PIC on the core promoter. The process begins with TFIID binding to promoter elements via its (TBP) subunit, followed by recruitment of TFIIA and TFIIB, then Pol II pre-associated with TFIIF, and finally TFIIE and TFIIH to melt the DNA and form the open complex. This stepwise recruitment ensures precise positioning but introduces regulatory checkpoints absent in prokaryotes, where the pre-formed holoenzyme binds directly to the promoter, scans for the start site, and initiates without intermediate . Notably, while many eukaryotic promoters feature a recognized by TBP, prokaryotic promoters lack a true TATA equivalent in the eukaryotic sense, relying instead on conserved motifs for holoenzyme docking. Promoter architecture further highlights these differences. Eukaryotic promoters consist of a core region (often including the , initiator, and downstream elements) proximal to the transcription start site, augmented by distal enhancers that loop to interact with the core via mediator complexes for combinatorial regulation. Prokaryotic promoters, by comparison, are more compact, defined primarily by the -35 box (TTGACA) and -10 box (TATAAT) sequences recognized by σ70, with minimal reliance on distant regulatory elements. This core-only simplicity in prokaryotes supports polycistronic operons, whereas eukaryotic enhancers enable tissue-specific and signal-responsive expression across large genomic distances.

Differences in regulation and termination

Eukaryotic transcription regulation is markedly more complex than prokaryotic, incorporating layers such as chromatin remodeling and epigenetic modifications that control DNA accessibility for RNA polymerase II (Pol II). In contrast, prokaryotes primarily rely on operons—clusters of genes transcribed as polycistronic mRNAs under a single promoter—to coordinate expression of related functions, with attenuation mechanisms allowing premature termination based on translation rates. Eukaryotes employ distal enhancers, which are cis-regulatory DNA sequences that loop to promoters via mediator complexes and transcription factors to activate gene-specific expression over long distances. Prokaryotes, lacking such spatial separation, use riboswitches—RNA elements in the nascent transcript that sense metabolites and alter secondary structure to modulate transcription or translation without requiring additional proteins. This fundamental difference in logic stems from the need for eukaryotes to regulate diverse cell types and developmental stages, whereas prokaryotic regulation emphasizes rapid responses to environmental cues through direct promoter-operator interactions.81693-4) Transcription termination in eukaryotes for Pol II-transcribed genes follows the torpedo model, where cleavage and specificity factor (CPSF) recognizes signals, triggering endonucleolytic cleavage of the nascent ; a 5'-3' (Xrn2 in mammals) then degrades the downstream fragment, catching up to Pol II and promoting its release via the rat1/Xrn2-dependent pathway. Prokaryotic termination, by comparison, occurs via two main mechanisms: Rho-dependent, where the Rho binds to unstructured and translocates 5'-3' to dissociate from the template, or intrinsic (Rho-independent), relying on GC-rich loops in the followed by a uridine-rich tract that weakens polymerase-DNA interactions. Unlike eukaryotes, prokaryotes lack co-transcriptional events such as capping, splicing, or during elongation, as transcription and are coupled in the without nuclear barriers. The evolutionary divergence in these processes is largely attributed to nuclear compartmentalization in eukaryotes, which physically separates transcription from , allowing for intricate post-transcriptional controls and enabling the of complex multicellularity through fine-tuned . In prokaryotes, the absence of a facilitates immediate of nascent mRNAs, prioritizing efficiency over the layered regulatory depth seen in eukaryotes.81693-4) This compartmentalization thus underpins the greater regulatory sophistication in eukaryotes, accommodating larger genomes and diverse cellular functions.