Fact-checked by Grok 2 weeks ago

RNA polymerase II

RNA polymerase II (Pol II) is a multi-subunit complex in eukaryotic cells that catalyzes the transcription of protein-coding into (mRNA) and synthesizes many non-coding critical for and cellular processes. Composed of 12 subunits with a total molecular weight of approximately 514 kDa, Pol II forms a core structure featuring a central cleft that binds the DNA template and emerging RNA chain during synthesis. The largest subunit, known as RPB1, bears a distinctive C-terminal domain (CTD) consisting of tandem heptapeptide repeats with the consensus sequence YSPTSPS—26 repeats in yeast and 52 in humans—which undergoes phosphorylation and other modifications to orchestrate transcription phases and recruit processing factors. Pol II-mediated transcription proceeds through initiation at gene promoters via assembly of the preinitiation complex with general transcription factors like TFIID and TFIIH, followed by promoter DNA melting and RNA synthesis; elongation, where it navigates nucleosomes with aid from factors such as P-TEFb; and termination, often involving cleavage and polyadenylation signals. This process is dynamically regulated by associated complexes, including for enhancer-promoter communication and DSIF/NELF for promoter-proximal pausing, enabling precise control of in response to developmental and environmental cues.

History

The of RNA polymerase II (Pol II) emerged from efforts to understand the multiplicity of DNA-dependent RNA in eukaryotic cells during the late 1960s. In 1969, Robert G. Roeder and William J. Rutter developed a method to fractionate extracts from eukaryotic cells, including embryos and liver, revealing multiple distinct enzymatic activities responsible for RNA . These were separated based on ion-exchange and differential sedimentation properties, with early designations including forms I and II in liver according to elution order; forms A, B, and C were used in subsequent studies, with form A (later Pol I) associated with the , form B (Pol II) with the nucleoplasm, and form C (Pol III) as a soluble initially thought to be cytoplasmic based on extraction but later confirmed nuclear. Subsequent 1970 work by Roeder confirmed three forms in mammalian cells such as human KB cells. Form B, later standardized as Pol II, was characterized as the nucleoplasmic primarily responsible for synthesizing heterogeneous nuclear RNA (hnRNA), the large precursor transcripts that undergo processing to form (mRNA). Key experiments utilized inhibitors to confirm the functional roles and distinctions among these polymerases. Actinomycin D, which intercalates into DNA and blocks elongation by all DNA-dependent RNA polymerases, was employed to verify that the observed activities were indeed template-dependent and not due to contaminating RNA or other non-specific processes. This helped distinguish the eukaryotic polymerases from bacterial systems and confirmed their reliance on double-stranded DNA templates. Further differentiation came from sensitivity to , a toxin derived from the mushroom . In 1970, Thomas J. Lindell and colleagues demonstrated that low concentrations of (approximately 10 ng/ml) specifically inhibited the nucleoplasmic form B (Pol II) activity by over 90%, while higher concentrations were required to affect the nucleolar form A (Pol I), and form C (later Pol III) showed partial resistance. This pharmacological specificity provided a critical tool for assigning Pol II to hnRNA/mRNA synthesis , as inhibition correlated with reduced production of polyadenylated nuclear RNAs destined for export as mRNA. Early nomenclature varied across studies, with Roeder and Rutter initially referring to the enzymes as polymerases A, B, and C based on chromatographic behavior, a rooted in prior bacterial and work. This lettering persisted in some literature (e.g., polymerase B for Pol II), but by the mid-1970s, the field adopted the Roman numeral system—Pol I, II, and III—for clarity, reflecting their sequential discovery and functional specialization: Pol I for , Pol II for mRNA precursors, and Pol III for small RNAs like tRNA and 5S rRNA. These findings established Pol II as the central enzyme for in eukaryotes, laying the foundation for subsequent biochemical and genetic studies.

Purification and Characterization

The purification of RNA polymerase II (Pol II) from eukaryotic tissues in the 1970s marked a pivotal advancement in understanding its biochemical properties, with seminal work by the groups of Pierre Chambon and Robert G. Roeder. Chambon's team isolated Pol II from calf thymus using to resolve it from Pol I and Pol III based on differing elution profiles, followed by additional steps like to achieve near-homogeneity. Similarly, Roeder and colleagues purified Pol II from cells and rat liver nuclei, employing to separate the enzyme classes and demonstrating its localization in the nucleoplasm. Subsequent refinement of purification protocols incorporated heparin-Sepharose chromatography, leveraging Pol II's strong binding to heparin under low-salt conditions, and velocity sedimentation or glycerol gradient centrifugation to fractionate based on the enzyme's ~15–20 S sedimentation coefficient. These techniques enabled the isolation of intact Pol II preparations, with early estimates placing the core enzyme's molecular weight at approximately 500 kDa through sucrose density gradient analysis and SDS-PAGE of subunits. Pol II preparations were sensitive to divalent cations, requiring Mg²⁺ (optimal at 5–10 mM) or Mn²⁺ for activity, and absolutely dependent on double-stranded DNA templates and all four NTPs for RNA synthesis. Functional characterization via transcription assays highlighted Pol II's distinct properties compared to bacterial counterparts; purified Pol II could elongate RNA chains but failed to initiate de novo on nonspecific DNA templates without accessory factors, producing only short transcripts or requiring nicked DNA for primer-dependent activity. This dependency underscored the complexity of initiation. A key insight from these studies was the existence of multiple Pol II forms, including the unphosphorylated IIA (migrating faster on gels) and phosphorylated IIO variants, identified through and of extracts from calf thymus and cells; the IIO form exhibited heightened sensitivity to α-amanitin and predominated in transcriptionally active nuclear preparations.

Structure

Subunits

Eukaryotic RNA polymerase II (Pol II) is a multi-subunit enzyme complex composed of 12 core subunits, designated RPB1 through RPB12, which together form the catalytic center responsible for synthesizing messenger RNAs from DNA templates. These subunits assemble into a crab-claw-like structure with two major lobes connected by a central cleft that accommodates the DNA-RNA hybrid and facilitates nucleotide addition. The core is evolutionarily conserved, with high sequence homology across eukaryotes such as yeast (Saccharomyces cerevisiae) and humans, exceeding 80% for key catalytic regions, though surface residues show greater divergence to accommodate species-specific regulatory interactions. The largest subunit, RPB1 (approximately 220 kDa in humans), forms one lobe of the enzyme and houses critical elements of the , including the bridge helix and trigger loop that coordinate substrate selection and formation with two magnesium ions. RPB1 also contributes to the wall of the RNA exit channel and the dock region for binding transcription factors. RPB2, the second-largest subunit, constitutes the other lobe and includes the clamp , which grips the downstream DNA duplex via switch regions to maintain processivity during elongation; it also participates in the catalytic center by positioning the -DNA hybrid and the second magnesium ion. Together, RPB1 and RPB2 create the narrow DNA- channel, ensuring precise alignment for incorporation. RPB3 and RPB11 form a heterodimer that bridges the two lobes, stabilizing the overall architecture and facilitating interactions with general transcription factors during pre-initiation complex assembly. The RPB4/RPB7 subcomplex acts as a mobile stalk protruding from the upstream face of the core, dissociating during promoter opening and reassociating to support initiation and termination; this module is a eukaryotic absent in bacterial polymerases, enhancing regulatory flexibility. Smaller subunits, including RPB5, RPB6, RPB8, RPB9, RPB10, and RPB12 (ranging from ~7-20 ), primarily contribute to structural stability, DNA/ binding, and inter-polymerase shared functions; for instance, RPB5 coordinates cleft opening for DNA entry, while RPB9 modulates elongation rates and transcript cleavage. These subunits are essential for enzyme integrity, with RPB10 and RPB12 also present in RNA polymerases I and III. Mutations in core subunits can disrupt Pol II fidelity, particularly in the . For example, the E1103G substitution in the RPB1 trigger loop stabilizes its closed conformation, reducing mismatch detection and increasing transcription errors by up to 10-fold in assays. Similarly, RPB9 , such as those altering its zinc-binding domain, impair and elevate error rates during , linking subunit integrity to accurate .

Assembly and Architecture

RNA polymerase II (Pol II) assembles through a stepwise pathway in the of cells, ensuring the proper integration of its 12 subunits into a functional . The process initiates with the synthesis and nuclear import of the largest subunit, Rpb1, which forms an early subassembly with Rpb6 and Rpb10; Rpb2 and Rpb5 then join to create a comprising Rpb1–Rpb5 along with associated smaller subunits. This expands by incorporating the Rpb3 subassembly (Rpb3, Rpb11, and Rpb10), followed by Rpb8, Rpb9, and Rpb12, yielding the 10-subunit ; the peripheral heterodimer Rpb4–Rpb7 associates last to complete the holoenzyme. This nuclear assembly pathway, distinct from cytoplasmic maturation of ribosomal subunits, relies on chaperone proteins like for subunit stabilization and involves quality-control mechanisms to degrade incomplete intermediates. The overall architecture of Pol II resembles a crab , with a central cleft (~25 wide) that cradles the DNA-RNA hybrid during transcription; the upper jaw consists of the mobile (primarily from Rpb1 and Rpb2), while the lower jaw is formed by the wall (Rpb1, Rpb5, Rpb9) and lobe (Rpb2, Rpb5) , connected by a funnel above the . The measures approximately 150 in height and 140 in width, enabling it to encase ~15–17 base pairs of downstream DNA in the cleft. Pore structures at the base, including the secondary , facilitate entry of triphosphates (NTPs) to the , while the (from Rpb1) and (from Rpb2) loops protrude into the cleft to separate the nascent RNA from the template DNA, directing RNA exit through a dedicated . Recent cryo-EM structures resolved between 2023 and 2025 have illuminated dynamic aspects of Pol II architecture, particularly the trigger loop's conformational flexibility in the and epistatic interactions among residues that fine-tune . The trigger loop, part of Rpb1, alternates between open and closed states to accommodate NTP binding and formation, with bridge helix (also Rpb1-derived) kinking to propel translocation. During , the clamp oscillates between open (relaxed DNA grip) and closed (secure hybrid binding) states, modulating processivity without external factors. These insights underscore how architectural modularity supports Pol II's catalytic versatility across eukaryotic genomes.

C-terminal Domain

The C-terminal domain (CTD) of RPB1, the largest subunit of RNA polymerase II, is composed of tandem heptapeptide repeats with the Tyr-Ser-Pro-Thr-Ser-Pro-Ser (YSPTSPS). This repetitive structure was first identified through sequencing of the RPB1 gene in and , revealing its unusual composition dominated by serines and prolines. In , the CTD contains 26 repeats, while in mammals including humans and mice, it has 52 repeats, with variations in repeat number observed across eukaryotes from as few as 5 in some protists to over 60 in certain metazoans. In isolation, the CTD adopts an unstructured, intrinsically disordered conformation, lacking stable secondary or tertiary elements. Structurally, the CTD manifests as an extended, flexible polymeric tail that protrudes from the globular of RNA polymerase II, enabling dynamic interactions during transcription. Evolutionarily, the proximal repeats (closer to the ) exhibit strong across , supporting essential basal transcription functions, whereas distal repeats display greater and variability, facilitating species-specific regulatory adaptations. The biophysical properties of the CTD arise from its high (positions 3 and 6) and serine (positions 2, 5, and 7) content, which promotes intrinsic disorder, polyelectrolyte behavior, and tendencies under physiological conditions. Notably, CTD positively correlates with organismal complexity and , suggesting an adaptive role in scaling transcriptional output across evolutionary lineages. Mutations altering CTD integrity, such as truncations, profoundly impact viability; for instance, partial deletions in result in cold-sensitive and transcription defects, underscoring the domain's indispensability. In humans, frameshift mutations in POLR2A that truncate the CTD—such as one reducing the repeat count by 20—cause a heterogeneous featuring profound , intellectual disability, seizures, and transcriptional dysregulation.

Transcription Machinery

Holoenzyme

The RNA polymerase II (Pol II) holoenzyme is defined as the core Pol II enzyme stably associated with the SRB/ complex, forming a large macromolecular essential for regulated transcription of protein-coding genes. This complex integrates the 12-subunit core Pol II with the multi-subunit , enabling the polymerase to respond to transcriptional activators and enhancers. The concept of the holoenzyme emerged from biochemical purifications in , where it was isolated as a functional unit capable of activator-stimulated transcription . The SRB proteins, core components of , were discovered in the early 1990s through genetic screens in for suppressors of growth defects caused by truncations in the C-terminal domain (CTD) of the RPB1 subunit of Pol II. These screens identified SRB2, SRB4, SRB5, and SRB6 as dominant suppressors that restored viability and transcriptional activity, leading to the purification of a ~0.5–1 MDa multisubunit complex tightly bound to Pol II and containing these SRB proteins along with (TBP). The full complex, now known to comprise approximately 21–25 subunits in yeast (organized into head, middle, tail, and optional kinase modules), bridges the core Pol II to activators by providing binding surfaces for regulatory factors, thereby coupling promoter recognition to polymerase recruitment. In total, the yeast holoenzyme encompasses ~30–35 subunits, with a molecular weight exceeding 1.2 MDa. Formation of the holoenzyme involves stable interactions primarily mediated by the Gal11/Srb module in the tail domain in , where Gal11 anchors the complex to Pol II via contacts with the RPB1 and RPB2 subunits. This association is dynamic , with Pol II forming the holoenzyme for transcription . In mammalian cells, analogous holoenzymes form through conserved Mediator-Pol II interfaces, but exhibit variations such as the integration of the CDK8 module (comprising CDK8, C, MED12, and MED13), which can reversibly associate and phosphorylate CTD residues to fine-tune activation or repression. These structural differences reflect adaptations to higher eukaryotic gene regulation, though the core bridging function remains conserved. The holoenzyme functions to increase transcription re-initiation efficiency, allowing Pol II to undergo multiple rounds of promoter clearance and without requiring full re-assembly of transcription factors, as demonstrated in assays where holoenzyme supports higher rates of activated transcription compared to core Pol II alone. Recent cryo-EM structures (as of 2025) reveal that many PIC components, including , can be retained after promoter escape, facilitating rapid re-initiation and transcriptional bursting. It also stabilizes the pre-initiation complex () at promoters by reinforcing activator--Pol II interactions, thereby enhancing overall processivity and fidelity of mRNA synthesis.

Pre-initiation Complex Assembly

The pre-initiation complex (PIC) assembles at eukaryotic promoters to position RNA polymerase II (Pol II) for accurate transcription initiation, comprising Pol II and the general transcription factors TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIIH. TFIID serves as the core scaffold, containing the (TBP) that recognizes promoter DNA elements. These components collectively ensure promoter-specific recruitment and DNA unwinding, with Pol II often arriving as a pre-formed complex with TFIIF. Assembly proceeds in a stepwise manner, initiated by TFIID binding to the or adjacent core promoter motifs, bending the DNA to facilitate subsequent factor docking. TFIIA and TFIIB then associate, with TFIIA stabilizing TBP-DNA interactions and TFIIB bridging to Pol by recognizing the promoter melt site. Recruitment of the Pol -TFIIF module follows, where TFIIF tethers Pol to the upstream factors and positions its active center over the transcription start site. Finally, TFIIE and TFIIH integrate into the complex; TFIIE recruits TFIIH and modulates its activity, while TFIIH drives promoter opening through ATP-dependent DNA unwinding. Promoter melting requires energy from ATP hydrolysis by the TFIIH helicase subunit XPB (Ssl2 in yeast), which translocates along the nontemplate DNA strand to separate ~12-15 base pairs and form a transcription bubble centered around the start site. The XPD subunit contributes minimally to this process in transcription, primarily serving structural roles within TFIIH. This unwinding is stabilized by interactions between Pol II's clamp domain and the single-stranded DNA. PIC assembly varies by promoter architecture: TATA-containing promoters rely heavily on TBP-TATA interactions for efficient , whereas TATA-less promoters, common in vertebrates, depend on initiator (Inr) and downstream elements for TFIID recruitment, yet TBP still induces similar DNA bending in the PIC. Recent cryo-EM structures from 2024 have captured partial PIC intermediates, revealing dynamic conformational changes during TFIIE/TFIIH integration and highlighting branched assembly pathways that allow flexibility in factor ordering.

Transcription Cycle

Initiation

Initiation of transcription by (Pol II) begins immediately following the assembly of the (PIC) at the promoter, where the synthesizes the first few phosphodiester bonds to form short nascent transcripts. This phase involves two distinct pathways: , in which Pol II repeatedly synthesizes and releases short RNA oligomers (typically 2-10 ) without dissociating from the template, and productive initiation, where the polymerase transitions to stable elongation after synthesizing a longer . is characterized by scrunching, a process in which the downstream is unwound and pulled into the polymerase while the enzyme remains anchored at the promoter, allowing multiple short transcripts to be produced without promoter clearance. This mechanism ensures fidelity by testing promoter compatibility before committing to full transcription, with productive occurring when the nascent reaches a length sufficient for stable complex formation, often around 10-15 . A critical early modification during is the co-transcriptional addition of the 5' cap to the nascent pre-mRNA, which occurs when the RNA chain extends to approximately 20-30 . The complex, consisting of RNA triphosphatase, guanylyltransferase, and methyltransferase, is recruited to the phosphorylated C-terminal domain (CTD) of Pol II's largest subunit (RPB1), specifically binding to Ser5-phosphorylated heptapeptide repeats. This capping not only protects the nascent from but also facilitates promoter escape by stabilizing the early complex and promoting the recruitment of additional factors. Capping efficiency is enhanced by the pausing factors DSIF and NELF, which position the polymerase for timely modification during the promoter-proximal phase. Promoter clearance, the transition from initiation to early elongation, is driven by phosphorylation of the Pol II CTD at serine 5 (Ser5) by the TFIIH-associated kinase CDK7, which releases the polymerase from promoter-proximal contacts and allows isomerization of the clamp domain for stable RNA-DNA hybrid formation. This phosphorylation event, peaking at the transcription start site, also dissociates initiation factors like TFIIB and Mediator from the core promoter, enabling the holoenzyme to proceed downstream. Following clearance, Pol II typically pauses after synthesizing 20-60 nucleotides, mediated by the negative elongation factor NELF and DSIF (Spt4/5), which stabilize a backtracked conformation and inhibit further extension to poise genes for regulated activation. This promoter-proximal pausing is a widespread checkpoint in metazoans, ensuring coordinated recruitment of elongation factors before productive synthesis. Recent insights into transcriptional reveal that after the first round of and promoter clearance, Pol II can undergo rapid re- from the same promoter, generating bursts of multiple transcripts in quick succession followed by inactive periods. This bursty behavior, observed in reconstituted systems, is facilitated by lingering and transcription factors at the promoter, allowing efficient recycling of the for subsequent rounds without full disassembly. Enhancer-promoter looping, stabilized by complex bridging distal enhancers to the , further enhances efficiency by increasing local concentrations of activators and promoting multiple re-initiations per looping event.

Elongation

During the elongation phase of transcription, RNA polymerase II (Pol II) synthesizes RNA processively by catalyzing phosphodiester bond formation in a cyclical manner. The catalytic cycle begins with nucleotide selection in the active site, where the trigger loop—a mobile domain in the largest subunit Rpb1—closes over the incoming nucleoside triphosphate (NTP) that matches the DNA template base. This closure forms specific interactions with the NTP's base, sugar, and phosphates, ensuring high fidelity (approximately 10^{-4} to 10^{-5} error rate) by discriminating against mismatches and deoxyribonucleotides. The trigger loop's histidine residue (His1085) positions the NTP's β-phosphate to facilitate metal ion-mediated catalysis, leading to bond formation and pyrophosphate release. Following catalysis, translocation occurs as Pol II advances along the DNA, driven by conformational changes in the bridge helix—a conserved α-helical element in Rpb1 that spans the active site. The bridge helix straightens to propel the RNA-DNA hybrid forward, with its bending promoting the enzyme's forward movement at rates of 20-50 nucleotides per second (nt/s) in vitro on naked DNA templates. In vivo, effective elongation is slower, averaging 1-2 kb/min (∼17-33 nt/s) in yeast due to regulatory pauses, though post-pause rates approach in vitro speeds. Error correction involves backtracking, where mismatched nucleotides cause the bridge helix to bend at a conserved threonine (T831), fraying the RNA 3'-end and extruding it into the secondary channel for cleavage by Pol II's intrinsic RNase activity or cofactor TFIIS, restoring processivity. Recent structural genetics studies have revealed widespread within the trigger loop and adjacent regions, such as the bridge helix, that fine-tune and rate. For instance, gain-of-function like E1103G accelerate but reduce by altering residue networks, while loss-of-function variants like H1085Y impair , with epistasis quantified by deviation scores exceeding 1 for suppressive effects. Elongation efficiency is modulated by factors that overcome pausing and enhance processivity. The positive transcription elongation factor b (P-TEFb), comprising CDK9 and T, phosphorylates serine 2 of the C-terminal domain (CTD) and the negative elongation factor (NELF), releasing the promoter-proximal pause and converting DSIF (Spt4/Spt5) from a pausing complex to a processive one. Phosphorylation of Spt5's repeat region stabilizes the Pol II clamp on DNA, suppressing dissociation. Emerging 2025 research highlights spatial organization in nuclear compartments, where phase-separated condensates form around transcribing Pol II to co-regulate elongation of related genes. These dynamic foci, often involving DSIF and SPT6, facilitate nucleosome disassembly during chromatin traversal, ensuring efficient progression.

Termination

RNA polymerase II (Pol II) termination occurs at the 3' end of protein-coding genes, where the enzyme recognizes specific signals to halt transcription, cleave the nascent RNA, and dissociate from the DNA template, ensuring proper mRNA 3'-end formation and preventing read-through into downstream genes. This process is tightly coupled to 3'-end processing, involving cleavage and polyadenylation factors that recognize the polyadenylation signal, typically the AAUAAA motif located 10-30 nucleotides upstream of the cleavage site in the pre-mRNA. Recognition of this motif by the cleavage and polyadenylation specificity factor (CPSF) induces Pol II pausing, followed by endonucleolytic cleavage of the transcript by the cleavage stimulation factor (CstF) and associated factors, generating a 5'-capped upstream fragment for polyadenylation and a downstream fragment for degradation. Two primary models explain the termination mechanism: the torpedo model and the allosteric model, often operating in concert. In the torpedo model, prevalent in eukaryotes, the 5'-3' exoribonuclease Rat1 (in ) or Xrn2 (in mammals) is recruited to the cleaved downstream RNA via interactions with phosphorylated Ser2 residues on the Pol C-terminal domain (CTD-Ser2P) and factors like Rtt103 or Pcf11; this enzyme degrades the RNA toward Pol , displacing the polymerase from the DNA by invading the RNA-DNA and promoting . The allosteric model posits that binding of termination factors to CTD-Ser2P induces conformational changes in Pol , weakening the RNA-DNA and facilitating release, with CPSF and CstF playing key roles in recruiting additional factors like the Pcf11-Ctf2 complex to destabilize elongation. The subcomplex RPB4/7 of Pol contributes to termination efficiency, potentially by stabilizing factor interactions or aiding in at ends, as its depletion leads to transcription and altered 3'-end formation. Following termination, Pol II is released from the and recycled for re-initiation at promoters, a process facilitated by of the CTD and interactions with recycling factors that promote its relocation via gene looping or . Recent studies highlight how termination integrates with transcriptional bursting, where rapid Pol II release after short bursts enables high-frequency re-initiation. In non-coding RNAs, such as snoRNAs or lncRNAs, termination diverges from the poly(A)-dependent pathway, relying instead on the Sen1 (or SETX in humans) for torpedo-like or the for rapid cleavage, often independent of AAUAAA signals and CTD-Ser2P, to prevent pervasive transcription.

Regulation

CTD Phosphorylation

The C-terminal domain (CTD) of RNA polymerase (Pol II) undergoes reversible phosphorylation that transitions the enzyme from a hypophosphorylated form, designated Pol IIA, to a hyperphosphorylated form, Pol , marking progression through the transcription cycle. This process primarily targets serine 2 (Ser2), serine 5 (Ser5), and serine 7 (Ser7) within the heptapeptide repeats, with 1 (Tyr1) also serving as a regulatory site. The unphosphorylated IIA state predominates prior to , enabling promoter association, while hyperphosphorylation to IIO supports and processive transcription. Phosphorylation patterns are temporally regulated, with Ser5 phosphorylation peaking early during and promoter clearance, followed by progressive Ser2 phosphorylation during , and Ser7 phosphorylation aiding snRNA-specific . Tyr1 phosphorylation accumulates downstream of promoters, potentially inhibiting premature termination. These modifications are orchestrated by cyclin-dependent kinases (CDKs): CDK7, within the TFIIH complex, primarily phosphorylates Ser5 and Ser7 to facilitate escape from promoter-proximal pausing; CDK9, as part of positive transcription elongation factor b (P-TEFb), targets Ser2 to promote productive ; and additional kinases like CDK12/13 reinforce Ser2 marks in gene bodies. is mediated by phosphatases such as FCP1, which removes Ser2 and Ser5 phosphates at transcription termination sites to recycle Pol II for reinitiation. The phospho-CTD acts as a dynamic scaffold for recruiting RNA processing machinery, ensuring co-transcriptional maturation of nascent transcripts. Ser5 phosphorylation specifically binds the guanylyltransferase component of the capping enzyme complex, stimulating 5' cap addition shortly after transcription initiation to protect mRNA and enhance export. Ser5 and Ser2 phosphorylations cooperatively recruit splicing factors, such as the U1 snRNP via Prp40 for Ser5-P and U2 snRNP components for Ser2-P, promoting efficient intron removal. Ser2 phosphorylation further facilitates 3' end processing by interacting with cleavage and polyadenylation specificity factor (CPSF) and cleavage stimulation factor (CstF), enabling poly(A) tail addition and transcript release. Recent models from 2025 highlight how CTD dynamics underpin transcriptional , where alternating phospho-states on Ser5 and Ser2 modulate pause release, re-initiation, and burst by recruiting factors like SETD1A/B to suppress premature termination and enable rapid Pol II recycling within seconds. Dysregulation of these modifications contributes to cancer progression; for instance, overexpression of CDK7 and CDK9 in tumors like pancreatic and MYC-driven cancers leads to hyperphosphorylation at Ser5 and Ser2, altering splicing patterns and promoting oncogenic isoform production.

Chromatin Interactions

RNA polymerase II (Pol II) recruitment to promoters is facilitated by specific histone modifications that create binding platforms for components of the transcription initiation machinery. Trimethylation of histone H3 at lysine 4 (H3K4me3), enriched at active promoters, directly interacts with the plant homeodomain (PHD) finger of TAF3, a subunit of the TFIID complex, thereby stabilizing the pre-initiation complex and promoting Pol II recruitment. This interaction ensures efficient assembly of the transcription apparatus at gene starts, with disruptions in H3K4me3 leading to reduced TFIID occupancy and impaired initiation. During elongation, trimethylation of histone H3 at lysine 36 (H3K36me3), deposited co-transcriptionally by SETD2 in association with elongating Pol II, recruits the FACT (facilitates chromatin transcription) histone chaperone complex. H3K36me3 guides FACT to reassemble nucleosomes in the wake of Pol II, maintaining chromatin integrity and supporting processive elongation by alleviating nucleosomal barriers. Nucleosomes pose significant physical obstacles to Pol II progression, particularly the +1 positioned immediately downstream of the transcription start site, where Pol II frequently pauses to coordinate with . This promoter-proximal pausing is exacerbated by the +1 nucleosome's stable positioning, which hinders the enzyme's forward movement and requires coordinated eviction for release into productive elongation. ATP-dependent remodelers of the family, including the BAF complex, synergize with Pol II and transcription factors to unwrap and evict nucleosomes at these sites, dynamically clearing paths for transcription and enhancing processivity. Such remodeling activities are essential for overcoming chromatin barriers, with SWI/SNF depletion resulting in persistent nucleosome occupancy and stalled Pol II. Higher-order structures further modulate Pol II activity through . Recent studies highlight the role of phase-separated biomolecular condensates, driven by intrinsically disordered regions (IDRs) in Pol II, , and associated factors, in forming dynamic hubs that concentrate the transcription machinery at active loci. These IDR-mediated condensates facilitate efficient Pol II clustering and processive transcription, as observed in real-time imaging of reconstituted systems. Enhancer-promoter looping, mediated by and , brings distant regulatory elements into proximity with Pol II-occupied promoters, stabilizing interactions and boosting transcription rates without relying solely on linear diffusion. Epigenetic modifications like also influence promoter accessibility for Pol II. Hypermethylation of CpG islands in promoters directly inhibits binding of transcription factors such as , thereby repressing Pol II recruitment and maintaining . Demethylation at these sites enhances openness, allowing greater Pol II access and transcriptional activation, underscoring the interplay between DNA modifications and structure in regulating polymerase dynamics.

Kinetics and Inhibitors

The kinetics of RNA polymerase II (Pol II) elongation are governed by the Michaelis-Menten equation, where the velocity v is given by v = \frac{k_{\text{cat}} [\text{NTP}]}{K_m + [\text{NTP}]}, with K_m values for triphosphates (NTPs) typically in the range of 10–100 μM, such as approximately 39 μM for wild-type Pol II under saturating conditions. The catalytic rate constant k_{\text{cat}} corresponds to a maximum rate of about 25 per second (nt/s) for pause-free transcription . Pausing indices, which quantify transcriptional pauses, reveal a pause density of approximately 0.045 pauses per at saturating NTP concentrations, reflecting the enzyme's propensity for and regulatory stalling during . Pharmacological inhibitors provide key tools for dissecting Pol II kinetics. , a bicyclic octapeptide from , potently inhibits Pol II with an IC50 of approximately 10 nM by to the RPB1 subunit. This inserts a wedge-like structure into the funnel domain of the , sterically hindering bridge helix mobility and blocking translocation after addition, thereby halting without preventing initial NTP . Another , , acts by chelating essential Zn²⁺ ions required for the 's structure, thereby disrupting its activity and reducing catalytic efficiency. Recent single-molecule assays have refined our understanding of Pol II dynamics . For instance, high-resolution tracking in mammalian cells shows transcriptional bursts with durations on the order of 1 minute, during which Pol II maintains processive before pausing or termination. These inhibitors are widely employed to probe transcription rates; treatment with , for example, allows quantification of nascent decay to infer speeds in cellular contexts.

Specialized Functions

Transcription-Coupled Nucleotide Excision Repair

Transcription-coupled (TC-NER) is a specialized subpathway of that prioritizes the removal of DNA lesions encountered by RNA polymerase II (Pol II) during active transcription, ensuring the fidelity and resumption of . When Pol II encounters bulky DNA lesions, such as UV-induced cyclobutane pyrimidine dimers (CPDs), it stalls at the site of damage, serving as the initial sensor in the pathway. This stalling halts transcriptional elongation and triggers the recruitment of group B protein (CSB, also known as ERCC6), which binds directly to the stalled Pol II complex to initiate repair signaling. The TC-NER pathway proceeds through a series of coordinated steps involving ubiquitination and factor recruitment to facilitate lesion access and excision. CSB recruitment promotes the assembly of the (ERCC8)-containing Cullin-RING ubiquitin ligase complex (CRL4CSA), which includes DDB1 as a core component, leading to the ubiquitination of stalled Pol II—primarily on the RPB1 subunit—and CSB itself to modulate complex stability and dynamics. This ubiquitination enables Pol II backtracking, where the polymerase retreats from the lesion to expose the damaged site within the transcription bubble, allowing handover to core factors such as transcription factor IIH (TFIIH), XPA, and the structure-specific endonucleases XPF-ERCC1 and XPG. XPG and XPF then incise the damaged strand upstream and downstream of the lesion, respectively, excising the oligonucleotide containing the damage and enabling gap-filling by and to restore the strand.00870-0) A hallmark of TC-NER is its strand bias, with lesions repaired preferentially on the transcribed (template) strand of active genes compared to the non-transcribed strand or inactive genomic regions, reflecting the pathway's coupling to Pol II progression. This asymmetry was first demonstrated in mammalian cells using the dihydrofolate reductase (DHFR) gene, where pyrimidine dimers were removed up to 10-fold faster from the transcribed strand following UV exposure. Defects in TC-NER, particularly due to mutations in the CSA or CSB genes, underlie Cockayne syndrome, a severe autosomal recessive disorder characterized by UV hypersensitivity, neurological degeneration, and premature aging, as these mutations impair the recruitment and function of repair factors at stalled Pol II sites.90504-3) Recent structural studies using cryo-electron microscopy (cryo-EM) have provided atomic-level insights into Pol II-lesion complexes in TC-NER. For instance, the 2023 cryo-EM of Rad26 (CSB homolog) bound to a CPD-stalled Pol II elongation complex revealed how Rad26 stabilizes the backtracked state and interfaces with Pol II to facilitate lesion exposure, highlighting conserved interactions that bridge transcription and repair machineries across eukaryotes. These findings underscore the dynamic role of Pol II stalling in orchestrating repair efficiency without requiring Pol II eviction in most cases.

Collision with DNA Replication Forks

During the of the , RNA polymerase II (RNAPII) and the DNA replication machinery can collide on the same genomic template, leading to transcription-replication conflicts (TRCs) that threaten . These collisions are more frequent in highly transcribed regions, such as protein-coding genes, where transcription and replication overlap. Head-on collisions, where the replication fork progresses toward the oncoming RNAPII, are particularly disruptive and prone to formation—stable RNA:DNA hybrids that impede progression—whereas co-directional collisions, with both machineries moving in the same orientation, are generally less severe but can still cause fork slowing. Mechanistically, upon collision, RNAPII can backtrack, forming a transcription bubble that physically blocks the replicative MCM, resulting in replication fork stalling or reversal. In head-on encounters, the nascent can invade the displaced DNA strand, exacerbating accumulation and activating DNA damage responses. Studies in and cells demonstrate that elongating RNAPII acts as a polar roadblock, with its strong DNA-binding affinity preventing efficient replisome passage unless resolved by accessory factors. For instance, defective RNAPII mutants, such as rpb1-1 in , increase chromatin retention of the polymerase, heightening collision frequency and leading to replication fork slowdown, as evidenced by reduced inter-origin distances from 161 kb in wild-type to 105–106 kb in mutants. The consequences of these collisions include replication stress, double-strand breaks (DSBs), and , particularly insertions/deletions at 5' ends in co-directional TRCs and promoter mutations in head-on cases. Unresolved TRCs can activate the ATR/ kinases; notably, elongating RNAPII recruits the MRN complex to collision sites, nucleating ATM-dependent signaling for repair, as shown in human cells where HUWE1 mutations dissociate WRNIP1 from RNAPII, elevating DSBs by up to 3.6-fold. Genomic instability arises if forks collapse, contributing to fragile sites and diseases like cancer. Resolution involves multiple pathways: helicases like Senataxin (SETX) and Rrm3 promote fork progression by displacing RNAPII or resolving R-loops, with SETX associating directly with s to protect against RNAPII blocks in human and models. Other mechanisms include skipping via PRIMPOL-mediated repriming, proteasome-dependent RNAPII degradation facilitated by PNUTS-PP1, and fork restart through RAD51-mediated reversal. These processes ensure minimal disruption, though head-on TRCs remain a major source of endogenous DNA damage.31458-3)

References

  1. [1]
    Structure and mechanism of the RNA polymerase II transcription ...
    Apr 1, 2020 · In this review, we detail the structure and mechanism of over a dozen factors that govern Pol II initiation (eg, TFIID, TFIIH, and Mediator), pausing, and ...
  2. [2]
    Total molecular weight of RNA Polymerase - BNID 111569
    Pol I, II, and III comprise 14, 12, and 17 subunits, respectively, and have a total molecular weight of 589, 514, and 693 kDa, respectively. Ten subunits ...
  3. [3]
    Extension of the minimal functional unit of the RNA polymerase II ...
    May 15, 2019 · The carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (Pol II) consists of 26 and 52 heptad-repeats in yeast and ...
  4. [4]
    Recent advances in understanding RNA polymerase II structure and ...
    Nov 17, 2020 · This article highlights a selection of recent findings that reveal some of the resolved workings of RNA polymerase II and its ensemble of supporting factors.
  5. [5]
    Multiple Forms of DNA-dependent RNA Polymerase in Eukaryotic ...
    ROEDER, R., RUTTER, W. Multiple Forms of DNA-dependent RNA Polymerase in Eukaryotic Organisms. Nature 224, 234–237 (1969). https://doi.org/10.1038/224234a0.
  6. [6]
    Specific Nucleolar and Nucleoplasmic RNA Polymerases - PNAS
    As reported earlier (Roeder, R. G., and W. J. Rutter, Nature, 224, 234 (1969)), two major chromatographically distinct enzymatic species (I and II) are present ...
  7. [7]
    Specific Inhibition of Nuclear RNA Polymerase II by α-Amanitin
    α-Amanitin, a toxic substance from the mushroom Amanita phalloides, is a potent inhibitor of DNA-dependent RNA polymerase II (the nucleoplasmic form)
  8. [8]
    A new method for the purification of RNA polymerase II (or ... - PubMed
    Jan 2, 1978 · The combination of precipitation using poly(ethylene imine) with chromatography on heparin-Sepharose may prove useful in the preparation of ...
  9. [9]
  10. [10]
  11. [11]
    Biogenesis of RNA Polymerases in Yeast - PMC - NIH
    The Rpb3 subassembly complex is formed prior to its interaction with the Rpb2 ... Rpb1 subassembly complex junction leads to the core formation of RNA pol II.
  12. [12]
    Involvement of multiple subunit–subunit contacts in the assembly of ...
    The four subunits, Rpb1, Rpb2, Rpb3 and Rpb11, are therefore supposed to form an enzyme core which corresponds to the core enzyme, α2ββ′, of the prokaryotic RNA ...
  13. [13]
    Multiple structures of RNA polymerase II isolated from human nuclei ...
    May 28, 2025 · RNAPII consists of twelve subunits, which are highly conserved from yeast to humans. During gene expression, RNAPII is first assembled on the ...
  14. [14]
    Widespread epistasis shapes RNA polymerase II active site function ...
    Aug 27, 2025 · Multi-subunit RNA Polymerases are responsible for transcription in all kingdoms of life. These enzymes rely on dynamic, highly conserved ...<|control11|><|separator|>
  15. [15]
    Bridge helix bending promotes RNA polymerase II backtracking ...
    Apr 19, 2016 · We find that the continuous bending motion of the Bridge helix (BH) serves as a critical checkpoint, using the highly conserved BH residue T831 as a sensing ...
  16. [16]
    An RNA polymerase II holoenzyme responsive to activators - Nature
    Mar 31, 1994 · Transcription by this holoenzyme is stimulated by the activator protein GAL4-VP16, a feature not observed with purified RNA polymerase II and general ...
  17. [17]
    A multisubunit complex associated with the RNA polymerase II CTD ...
    We report genetic and biochemical evidence that the RNA polymerase II carboxy-terminal domain (CTD) interacts with a large multisubunit complex that contains ...Missing: holoenzyme | Show results with:holoenzyme
  18. [18]
    A mammalian SRB protein associated with an RNA polymerase II ...
    The SRB proteins are a hallmark of this RNA polymerase II holoenzyme as they are found only in this complex, where they contribute to the response to regulators ...
  19. [19]
  20. [20]
  21. [21]
    Structures of transcription preinitiation complex engaged with the +1 ...
    Nov 21, 2022 · In the yeast Saccharomyces cerevisiae, PIC assembly occurs adjacent to the +1 nucleosome that is located downstream of the core promoter. Here ...
  22. [22]
    Mechanism of Promoter Melting by the Xeroderma Pigmentosum ...
    New promoter contacts by RNAPII subunits at positions −5 and +1 are induced by ATP. TFIIH, mainly through its p89/XPB DNA helicase, is essential for the melting ...
  23. [23]
    In TFIIH, XPD Helicase Is Exclusively Devoted to DNA Repair
    The eukaryotic XPD helicase is an essential subunit of TFIIH involved in both transcription and nucleotide excision repair (NER).
  24. [24]
    Structural insights into preinitiation complex assembly on core ...
    Unexpectedly, TBP of TFIID similarly bends TATA box and TATA-less promoters in PIC. Our study provides structural visualization of stepwise PIC assembly on ...
  25. [25]
    Mechanisms and Functions of the RNA Polymerase II General ...
    Feb 1, 2024 · The Pol II core enzyme can itself synthesize RNA using a template DNA, but promoter-specific transcription initiation requires the canonical ...Missing: hnRNA | Show results with:hnRNA
  26. [26]
  27. [27]
  28. [28]
  29. [29]
    Nuclear ANLN regulates transcription initiation related Pol II ... - Nature
    Feb 2, 2025 · We find that nuclear ANLN directly interacts with the RNA polymerase II (Pol II) large subunit to form transcriptional condensates.
  30. [30]
    Page not found | Nature
    - **Status**: Insufficient relevant content.
  31. [31]
  32. [32]
  33. [33]
    Quantitative Proteomics Demonstrates That the RNA Polymerase II ...
    Two of the subunits of RNA polymerase II, Rpb4 and Rpb7, have been shown to dissociate from the enzyme under a number of specific laboratory conditions.
  34. [34]
  35. [35]
    Efficient termination of nuclear lncRNA transcription promotes ... - eLife
    Mar 5, 2018 · RNA Polymerase II (Pol II) transcribes non-coding DNA into long non-coding RNAs (lncRNAs), but biological roles of lncRNA are unclear. We find ...
  36. [36]
    Orchestrating transcription with the pol II CTD - Nature
    Feb 18, 2015 · Elegant biochemical and genetic studies carried out in the past 30 years have revealed that the Pol II CTD becomes extensively phosphorylated ...
  37. [37]
    The RNA Polymerase II Carboxy-Terminal Domain (CTD) Code
    Aug 16, 2013 · ... Cdk7 as a dual specificity kinase with activity also for Ser7 phosphorylation. ... Cdk9 and BRD4 kinases for phosphorylation of CTD Ser2 residues.
  38. [38]
    The RNA polymerase II CTD coordinates transcription and RNA ...
    The CTD of fission yeast contains 29 heptads, 24 of which are all-consensus, whereas the CTD of budding yeast consists of 26 heptads, 19 of which are perfect ...
  39. [39]
    Minireview - Progression through the RNA Polymerase II CTD Cycle
    In mammalian cells, the Cdk9 kinase subunit of the positive elongation factor P-TEFb phosphorylates. CTD Ser2 as well as the elongation factor Spt4/Spt5 (also ...
  40. [40]
    Short Article Distinct Roles for CTD Ser-2 and Ser-5 Phosphorylation ...
    Capping is targeted to pre-mRNAs through binding of the guanylyltransferase component of the capping apparatus to the phosphorylated CTD of RNA polymerase ...
  41. [41]
    CTD serine-2 plays a critical role in splicing and termination factor ...
    Dec 25, 2012 · CTD Ser2 plays critical roles in co-transcriptional pre-mRNA maturation in vivo: It likely recruits U2AF65 to ensure an efficient co-transcriptional splicing.
  42. [42]
    3′ end formation of pre-mRNA and phosphorylation of Ser2 on the ...
    Jan 29, 2014 · We found that the CTD is heavily phosphorylated on Ser2 (Ser2p) at poly(A) (pA) signals coincident with recruitment of the CstF77 CPA factor.
  43. [43]
  44. [44]
    Oncogenic dysregulation of pre‐mRNA processing by protein ...
    Jun 6, 2021 · Phosphorylation of RNA polymerase II and regulatory RNA-binding proteins represents a major modality of pre-mRNA-processing regulation.Oncogenic Dysregulation Of... · The Sr-Protein Kinase Family · The Cyclin-Dependent Kinase...
  45. [45]
    Structure of transcribing RNA polymerase II-nucleosome complex
    Dec 21, 2018 · The sites of Pol II pausing are generally found upstream of the first nucleosome in the promoter-proximal region ('+1 nucleosome'). Pausing ...
  46. [46]
    The BAF chromatin remodeler synergizes with RNA polymerase II ...
    Dec 4, 2023 · We show that BAF dynamically unwraps and evicts nucleosomes at accessible chromatin regions, while RNAPII promoter-proximal pausing stabilizes BAF chromatin ...
  47. [47]
    Nonlinear control of transcription through enhancer–promoter ...
    Apr 13, 2022 · TAD boundaries and CTCF loops are thought to favour enhancer–promoter communication within specific genomic regions and disfavour it with ...
  48. [48]
    Evidence that direct inhibition of transcription factor binding is the ...
    Dec 5, 2022 · Methylation of CpG-dense promoters causes stable transcriptional repression and is the basis for long-term monoallelic silencing, such as X ...
  49. [49]
    Unraveling the functional role of DNA demethylation at specific ...
    Sep 29, 2021 · We demonstrate a highly effective method using only nuclease-dead Cas9 and guide RNA to physically block DNA methylation at specific targets.
  50. [50]
  51. [51]
    Mammalian Transcription-Coupled Excision Repair - PMC - NIH
    In eukaryotic cells, initiation of TC-NER likely occurs by the physical blockage of RNA polymerase II (RNAPII) on lesions. Lesion-stalled RNAPII subsequently ...Missing: papers | Show results with:papers
  52. [52]
    Transcription-coupled nucleotide excision repair in mammalian cells
    Jan 1, 2008 · The hallmark of TC-NER is the accelerated repair of DNA lesions that efficiently block the elongating RNA polymerase II complex (RNAPIIo). It is ...Missing: seminal papers
  53. [53]
    Transcription-coupled Nucleotide Excision Repair: New Insights ...
    Cells activate transcription-coupled nucleotide excision repair (TC-NER) to remove Pol II-impeding damage and allow transcription resumption.Missing: seminal papers
  54. [54]
    Transcription-Coupled Nucleotide Excision Repair and the ...
    Jun 20, 2023 · Here, we first summarize the current understanding of these repair mechanisms, specifically focusing on the roles of stalled RNA polymerase II, ...Missing: seminal | Show results with:seminal
  55. [55]
    Timely upstream events regulating nucleotide excision repair by ...
    Ubiquitination-mediated regulation of TC-NER: stalled elongating complex brings E3s to take on CSB and RNAPII. 2.3.1. CSB and its E3s. CSB, a well-known ...
  56. [56]
    Consequences and Resolution of Transcription–Replication Conflicts
    The aim of this review is to discuss the consequences of ... Transcription-Replication Conflicts by Promoting RNA Polymerase II Degradation on Chromatin.3.1. Histone Ptms As... · 3.2. Chromatin As An... · 4.1. Rnap Skipping And...
  57. [57]
  58. [58]
    RNA polymerase II contributes to preventing transcription‐mediated ...
    Dec 1, 2014 · Collisions between RNA transcription and DNA replication machineries impair replication fork (RF) progression and cause DNA breaks ...
  59. [59]
    RNA Polymerase II is a Polar Roadblock to a Progressing DNA Fork
    Oct 13, 2024 · Their experiments revealed that the elongating RNA polymerase can lead to RNA-DNA hybrid formation, effectively locking itself at the fork and ...
  60. [60]
    RNAPII-dependent ATM signaling at collisions with replication forks
    Aug 24, 2023 · Here we provide evidence that elongating RNAPII nucleates activation of the ATM kinase at TRCs to stimulate DNA repair.<|control11|><|separator|>
  61. [61]
  62. [62]
  63. [63]