Fact-checked by Grok 2 weeks ago

TATA box

The TATA box is a conserved DNA sequence motif located in the core promoter region of genes in archaea and eukaryotes, serving as a key regulatory element for transcription initiation by RNA polymerase II. It typically consists of the consensus sequence 5'-TATAAA-3' (with variants such as TATAWAWR, where W is A or T and R is A or G), positioned approximately 25 to 35 base pairs upstream of the transcription start site. This A/T-rich sequence facilitates the binding of the TATA-binding protein (TBP), a critical subunit of the general transcription factor TFIID, which bends the DNA and recruits other components of the pre-initiation complex (PIC) to position the polymerase accurately. The primary function of the TATA box is to promote precise and efficient assembly of the , enabling the recruitment of transcription factors like TFIIA, TFIIB, and TFIIF, followed by and additional factors such as TFIIE and TFIIH. In TATA-containing promoters, this interaction sterically constrains the positioning of the , ensuring transcription starts at the correct and influencing the overall rate and fidelity of mRNA synthesis. Mutations in the TATA box can significantly reduce or abolish promoter activity, highlighting its essential role in . While the TATA box is a hallmark of certain promoters, it is absent in a majority of eukaryotic genes; for instance, only about 10-24% of promoters contain a functional TATA box, with higher prevalence in stress-responsive or rapidly inducible genes. In contrast, TATA-less promoters often rely on alternative elements like the initiator (Inr) or downstream promoter element (DPE) for assembly. Evolutionarily, TBP and the TATA box mechanism trace back to a common ancestor of and eukaryotes, underscoring their ancient origin in transcriptional machinery.

Structure and sequence

Consensus sequence

The TATA box is defined by a conserved DNA sequence motif typically spanning 8 base pairs, with the core "TATA" region exhibiting high sequence conservation across eukaryotic promoters, while the flanking positions display greater flexibility. The consensus sequence is most commonly represented as TATAAA, derived from alignments of promoter sequences in viral and cellular genes during the late 1970s and early 1980s. A more detailed consensus, TATAWAWR—where W denotes A or T, and R denotes A or G—accounts for observed variability in functional TATA elements, with this 8-base-pair motif identified through systematic sequencing of metazoan promoters. In vertebrates, the extended core sequence TATAAAAG is particularly prevalent, reflecting adaptations for optimal recognition in higher eukaryotes. This consensus was experimentally established through comparative sequencing of promoter regions from viruses such as and adenovirus, as well as cellular genes like those encoding ovalbumin and , revealing recurring T-A rich patterns approximately 25-35 base pairs upstream of the transcription start site. DNase I assays in the further confirmed the motif's boundaries, showing that binding protects a region of about 8-10 base pairs centered on the TATA core from digestion. Deviations from the ideal TATAAA sequence reduce binding affinity for the (TBP); for instance, TATAAA exhibits the highest affinity, with single substitutions in the core decreasing association rates by up to 10-fold in structural and biochemical studies. These variations allow flexibility in promoter strength, enabling differential regulation while maintaining the motif's role as a core promoter element.

Prevalence in promoters

The TATA box is present in approximately 10–20% of mammalian protein-coding promoters. This prevalence is notably higher in promoters associated with stress-response genes, which often require dynamic , compared to CpG promoters that are frequently TATA-less and linked to constitutive expression. In humans, TATA-containing promoters are enriched in categories related to regulated processes, such as development and response to stimuli, while showing underrepresentation in functions. Genome-wide analyses estimate that approximately 24% of core promoters harbor a TATA-like , with enrichment in functional categories related to inducible processes. TATA-driven promoters, exemplified by those in stress-response pathways, contrast sharply with the majority of GC-rich, TATA-less promoters that dominate expression. Statistical models for predicting TATA box presence rely on promoter architecture features, such as sequence motifs and ; position weight matrices (PWMs) for the enable scoring potential sites, while more advanced approaches like neural-statistical models integrate contextual elements to improve accuracy in distinguishing TATA-positive from TATA-less promoters. These predictive tools highlight how TATA prevalence correlates with AT-rich regions and specific upstream motifs, aiding in the classification of promoter types across eukaryotic genomes.

Location and features

Position in the genome

The TATA box is characteristically positioned 25 to 35 base pairs upstream of the transcription start site (TSS) in eukaryotic gene promoters, serving as a key landmark for transcription initiation. This placement ensures precise alignment of the basal transcription machinery with the TSS. Within the core promoter region, the TATA box is often situated between approximately -30 and -40 bp relative to the TSS, which provides the optimal spacing required for efficient assembly of the preinitiation complex (PIC). Experimental studies have identified an ideal distance of 30 to 31 bp from the TSS to maximize transcriptional accuracy and efficiency. On a chromosomal scale, TATA boxes are enriched in intergenic regions proximal to TSSs across eukaryotic genomes, where they define the boundaries of core promoters and contribute to the spatial organization of transcriptional units. This distribution is absent in prokaryotic (bacterial) genomes, which rely on distinct promoter elements like the -10 box for recognition, but is conserved in archaeal promoters, where TATA boxes similarly occupy positions about 25 bp upstream of the TSS to direct transcription by . Genome-wide mapping of TATA box positions has been facilitated by high-throughput techniques such as followed by sequencing (ChIP-seq), which targets the (TBP) to identify binding sites and infer TATA locations across entire genomes. These methods have revealed consistent positional patterns in diverse eukaryotic species, aiding in the annotation of promoter landscapes.

Variations and analogous elements

The TATA box exhibits sequence variations that deviate from the strict consensus TATAAA, often involving single nucleotide substitutions that result in TATA-like sequences with reduced affinity for the TATA-binding protein (TBP). For instance, base pair changes within the TATA element increase the dissociation rate of TBP from DNA, thereby lowering binding stability and potentially modulating transcription efficiency.92584-6/pdf) Such variants, including those with one mismatch to the consensus, maintain nanomolar binding affinity for TBP and are functional in vivo, appearing in a substantial fraction of promoters originally classified as TATA-associated, estimated at around 20-30% depending on the organism and detection method.00205-3.pdf) In TATA-less promoters, which predominate in metazoan genomes, analogous core promoter elements compensate for the absence of the TATA box to facilitate preinitiation assembly. The Initiator (Inr) , typically encompassing the transcription start site with a YYANWYY (where Y is and N is any ), directs precise initiation and is recognized by TFIID subunits. The Downstream Promoter Element (DPE), located approximately 30 nucleotides downstream of the start site with RGWYVT (where R is , G is , W is A or T, V is A/C/G, and Y is ), synergizes with Inr to enhance transcription, particularly in TATA-less contexts; its mutation can decrease promoter activity by 10- to 50-fold. Similarly, the TFIIB Recognition Element (BRE), an upstream either before (BREu) or after (BREd) the TATA box position, interacts with TFIIB to stabilize the initiation in both TATA-containing and TATA-less promoters. Functional equivalents of the TATA box appear in plant promoters, where sequence variants contribute to the regulation of light-responsive genes. In , the TATA box in the basal promoter of genes like CAB2 modulates transcription in response to signals, with specific compositions influencing inducibility under activation. In fungi, such as , TATA elements often feature shorter or variant motifs that deviate from the canonical length, yet retain inducibility for stress-responsive genes; approximately 20% of yeast promoters harbor these functional TATA variants, which correlate with higher expression noise and rapid activation.00205-3.pdf) Comparative genomics highlights TATA box analogs in prokaryotes, notably the -10 box (Pribnow box) in bacterial sigma70-dependent promoters, with a consensus sequence TATAAT that positions RNA polymerase for initiation approximately 5-9 nucleotides downstream. This element, bound by the sigma70 subunit, serves a parallel role to the eukaryotic TATA box in unwinding DNA and specifying the start site, underscoring evolutionary parallels in promoter architecture.

Function in transcription

Role in initiation complex assembly

The TATA box plays a central role in nucleating the assembly of the pre-initiation complex (PIC) by serving as the primary recognition site for the (TBP), a core subunit of the general transcription factor TFIID. TBP binds specifically to the consensus TATA sequence located approximately 25–35 base pairs upstream of the transcription start site, initiating a stepwise recruitment of other general transcription factors and ( II). This binding event is the foundational step in PIC formation, the ordered assembly required for accurate and efficient transcription initiation by II in eukaryotes. Upon TBP engagement, the TATA box undergoes significant deformation, including sharp bending of the DNA helix by about 80–90 degrees and partial unwinding, which exposes structural features that promote subsequent factor recruitment. TFIIA and TFIIB are then recruited to the TBP-TATA complex; TFIIA stabilizes the initial TBP-DNA interaction, while TFIIB binds downstream and upstream elements to bridge the complex toward the Pol II machinery. This is followed by the association of the TFIIF-Pol II holoenzyme, which positions Pol II over the promoter and facilitates the transition to an open complex conformation, where the DNA template is fully unwound for transcription bubble formation. These sequential interactions ensure directional and stable PIC assembly, with the TATA box acting as the anchor point. In reconstituted transcription systems using purified components, the TATA box markedly enhances basal transcription levels relative to promoters lacking this element, by promoting robust PIC formation and stability. Electrophoretic mobility shift assays (gel shift assays) further confirm this, showing that TATA-dependent PICs form more stable complexes with reduced dissociation rates compared to non-TATA sequences, as the deformation induced by TBP creates a high-affinity platform for factor retention. These assays highlight the TATA box's mechanistic contribution to overcoming kinetic barriers in PIC assembly, ensuring efficient recruitment of Pol II without activators.

Interactions with transcription factors

The TATA box primarily binds the (TBP), a saddle-shaped subunit of the TFIID complex that recognizes the through extensive contacts in the minor groove of the DNA. TBP inserts four conserved residues (Phe237, Phe262, Phe297, and Phe322 in TBP) between adjacent pairs of the TATA , causing partial unwinding and intercalation that kinks the at the TA interfaces. This interaction induces a sharp bend in the DNA of approximately 80 degrees toward the major groove, facilitating the assembly of the preinitiation complex (PIC). Crystal structures determined in the 1990s, such as those of yeast and human TBP bound to TATA elements (e.g., PDB entries 1TBP and 1CDW), reveal that the core contacts occur primarily at positions -28 to -32 base pairs upstream of the transcription start site, involving hydrogen bonds, van der Waals interactions, and hydrophobic packing between TBP's C-terminal stirrups and the deoxyribose-phosphate backbone. These structures highlight the specificity of TBP for AT-rich sequences, where the flexible minor groove allows the protein's concave undersurface to clamp onto the DNA without major groove penetration. Variations in TATA sequence subtly alter the bend angle and stability, but the phenylalanine intercalations remain conserved across eukaryotic TBPs. The stability of the TBP-TATA complex is enhanced by cooperative interactions with general transcription factors TFIIA and TFIIB. TFIIA binds directly to the N- and C-terminal domains of TBP as well as to DNA sequences immediately upstream of the TATA box, stabilizing the bent conformation and preventing dissociation; this multi-valent interaction increases binding affinity by approximately 10-fold . TFIIB, in turn, associates with the convex upper surface of TBP via its core domain, forming a ternary complex that positions the B-reader and B-linker helices to bridge the TBP-TATA assembly to the arriving and TFIIF. Regulation of TBP-TATA affinity is further modulated by co-activators, notably the complex, which interacts with TBP's N-terminal region to enhance DNA binding in a manner dependent on upstream activators. Structural and biochemical studies show that Mediator subunits (e.g., MED15) allosterically influence TBP recruitment, increasing the complex's stability on TATA-containing promoters without directly contacting the DNA. This regulatory layer allows fine-tuned control over transcription initiation rates.

History and discovery

Initial identification

The TATA box was first identified in 1978 by Michael L. Goldberg, Richard P. Lifton, R. W. Karp, and David S. Hogness during sequencing of promoters in . They observed a conserved AT-rich octamer sequence, TATTTATA, positioned approximately 30 base pairs upstream of the transcription start site in multiple s, marking it as a potential regulatory element analogous to bacterial promoter motifs. This discovery, published as part of studies on organization, initially named the sequence the Goldberg-Hogness box after its key contributors. The term "TATA box" emerged in 1980 from work by Richard Breathnach, Keith O'Hare, and Pierre Chambon on the chicken ovalbumin gene promoter, where they highlighted a conserved TATAAA motif across diverse eukaryotic protein-coding genes transcribed by . In their analysis of putative control regions, they emphasized the motif's role in positioning the transcription initiation site and adopted "TATA box" to reflect its A/T-rich composition, distinguishing it from earlier nomenclature while building on the findings. Studies in the on viral promoters further illuminated the motif, notably following the complete sequencing of the simian virus 40 (SV40) genome by Walter Fiers and colleagues in 1978, which revealed a similar TATTTAT sequence upstream of the cap site. This contributed to early recognition of the TATA motif's prevalence in eukaryotic systems. Initially, the Goldberg-Hogness box was sometimes conflated with the bacterial (a -10 element identified in 1976), leading to confusion over its distinct eukaryotic function in preinitiation complex formation. Subsequent functional validation confirmed its specificity for promoters.

Key experimental milestones

In the 1980s, mutagenesis studies provided critical evidence for the essential role of the TATA box in transcription initiation by demonstrating that targeted alterations in this sequence impair accurate start site selection. For instance, point mutations or deletions in the TATA box of the early promoter disrupted precise initiation in transfected mammalian cells, although overall transcription levels were not drastically reduced. Similar results were observed in promoters, where mutagenesis of the TATA element in the HIS3 gene reduced basal transcription to less than 10% of wild-type levels, highlighting the box's necessity for efficient pre-initiation complex formation . During the late 1980s, the purification and of the (TBP), the specific DNA-binding factor that recognizes the TATA box, represented a major advance. Cloned in 1990, TBP was shown to be the core subunit of the general TFIID, capable of directly binding the TATA motif and nucleating PIC assembly. Advancing into the 1990s, techniques offered atomic-level insights into TATA box recognition. The landmark of the TATA-box binding protein (TBP) complexed with a TATA box DNA element, determined at 2.8 Å resolution using , revealed how TBP binds the minor groove of the DNA, inducing a sharp bend of approximately 80 degrees to facilitate recruitment. This structure, reported by Nikolov et al. in 1993, confirmed the sequence-specific interactions involving the eight base pairs of the TATA box and the concave underside of TBP's C-terminal core domain, providing a mechanistic foundation for subsequent studies on promoter architecture. In the 2000s, genome-wide approaches using technology linked TATA box presence to specific regulatory patterns. A comprehensive analysis in identified that TATA-containing promoters are enriched in stress-responsive and inducible , comprising about 20% of the but overrepresented among highly regulated transcripts, with confirming higher TBP occupancy at these sites during activation. This work, building on earlier profiling of under stress conditions, established that TATA boxes correlate with dynamic, condition-specific transcription rather than constitutive housekeeping activity. More recently, in the 2020s, cryo-electron microscopy (cryo-EM) has enabled visualization of the full pre-initiation complex (PIC) on TATA-containing promoters, resolving structures at near-atomic resolution. For example, a 2.9 Å cryo-EM structure of the human RNA polymerase II PIC captured an intermediate state with open DNA bubbles 30–35 base pairs downstream of the TATA box, illustrating how TBP-TATA binding nucleates the stepwise assembly of TFIIA, TFIIB, TFIIE, TFIIF, TFIIH, and polymerase II. These structures have clarified conformational dynamics, such as the partial unwinding of promoter DNA and interactions stabilizing the PIC scaffold, advancing understanding beyond isolated TBP-TATA complexes.

Evolutionary conservation

Across eukaryotes and archaea

The TATA box exhibits high sequence conservation across eukaryotic organisms, maintaining a core consensus of TATAAA or closely related variants such as TATAWAWR, where W represents A or T and R represents A or G. This motif is present in promoters from simple eukaryotes like the yeast Saccharomyces cerevisiae, where approximately 20% of genes feature a TATA box, to complex metazoans including humans, in which about 24% of core promoters contain a TATA-like element.00205-3) The structural features enabling TBP binding, including DNA bendability and AT-rich composition, remain consistent, underscoring the motif's role in basal transcription initiation despite variations in promoter architecture. In , the TATA box is similarly conserved and functional, binding to homologs of the eukaryotic (TBP) to recruit . For instance, in the hyperthermophilic archaeon Sulfolobus shibatae, the TATA box is located approximately 25-30 base pairs upstream of the transcription start site and is essential for efficient initiation, with mutations reducing transcription levels dramatically. This TBP-TATA interaction is universal across archaeal phyla, from crenarchaeotes like to euryarchaeotes, reflecting a shared mechanistic foundation with eukaryotes. The presence of the TATA box in both domains points to an ancient evolutionary origin following the (LUCA) but predating the divergence of and eukaryotes, as archaeal transcription machinery closely resembles the eukaryotic system, including TBP and TFIIB homologs, while lack these components. Comparative phylogenetic studies further reveal that TATA box prevalence correlates with Pol II/III-like transcription apparatuses, being more abundant in genes requiring precise in both and eukaryotes. Metagenomic analyses from the have extended this conservation to uncultured archaeal lineages, identifying TBP genes and associated promoter motifs in diverse environmental samples, such as hypersaline mats and deep-sea sediments, confirming the motif's ubiquity beyond cultured representatives. These findings, derived from assemblies of uncultured genomes, highlight TATA boxes in novel archaeal clades like thaumarchaeota, supporting a deep-rooted distribution.

Functional divergence

The functional role of the TATA box exhibits notable divergence across evolutionary lineages, reflecting adaptations to diverse transcriptional demands. In , TATA-containing promoters are predominantly associated with stress-responsive and inducible genes, characterized by high turnover and rapid activation under environmental challenges, whereas TATA-less promoters support constitutive, basal expression of genes. In contrast, metazoans like humans and display TATA boxes in only about 10-20% of promoters, where they facilitate dynamic, tissue-specific, and inducible transcription, including rapid responses in heat shock genes via enhanced recruitment of under proteotoxic stress. This shift underscores a broader reliance on TATA-less architectures in complex multicellular organisms for stable gene regulation. Certain eukaryotic lineages, such as higher plants, have largely lost dependence on the TATA box, favoring TATA-less promoters enriched with initiator (Inr) and downstream promoter elements (DPE) to direct precise transcription initiation, particularly for and developmental genes. In , for instance, only approximately 29% of promoters contain TATA motifs, with Inr/DPE combinations enabling efficient, nucleosome-resistant initiation suited to photosynthetic and growth-related expression. Conversely, exhibit a widespread retention and potential gain of TATA boxes in their promoters, integral to the simplified basal transcription machinery that supports adaptation in extremophilic environments, such as high-temperature or hypersaline conditions, by ensuring robust pre-initiation complex assembly with minimal factors. Evolutionary models propose that the TATA box co-evolved with the (TBP) through duplication events in the TBP domain approximately 2 billion years ago, coinciding with the emergence of complex eukaryotic regulation and divergence from archaeal ancestors. studies highlight how TATA presence correlates with distinct requirements; for example, TATA promoters often necessitate ATP-dependent remodelers for eviction to enable induction, a pattern more pronounced in stress genes than in TATA-less plant or metazoan counterparts. This linkage to dynamics illustrates functional divergence, where TATA facilitates poised, remodeler-dependent states in lineages requiring rapid transcriptional shifts.

Genetic mutations

Types and molecular effects

Genetic alterations to the TATA box primarily consist of insertions, deletions (indels), and point mutations, each disrupting the precise required for binding and preinitiation complex () formation. Indels in the TATA box region or adjacent sequences alter the spacing between the TATA element and the transcription start site (TSS), typically shifting the TSS by 1-10 base pairs and decreasing transcriptional efficiency or redirecting to alternative sites favoring the preferred spacing of 30-31 bp, as observed in mammalian core promoters where non-optimal spacing lowers promoter specificity and output. Point mutations within the (typically TATAAAAG) variably impair or modulate TBP recognition. A such as A to T at position -28 (resulting in TATTAAAG) reduces transcriptional activity to approximately 14% of the level while maintaining comparable TBP frequency (8.4%), due to preserved groove interactions with TBP residues like Asn-27 and Asn-117. More disruptive changes, such as those converting core T/A residues to C or G, can abolish TBP affinity almost entirely; for example, certain non- variants exhibit transcription levels below 5% and fail to support stable formation. Conversely, adjustments like A to G at select positions may slightly enhance affinity by improving sequence flexibility for DNA distortion, though such gains are context-dependent and rarely exceed 10-20% improvement in efficiency. These mutations exert direct biochemical effects by perturbing DNA conformation and PIC assembly. Wild-type TBP binding induces an ~80-90° bend in the TATA box DNA, facilitating recruitment of TFIIA/B and RNA polymerase II; mutations often diminish this bending to <60°, weakening the structural scaffold for PIC stability and reducing overall complex persistence. Electrophoretic mobility shift assays (EMSA) quantify these impacts, revealing that mutated TATA sequences form less stable TBP-DNA complexes with dissociation rates increased by 2-5 fold compared to consensus, directly correlating with impaired PIC formation and transcriptional initiation. Quantitative models for predicting mutation effects include scoring systems based on position-specific weight matrices, which assign values to nucleotide changes in the TATA sequence to forecast TBP binding and transcription efficiency losses; for example, log2-transformed relative affinity scores from such matrices closely predict promoter strength reductions of 50-80% for single-point variants. These tools, refined from early consensus analyses, enable computational assessment of indel-induced spacing disruptions and point mutation impacts without experimental assays.

Impact on gene expression

Mutations in the TATA box significantly reduce promoter strength, leading to substantial decreases in transcription levels. In reporter assays, such mutations can lower mRNA output by 20- to 100-fold or more, depending on the specific alteration and context. For instance, mutating the TATA box in the U1 snRNA gene promoter resulted in approximately a 30-fold reduction in transcription efficiency. These effects arise primarily from impaired recruitment of the transcription initiation complex, though the exact fold reduction varies with the position and nature of the mutation, such as single nucleotide changes within the core TATAAA sequence. TATA box mutations also dysregulate the temporal aspects of gene expression, particularly in stress-responsive genes. Genes with TATA boxes are often involved in rapid induction during environmental stresses, and mutations disrupt this timing, leading to delayed or attenuated responses. This dysregulation stems from altered bursty transcription patterns characteristic of TATA-containing promoters, where mutations shorten transcriptional bursts and impair timely activation. Epigenetic consequences of TATA box mutations include altered histone acetylation at promoters, creating ripple effects on chromatin structure. The TATA box is essential for recruiting chromatin-modifying complexes that promote histone hyperacetylation, facilitating open chromatin for transcription. Mutations in the TATA box, such as those in the yeast CUP1 promoter, severely impair this hyperacetylation, reducing acetylated histones H3 and H4 levels and destabilizing promoter nucleosomes. These changes propagate to broader epigenetic remodeling, affecting long-term gene accessibility without directly altering DNA sequence. Genome-wide association studies (GWAS) and expression quantitative trait loci (eQTL) analyses have linked TATA box variants to variations in gene expression across populations. Polymorphisms in TATA boxes act as cis-regulatory elements, influencing mRNA levels and contributing to eQTL signals in human promoters. For example, over 40 described TATA box mutations in human genes are associated with altered expression quantitative traits, often correlating with disease susceptibility through dysregulated transcription. These findings highlight TATA variants as key contributors to inter-individual differences in gene expression, as identified in large-scale eQTL datasets.

Disease associations

Linked disorders

Mutations in the TATA box of the HBB gene are associated with β-thalassemia, an autosomal recessive blood disorder characterized by reduced or absent β-globin chain production, leading to ineffective erythropoiesis and hemolytic . The -28 A>G variant in the TATA box (HBB:c.-78A>G) impairs binding of the , resulting in 3-5-fold lower β-globin mRNA levels in heterologous cells and up to 10-fold reduction in patient erythroid cells, causing a β⁺-thalassemia with mild to moderate . This mutation has been identified in diverse populations, including those of Mediterranean descent, where β-thalassemia carrier prevalence is 5-15% and promoter mutations such as -28 A>G contribute to approximately 1-2% of cases.

Pathogenic roles

Alterations in the TATA box can lead to , particularly in dosage-sensitive genes where a in a single significantly reduces overall output. In the case of the hemoglobin beta (HBB) , the -80 T>C in the promoter decreases beta-globin transcription, effectively halving functional from the affected and contributing to the imbalanced globin chain synthesis characteristic of beta-thalassemia major when combined with another defective . This reduced expression exacerbates ineffective and , as the precise of alpha and beta s is critical for stable formation. TATA box mutations exert tissue-specific effects by impairing promoter integrity, which disrupts the spatial organization required for enhancer-promoter interactions in differentiated cells. In cancer cells, such promoter alterations can interfere with looping, preventing enhancers from effectively recruiting transcriptional machinery to TATA-dependent promoters and leading to dysregulated expression of lineage-specific genes. For instance, TATA box polymorphisms that weaken promoter activity have been linked to increased to gastric cancer through diminished transcription in gastric epithelial tissues. This disruption is particularly pronounced in TATA-containing promoters prevalent in stress-responsive or tissue-restricted genes, amplifying oncogenic transformation in affected cellular contexts. Distinctions between and TATA box alterations highlight their roles in tumor progression versus inherited disorders. mutations, such as those in HBB, are constitutionally present and drive systemic diseases like beta-thalassemia from birth. In contrast, mutations in the TATA box arise during tumorigenesis and are detected in tumor tissues, where they contribute to aberrant ; for example, only a few changes were identified in TATA boxes across samples, yet they correlate with promoter dysfunction and activation. Acquired TATA alterations in tumors facilitate overexpression by enhancing responsiveness to enhancers in the neoplastic environment. Animal models provide mechanistic insights into these pathogenic roles, with knock-in approaches recapitulating phenotypes. Transgenic mice carrying HBB genes with TATA box mutations, such as novel variants in the promoter region, exhibit reduced beta-globin RNA levels to approximately 25% of normal , mirroring the mild beta-plus- phenotype observed in affected individuals and demonstrating ineffective hematopoiesis. These models confirm that TATA disruption leads to dosage imbalances and hemolytic features akin to , underscoring the sequence's vulnerability . Recent studies as of 2025 suggest TATA box variants, like -78A>G in HBB, may serve as targets for to elevate in β-.

Biotechnological applications

In genetic engineering

In genetic engineering, the TATA box serves as a key core promoter element in synthetic constructs designed to precisely regulate transgene expression, particularly in CRISPR-based vectors. Minimal TATA box promoters enable promoterless transgene activation by recruiting RNA polymerase II through targeted binding of deactivated Cas9 (dCas9) fused to activation domains like VP64. For instance, the CRISPReader system incorporates a consensus TATA box (TATATAA) upstream of a transgene such as Renilla luciferase, where dCas9-VP64 guided by a single-guide RNA (sgRNA) binds adjacent sites to initiate transcription, achieving up to a 30-fold increase in luciferase activity in HEK-293T cells compared to non-targeted controls. This approach has been extended to in vivo applications via adeno-associated virus (AAV) delivery, where a minimal TATA promoter drives initial expression of dCas9-VP64, amplified by sgRNA binding upstream to form a positive feedback loop that enhances target gene activation, such as upregulation of Apoa1 for cholesterol reduction in hypercholesterolemic mouse models. Synthetic TATA variants are also engineered by fusing upstream activation sequences from natural promoters to TATA-containing core regions, yielding compact, inducible promoters for controlled delivery. This modular design allows tailoring of expression strength and specificity, with TATA box variants optimizing basal transcription levels while enhancers confer inducibility. techniques are utilized to modify the TATA box in the beta-globin promoter, creating cellular models of to investigate strategies. By introducing specific point mutations in the TATA box, researchers assess molecular effects on erythroid and production. For example, mutagenesis of the beta-globin TATA box in constructs transfected into erythroid cells revealed impaired recruitment of erythroid Krüppel-like factor (EKLF), leading to reduced promoter activity and mimicking -associated expression defects. Programmable nucleases like TALENs and ZFNs have been used in preclinical paradigms to edit endogenous sites and restore balanced chain synthesis. TATA-luciferase reporter constructs provide a robust platform for of transcription modulators targeting TATA-dependent initiation. These assays typically feature a or Renilla luciferase downstream of a minimal TATA box, often with upstream response elements to mimic native promoters, allowing quantification of transcriptional changes via readout. In one optimized system, a TATA-driven luciferase reporter integrated into stable cell lines screened over 6,700 compounds, identifying small-molecule inhibitors of receptor-mediated transcription with values as low as 0.48 μM, demonstrating the assay's sensitivity for detecting modulators that alter TATA box functionality. Such constructs support 384-well format screening with low , facilitating of novel regulators for therapeutic intervention in TATA-associated dysregulation. Advances in the 2020s have leveraged CRISPR/Cas9 to delete TATA sites, such as in the PMP22 promoter, reducing overexpression and demyelination in mouse models of Charcot-Marie-Tooth type 1A. In models, base editing of promoter variants in patient-derived iPSCs has been applied to correct point mutations, enhancing reactivation and erythroid maturation efficiency, with editing rates exceeding 40% in corrected clones for simulation and therapy validation. These methods prioritize minimal off-target effects, supporting scalable iPSC-based platforms for screening TATA-targeted interventions.

Therapeutic and diagnostic uses

The TATA box serves as a target in strategies aimed at correcting promoter mutations that impair in hereditary disorders. In Charcot-Marie-Tooth disease type 1A (CMT1A), a demyelinating neuropathy caused by PMP22 , CRISPR-Cas9 editing of the PMP22 TATA box has demonstrated therapeutic potential by reducing gene overexpression and alleviating demyelination in models, providing a proof-of-concept for promoter-specific editing. Although TATA box mutations contribute to beta-thalassemia by disrupting HBB promoter activity, leading to reduced beta-globin expression, current CRISPR-based clinical trials such as CTX001 (now exagamglogene autotemcel, approved in 2023 following phase 3 trials) primarily target BCL11A to enhance production rather than directly correcting TATA variants; preclinical CRISPR approaches have successfully repaired other HBB promoter mutations, suggesting applicability to TATA defects. In diagnostics, TATA box-associated single nucleotide polymorphisms (SNPs) are leveraged for cancer through methods. The rs10993994 in the MSMB promoter, located adjacent to the TATA box and altering its function, significantly increases susceptibility by reducing MSMB expression, a tumor suppressor; this variant is detected via -based assays in peripheral blood DNA for risk stratification, enabling early screening in high-risk populations. Emerging applications include detection of such TATA-related SNPs in from liquid biopsies, enhancing non-invasive monitoring and personalized risk profiling for progression. Drug development has explored small molecules that inhibit (TBP) to selectively disrupt transcription of TATA-dependent oncogenes in cancer cells. Compounds such as bind TBP and prevent its interaction with the TATA box, suppressing Pol II recruitment and reducing expression of proliferation-related genes, with preclinical evidence of antitumor effects in models of various malignancies. Similarly, inhibitors targeting TBP-associated factors (TAFs), such as bromodomain blockers for TAF1, have shown promise in preclinical cancer models by impairing TATA-initiated transcription of oncogenic networks. Recent advancements as of 2024 incorporate to predict the pathogenicity of genetic for in rare . AI-driven models, such as algorithms, forecast the impact of variants on function and , facilitating in genomic diagnostics; these tools integrate with clinical workflows to guide targeted therapies.

References

  1. [1]
    Molecular determinants underlying functional innovations of TBP ...
    May 13, 2020 · TATA-box binding protein (TBP) is required for every single transcription event in archaea and eukaryotes. It binds DNA and harbors two ...
  2. [2]
    Core promoter elements of eukaryotic genes have a highly ...
    The TATA box has a consensus TATA(A/T)A(A/T) sequence and is usually located ∼25–31 bp upstream of the transcription start site, +1 (1,2).
  3. [3]
    RNA polymerase II transcription initiation: A structural view - PNAS
    The best characterized core promoter elements, which can function independently or synergistically, are the TATA element (located 25 bp upstream of the ...Rna Polymerase Ii... · Promoter Anatomy And The... · Tata Box-Binding Protein
  4. [4]
    On the Role of TATA Boxes and TATA-Binding Protein in ...
    Three TATA box sequences (TATATATA, TATAAATA, and TATATAAA), which are effective in the transcription of eukaryotic genes, were also effective in transcription ...
  5. [5]
    Transcriptional and structural impact of TATA-initiation site spacing ...
    Aug 17, 2006 · In TATA driven promoters, the primary role of the TATA box is to anchor the PIC. In higher eukaryotes, this process sterically constrains ...
  6. [6]
    The TATA-Box Sequence in the Basal Promoter Contributes to ... - NIH
    The experiments give direct evidence for the role of a core TATA-box sequence in determining the level as well as selectivity of gene expression in plants.<|control11|><|separator|>
  7. [7]
    Frequency distribution of TATA Box and extension sequences on ...
    Dec 12, 2006 · In this study, we conduct a dedicated analysis on the frequency distribution of TATA Box and its extension sequences on human promoters.
  8. [8]
    Core Promoters in Transcription: Old Problem, New Insights - NIH
    The DPE (consensus sequence: A/GGA/TCGTG), which occurs mostly but not exclusively occurs in TATA-less promoters, acts in conjunction with the Inr and is ...The Core Promoter: A... · Figure 1 · Core Promoters Of Non-Coding...
  9. [9]
  10. [10]
  11. [11]
    TATA Binding Proteins Can Recognize Nontraditional DNA ... - NIH
    ... 25–35 bp upstream of the TSS ... This suggests that a poly-T stretch could serve the function of a TATA box when the promoter region lacks a TATA box.
  12. [12]
    Select / Download tool - EPD The Eukaryotic Promoter Database
    TATA box: a promoter is with a TATA box if the motif is found at position −28 (± 3 bp) from the TSS (evaluated using FindM). Initiator: a promoter is with ...
  13. [13]
    Systematic interrogation of human promoters - PMC - NIH
    We used this method to investigate thousands of native promoters and preinitiation complex (PIC) binding regions followed by in-depth characterization of the ...
  14. [14]
    Transcription Regulation in Archaea | Journal of Bacteriology
    Jun 27, 2016 · There are four DNA elements currently known to regulate archaeal transcription initiation: (i) the TATA box located approximately 25 bp upstream ...
  15. [15]
    The evolution of TBP in archaea and their eukaryotic offspring - NIH
    The TATA-box consensus shows strong conservation from archaea to eukaryotes, reflecting the restrictions on the physicochemical properties of the DNA to allow ...
  16. [16]
    Comparing genome-wide chromatin profiles using ChIP-chip or ...
    2.2.2 ChIP-seq data. The unpublished ChIP-seq dataset investigates the differences in the distribution of TATA-binding protein (TBP) across the human ...
  17. [17]
    Mutations on the DNA Binding Surface of TBP Discriminate ... - NIH
    Initial in vitro studies showed that TBP has nanomolar affinity for several TATA sequences with one mismatch to the consensus (36).
  18. [18]
    The RNA polymerase II core promoter - Genes & Development
    The consensus sequence for the TATA box is TATAAA. It has been observed, however, that a wide range of sequences can function as a TATA box in vivo (Singer et ...The Tata Box · The Tfiib Recognition... · Other Core Promoter...Missing: original | Show results with:original
  19. [19]
    Affinity and competition for TBP are molecular determinants of gene ...
    Feb 2, 2016 · In fact, TATA-box genes that are highly expressed display a higher CV, suggesting that the higher noise cannot simply be explained due to low ...Missing: TATAAA | Show results with:TATAAA
  20. [20]
    Pribnow Box - an overview | ScienceDirect Topics
    The Pribnow box is located approximately 10 bp upstream from the RNA startsite (one DNA helix turn), whereas the TATA box is separated from the mRNA startsite ...
  21. [21]
    Assembly of RNA polymerase II transcription initiation complexes - NIH
    The first step of eukaryotic gene expression is the assembly of RNA polymerase (Pol) II and general transcription factors on promoter DNA.
  22. [22]
    Structural basis of preinitiation complex assembly on human Pol II ...
    It is generally accepted that binding of TBP to the TATA‐box nucleates the formation of the PIC either through a stepwise assembly of other basal factors ...
  23. [23]
    A potential role for TATA box stabilization of the TFIID:TFIIA:DNA ...
    We infer that the TATA sequence plays a role in directing the rate at which the TFIID:A:B complex can complete assembly of functional pre-initiation complexes.Gel Mobility Shift Assays · Results · Tfiia Dissociation And The...
  24. [24]
    Co-crystal structure of TBP recognizing the minor groove of a TATA ...
    Oct 7, 1993 · Side-chain/base interactions are restricted to the minor groove, and include hydrogen bonds, van der Waals contacts and phenylalanine–base ...
  25. [25]
    Crystal structure of a human TATA box-binding protein/TATA ... - PNAS
    The TATA box-binding protein (TBP) is required by all three eukaryotic RNA polymerases for correct initiation of transcription of ribosomal, messenger, ...Missing: seminal papers
  26. [26]
    Human Mediator Enhances Activator-Facilitated Recruitment of RNA ...
    Moreover, Mediator- P.5 is able to enhance TBP binding to the TATA box in an activator-dependent manner. Our data provides biochemical evidence that Mediator ...
  27. [27]
    In vivo sequence requirements of the SV40 early promoter region
    Mar 26, 1981 · The TATA box region is apparently involved infixing initiation precisely within a narrow area, but is dispensable for gene expression, while ...
  28. [28]
    Crystal structure of a yeast TBP/TATA-box complex - Nature
    Oct 7, 1993 · The 2.5 Å crystal structure of a TATA-box complex with yeast TBP shows that the eight base pairs of the TATA box bind to the concave surface of TBP.
  29. [29]
    Prevalence of the initiator over the TATA box in human and yeast ...
    About 24% of human genes have a TATA-like element and their promoters are generally AT-rich; however, only approximately 10% of these TATA-containing promoters ...Missing: percentage | Show results with:percentage<|separator|>
  30. [30]
    Cloning and functional analysis of the TATA binding protein from ...
    May 25, 1995 · Here we report the cloning of a gene from the Archaeon Sulfolobus shibatae that encodes a protein with strong homology to the TATA binding protein (TBP) of ...
  31. [31]
    Molecular Mechanisms of Transcription Initiation - structure, function ...
    First, TBP binds to the TATA box of the promoter and distorts this by bending the DNA roughly at a right angle [25]. The kinetics of archaeal TBP-TATA box ...
  32. [32]
    Early Evolution of Transcription Systems and Divergence of Archaea ...
    May 5, 2021 · To recognize a core promoter, Archaea utilize TBP (TATA-box binding ... According to our view, pre-life evolved to LUCA, which we interpret as an ...
  33. [33]
    Evolutionary history of the TBP-domain superfamily - Oxford Academic
    The TATA binding protein (TBP) is an essential transcription initiation factor in Archaea and Eucarya. Bacteria lack TBP, and instead use sigma factors for ...
  34. [34]
    Uncovering ancient transcription systems with a novel evolutionary ...
    Jun 16, 2016 · TBP is a GTF that binds to an upstream sequence of a target gene, the so-called TATA-box, after which another GTF, TFIIB, binds to TBP and its ...
  35. [35]
    Insights into the evolution of Archaea and eukaryotic protein modifier ...
    Dec 15, 2010 · Archaeal homologs related to transcriptional initiation such as transcription factor B (TFB), TATA-binding protein (TBP) and transcription ...
  36. [36]
    Phylogenetically Driven Sequencing of Extremely Halophilic ...
    We describe the expansion and differentiation of haloarchaeal general transcription factor families, including two novel expansions of the TATA-binding protein ...
  37. [37]
    On the Role of TATA Boxes and TATA-Binding Protein in ... - MDPI
    SAGA-dependent genes of yeast contain a consensus TATA box with the sequence TATAWAWR, whereas TFIID-dependent ones used to be considered TATA-free; however ...1. Introduction · 2. Structure Of Tbp · 6. Mutations In Tata Boxes...
  38. [38]
    Prevalence of the Initiator over the TATA box in human and yeast ...
    The consensus sequences included the mammalian INR (YYANWYY), the BRE (SSRCGCC), the DPE (RGWCGTG), the canonical TATA box sequence TATAWAWR (TATA-8) (Smale and ...
  39. [39]
    Promoter architecture and the evolvability of gene expression
    Dec 14, 2009 · One particular promoter binding site stands out for its large influence on expression divergence: promoters that contain a TATA box show a ...
  40. [40]
    Differentiation of core promoter architecture between plants and ...
    Several core promoter elements have been identified: the TATA box, the TFIIB-Recognition Element (BRE), Inr, the Downstream Promoter Element (DPE) for ...
  41. [41]
    Genome wide analysis of Arabidopsis core promoters
    Feb 25, 2005 · Using Nucleotide Frequency Matrices (NFM), we carried out a genome-wide analysis for the presence and position of TATA-box elements. Our ...
  42. [42]
    Transcription Regulation in Archaea - PMC - NIH
    May 2, 2016 · TBP is needed to recognize the TATA box, bend the DNA, and recruit TFB (46); its role had therefore been deemed equivalent to the role of ...
  43. [43]
    Toward an elucidation of the process underlying the evolution of ...
    Aug 27, 2018 · ... TATA box-Binding Protein (TBP), Transcriptional Factor IIB (TFIIB) ... 2 to 2.5 billion years ago, as described below [9]. In the ...
  44. [44]
    The impact of chromatin remodelling on cellulase expression in ...
    Aug 7, 2015 · In this study we aimed to learn if the chromatin status changes in context to the applied conditions (repressing/inducing), and if the presence ...<|control11|><|separator|>
  45. [45]
    Comparative genomics of transcription factors and chromatin ...
    In this article, we use the vast array of data from recently published protist genome sequences to present a comparative genomic overview of chief aspects of ...
  46. [46]
    Transcriptional and structural impact of TATA-initiation site spacing ...
    Insertions between the TATA boxes and the initiation sites conferred either a significant decrease in transcription or creation of new TSS with a more ...
  47. [47]
    TATA element recognition by the TATA box-binding protein has ...
    Cocrystal structures of wild-type TATA box-binding protein (TBP) recognizing 10 naturally occurring TATA elements have been determined at 2.3–1.8 Å resolution.
  48. [48]
    DNA Sequence-dependent Differences in TATA-binding Protein ...
    We show in this study that the osmolytes used to crystallize TBP·TATA complexes induce profound increases in the DNA bends of two transcriptionally active TBP- ...
  49. [49]
    TATA-binding Protein Variants That Bypass the Requirement for ...
    The idea that appropriately tuned PIC instability is required for proper gene expression is supported by the similarities in the behavior of TBP Y185C and F207L ...
  50. [50]
    Protein-coding gene promoters in Methanocaldococcus ...
    Apr 9, 2009 · It is known that the spacing between the TATA box and the TSS can vary by a few nucleotides (1). To refine the alignment of the upstream ...
  51. [51]
    Entry - *141900 - HEMOGLOBIN--BETA LOCUS; HBB - OMIM
    ... beta-globin structural genes in Mediterranean persons with beta-thalassemia. ... thalassemia due to a -29A-G promoter mutation (141900.0379). Neither ...
  52. [52]
  53. [53]
    [PDF] TATA Box Polymorphisms in Human Gene Promoters and ...
    The TATA box flanking sequences can influence TBP affinity as well as the level of basal and activated tran- scription. The possibility of mediated involvement ...
  54. [54]
    TATA Box Polymorphisms in Human Gene Promoters and ... - PubMed
    A table containing data on TATA element polymorphisms in human gene promoters (about 40 mutations have been described), associated with particular pathologies.Missing: eQTL | Show results with:eQTL
  55. [55]
    The mechanism by which TATA-box polymorphisms associated with ...
    ... TATA box of the triosephosphate isomerase gene promoter, associated with neurological and muscular disorders, were observed to have a 36-fold decrease in ...
  56. [56]
    VCV000015467.120 - ClinVar - NCBI - NIH
    -80T>A pathogenic variant (also known as -30T>A) is located in the TATA-box of the HBB gene promoter, and decreases transcription of the beta-globin gene. This ...
  57. [57]
    The Molecular Basis of β-Thalassemia - PMC - NIH
    The syndrome comprises a distinct set of mutations affecting the HBB gene that are associated with typical hematological features of β-thalassemia (i.e. ...
  58. [58]
    Looping versus linking: toward a model for long-distance gene ...
    Enhancers tested thus far appear to favor promoters containing strong consensus TATA boxes over those with weak TATA elements and strong consensus initiator ...
  59. [59]
    Single nucleotide polymorphisms and cancer susceptibility
    Dec 15, 2017 · Numerous genetic polymorphisms in lncRNAs are associated with increased risk in gastric cancer, CRC and prostate cancer [84–86]. SNPs in the ...
  60. [60]
    TATA and paused promoters active in differentiated tissues have ...
    Feb 5, 2021 · While promoters with Pol II pausing elements are active throughout development, TATA promoters are highly active in differentiated tissues. We ...<|separator|>
  61. [61]
    Core promoter mutation contributes to abnormal gene expression in ...
    Jan 15, 2022 · Reflecting the fact that TATA box is not tolerable for base changes [13], only 2 somatic and 1 germline mutations were located at the TATA box.
  62. [62]
    TATA Box and Spl Sites Mediate the Activation of c-myc Promoter P1 ...
    ... 50-fold molar excess of oligonucleotides -66/ -48 did not affect any ... To determine the role of the TATA box in basal and enhanced transcription we ...
  63. [63]
    β-Thalassemia in American Blacks: Novel mutations in the 'TATA ...
    This mutant gene directed β-globin RNA at 25% of normal levels both in heterologous cells and in vivo. It was associated with a mild β+-thalassemia phenotype. A ...
  64. [64]
    B6;129P2-Hbb tm2Unc /J - The Jackson Laboratory
    Jul 20, 2021 · These knock-in mice contain the human Β globin gene with the Β-thalassemia associated mutation, IVS-2-654. Read More. View Pricing.
  65. [65]
    Multiplexed promoterless gene expression with CRISPReader
    Jun 3, 2019 · These results confirmed that the CRISPR transcriptional activation system was useful for inducing transgene expression with a minimal TATA box.
  66. [66]
    CRISPR Based Synthetic Transcription Factors In Vivo - NIH
    dCas9- VP64 was expressed using a minimal TATA promoter; its own transcription was then further promoted by the binding of gRNA upstream of the TATA box, along ...
  67. [67]
    A Fruitful Decade Using Synthetic Promoters in the Improvement of ...
    Generally, synthetic promoters are designed by joining the upstream activation sequence of one promoter to a TATA box-containing region of another promoter for ...<|separator|>
  68. [68]
    The β-globin promoter is important for recruitment of erythroid ...
    Site-directed mutagenesis of the promoter was done with plasmid 5′HS234-β as ... The mutation of the proximal CACCC box and the TATA box abolished the recruitment ...
  69. [69]
    Both TALENs and CRISPR/Cas9 directly target the HBB IVS2 ... - NIH
    Jul 9, 2015 · CRISPR/Cas9 induced DSBs with greater efficiency than TALENs. For gene targeting near the IVS2-654 C > T mutation site, TALENs mediated higher ...Missing: TATA box
  70. [70]
    Development of a High-Throughput Screening Assay for Small ... - NIH
    In parallel, a luciferase reporter gene driven by two GREs from the tyrosine amino transferase gene were cloned upstream of the TATA box in plasmid pGL4.26, to ...
  71. [71]
    Targeted PMP22 TATA-box editing by CRISPR/Cas9 reduces ...
    Nov 12, 2019 · An in vitro DNA cleavage score was calculated at each nucleotide position across the genome with a DNA cleavage scoring system described ...
  72. [72]
    Efficient gene editing in induced pluripotent stem cells enabled ... - NIH
    Dec 11, 2023 · The development of base editing technology has significantly expanded the capability for rapid isogenic iPSC generation for disease modeling, ...
  73. [73]
    Targeted PMP22 TATA-box editing by CRISPR/Cas9 reduces ... - NIH
    Nov 12, 2019 · Our data present a proof-of-concept that CRISPR/Cas9-mediated targeting of TATA-box can be utilized to treat CMT1A.
  74. [74]
    A new TATA box mutation detected at prenatal diagnosis for beta ...
    During the course of prenatal diagnosis for beta-thalassemia in Chinese patients, we encountered a mutation that was not detectable by oligonucleotides for ...Missing: site- directed TALENs ZFNs therapy
  75. [75]
    CRISPR-Cas9 Gene Editing for Sickle Cell Disease and β ...
    Dec 7, 2020 · Both diseases are caused by mutations in the hemoglobin β subunit gene (HBB). Mutations in HBB that cause TDT result in reduced (β+) or absent ( ...
  76. [76]
    In situ correction of various β-thalassemia mutations in human ...
    Jan 24, 2024 · In this study, a universal gene correction method was devised to achieve in situ correction of most types of HBB mutations by using validated CRISPR/sgRNA–RNP ...Missing: TATA | Show results with:TATA
  77. [77]
    Fine mapping and functional analysis of a common variant in MSMB ...
    May 12, 2009 · Two recent genome-wide association studies have independently identified a prostate cancer susceptibility locus on chromosome 10q11.2.
  78. [78]
    Validation of prostate cancer risk variants rs10993994 and ... - NIH
    GWAS have identified numerous SNPs associated with prostate cancer risk. One such SNP is rs10993994. It is located in the β-microseminoprotein (MSMB) promoter ...
  79. [79]
    (PDF) Single nucleotide polymorphisms and cancer susceptibility
    Aug 6, 2025 · SNPs in the TATA box affect promoter activity with A to C ... (SNPs) that are associated with prostate cancer risk. However, the vast ...
  80. [80]
    Targeting Transcription Through Inhibition of TBP - Oncotarget
    Feb 1, 2011 · A common target of these compounds is the inhibition of the TATA binding protein which helps position Pol II along the transcription start site ...<|control11|><|separator|>
  81. [81]
    [PDF] Therapeutic potential of TAF1 bromodomains for cancer treatment
    Aug 17, 2018 · TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs ...
  82. [82]
    New deep learning algorithm predicts effects of rare genetic variants
    Sep 25, 2024 · The method allows persons with high risk of disease to be distinguished more precisely and facilitates the identification of genes that are ...Missing: TATA | Show results with:TATA
  83. [83]
    New AI Model Predicts Gene Variants' Effects on Specific Diseases
    Mar 24, 2025 · Cedars-Sinai investigators have developed DYNA, an AI model that accurately identifies harmful genetic mutations for precise diagnoses and ...