Fact-checked by Grok 2 weeks ago

Overlapping gene

An overlapping gene is a genomic region in DNA or RNA that encodes multiple distinct proteins or functional RNAs by sharing nucleotide sequences, typically through translation in different reading frames, use of alternative initiation codons, or transcription from complementary strands. These genes are defined by the overlap of at least one nucleotide between the coding sequences (CDSs) of two or more genes, enabling efficient use of genetic material in compact genomes. Overlapping genes can be unidirectional (same transcriptional direction), bidirectional (opposite directions), or nested (one gene embedded within another), and they occur across viruses, prokaryotes, and eukaryotes. Overlapping genes are particularly prevalent in viral genomes, where they facilitate extreme compaction; for instance, the bacteriophage phiX174 contains multiple overlapping genes within its small 5,386-base-pair genome, a discovery that highlighted their role in encoding up to 11 proteins from limited sequence. In prokaryotes like bacteria, short overlaps of 1–5 nucleotides are common, especially in phase 2 of bacterial genomes, and they contribute to mutational robustness by allowing coordinated expression of related functions. Eukaryotic genomes also harbor overlapping genes, though they are less frequent and often involve tail-to-tail (66.42%) or head-to-head (30.81%) configurations in humans, with hundreds of such pairs identified in mammalian species. Notable eukaryotic examples include the overlapping tRNA genes in the human mitochondrial genome, such as tRNAIle and tRNAGln sharing a 3-nucleotide sequence (5′-CTA-3′ and 5′-TAG-3′), and nested genes in Drosophila where one gene resides within an intronic region of another. Evolutionarily, overlapping genes promote genome efficiency by maximizing coding density and providing flexibility for de novo gene emergence, often under positive or purifying selection to maintain functional integration. Proteins encoded by overlapping genes exhibit distinct sequence compositions compared to those from non-overlapping genes, with enrichment in high-degeneracy amino acids like arginine, serine, and proline, and depletions in low-degeneracy ones, which may reduce evolutionary constraints. In viruses such as SARS-CoV-2, overlapping genes like ORF3a evolve dynamically to enhance adaptability, underscoring their role in rapid viral evolution. While only six experimentally verified mammalian overlapping protein-coding genes exist, ribosome profiling techniques like Ribo-seq have revealed many more alternative open reading frames (ORFs) in eukaryotic transcriptomes, suggesting underappreciated prevalence.

Definition and Classification

Definition

An overlapping gene is a genomic feature where the coding sequences (CDS) of two or more genes share at least one nucleotide in the DNA or RNA sequence, enabling the production of multiple proteins or functional RNAs from the same stretch of genetic material. This arrangement contrasts with non-overlapping genes, whose CDS are entirely distinct and do not share nucleotides, thereby allowing overlaps to achieve greater informational density without relying on pseudogenes or non-coding regulatory elements. Such sharing is particularly prevalent in genomes with limited space, such as those of viruses and prokaryotes. Overlaps can occur on the same DNA strand, known as sense overlaps, where genes are translated in different reading frames—typically shifted by +1 or +2 nucleotides relative to each other—or on opposite strands, referred to as antisense overlaps, where one gene is encoded on the complementary strand. In sense overlaps, the frame shift ensures that the codons for each protein are distinct despite the nucleotide overlap, while antisense overlaps involve transcription from both strands, often producing complementary RNA molecules. These configurations allow for the simultaneous expression of multiple functional products from a single genomic locus. The concept of overlapping genes was first identified during the sequencing of the bacteriophage φX174 in 1977, where multiple protein-coding regions were found to share nucleotides within the compact viral . This discovery, achieved through pioneering methods, initially identified nine proteins encoded within the approximately 5,375-nucleotide through such overlaps; subsequent refinements determined the as 5,386 encoding 11 proteins.

Types of Overlaps

Overlapping genes can be classified by their topological arrangements, which describe how their coding sequences (CDS) or expressible regions interact spatially on the DNA strand. Fully overlapping genes share their entire CDS, meaning one gene's sequence completely encompasses the other's, often resulting in two proteins encoded from the same nucleotides but in different reading frames. Partially overlapping genes share only a subset of their CDS, typically at the 5' or 3' ends, allowing for independent transcription starts or stops while conserving a common region. Nested overlaps occur when one gene is entirely contained within the boundaries of another, such as an internal open reading frame (ORF) embedded in the introns or exons of a host gene. Convergent overlaps feature genes on opposite strands that share their 3' ends (tail-to-tail arrangement, →←), while divergent overlaps share their 5' ends (head-to-head, ←→), enabling bidirectional transcription from a common promoter region. Overlaps can further be distinguished by whether they involve protein-coding or RNA-coding elements. Protein-coding overlaps typically utilize alternative open reading frames (altORFs), where a secondary is embedded within the primary 's sequence, often shifted by one or two to avoid the main frame's stop codons and produce a distinct polypeptide. In contrast, RNA-coding overlaps involve non-protein-coding transcripts, such as microRNAs (miRNAs) or long non-coding RNAs (lncRNAs), that are transcribed from regions overlapping protein-coding genes, for instance, miRNAs hosted within the introns of a primary without altering its . These distinctions highlight how overlaps can encode both translated products and regulatory RNAs from shared genomic space. The extent of overlap varies widely, ranging from as little as 1 to the full length of the shorter , influencing the degree of sequence constraint between the genes. Frame differences are critical in same-strand overlaps, with common offsets of +1 or +2 relative to the primary frame; for example, +1 frame overlaps are prevalent in genomes, where a downstream initiates one after the upstream gene's start, maximizing coding density in compact sequences. These frame shifts ensure that start and stop codons align appropriately without premature termination in either ORF. A notable example of an overlapping gene pair in humans is , which encodes the catalytic subunit of polymerase, and POLGARF, an alternative ORF that overlaps extensively with POLG's in a +1 frame, initiating at a CUG codon within the primary mRNA to produce a distinct protein. This configuration demonstrates a sense-strand overlap where both genes are translated from the same transcript.

Evolutionary Aspects

Origins and Mechanisms

Overlapping genes emerge primarily through overprinting, in which a new (ORF) arises within the of an existing by exploiting an alternative , enabling the same stretch to encode multiple proteins. This process often begins with point mutations that introduce a viable or eliminate a in the secondary frame, as observed in genomes where compact organization favors such innovations. Another key mechanism is followed by a frameshift, where a copy of an existing undergoes a reading frame alteration—typically a +1 or +2 shift—leading to partial overlap with the ancestral while preserving partial identity. De novo emergence from non-coding regions involves successive mutations that generate a functional ORF overlapping adjacent , often activated by nearby regulatory elements. Additionally, retrotransposition contributes by integrating RNA-derived into introns or exons of existing , creating nested or antisense overlaps, particularly in eukaryotic genomes. Mutations play a central role in initiating and stabilizing these overlaps. Insertions and deletions (indels) frequently cause frameshifts that disrupt one while potentially creating a new functional ORF in an frame, as seen in bacterial and . Once formed, these overlapping configurations are maintained by purifying selection, which eliminates deleterious variants but retains those conferring adaptive advantages, such as enhanced regulatory control or compaction. In viruses, this selection is particularly stringent due to small sizes and high rates, preserving overlaps that optimize coding capacity. The historical recognition of overlapping genes began in the with early viral examples, such as the identification of multiple ORFs in the bacteriophage ΦX174 , marking the first confirmed case of dual-coding sequences. Prokaryotic overlaps were documented in the 1980s, with systematic reviews revealing their prevalence in bacterial operons, such as overlaps in termination and initiation sites. In eukaryotes, overlapping alternative ORFs (altORFs) gained prominence in the 2010s through and , uncovering thousands of translated overlaps previously overlooked in annotation efforts.

Advantages and Constraints

Overlapping genes confer evolutionary advantages by enhancing evolvability in compact genomes, where a single change can simultaneously alter proteins from multiple reading frames, enabling rapid acquisition of new functions. This dual-impact potential facilitates under selective pressure, as seen in viruses where overlaps allow for the emergence of multifunctional proteins without expanding . For example, in HIV-1, the overlapping env and rev genes exhibit positive selection (dN/dS > 1 in certain motifs), promoting functional segregation that purges unfit combinations and boosts viral population fitness. Another advantage is increased genetic robustness and protection against deleterious , as overlaps distribute functional constraints across frames, making the more resilient to errors. This buffers against loss-of-function , particularly in high-mutation-rate environments like , where overlaps maintain essential activities through compensatory changes. Theoretical models demonstrate that such antiredundancy in overlaps heightens overall genomic by linking the fates of co-encoded proteins. Despite these benefits, overlapping genes face substantial constraints due to mutational , where alterations in shared sequences disrupt both genes, restricting and imposing stricter evolutionary pressures. Reduced codon flexibility in overlaps often results in biases, as synonymous choices must satisfy dual coding requirements, limiting for optimization. Overlaps typically experience heightened purifying selection to conserve functionality, evidenced by lower substitution rates in fourfold degenerate sites compared to non-overlapping regions (P < 0.001). Empirical studies highlight these trade-offs: in viruses like HIV, overlaps show signatures of positive selection (e.g., dN/dS > 1) driving adaptive evolution, while prokaryotic overlaps often display neutral dynamics or stronger purifying selection (dN/dS < 1), with high turnover rates indicating constraints on long-term persistence. In bacterial genomes, the prevalence of specific overlap phases (e.g., +2 frames) reflects minimizing interference, yet overall, overlaps evolve more slowly than non-overlapping genes due to these limitations.

Taxonomic Distribution

In Viruses

Overlapping genes are particularly prevalent in viral genomes, which are often highly compact to fit within constrained capsid sizes. An analysis of 5,976 reference viral genomes from the NCBI Virus database revealed that 53% contain at least one gene overlap exceeding 50 nucleotides, with single-stranded DNA viruses showing the highest prevalence at 65%, followed by double-stranded DNA viruses at 61% and positive-sense single-stranded RNA viruses at 43%. In extremely compact genomes, such as those of bacteriophage φX174—a single-stranded DNA virus with a 5,386 bp genome encoding 11 genes—overlaps affect eight of the genes, allowing maximal coding efficiency in a minimal space. Similarly, positive-sense single-stranded RNA viruses in the family Picornaviridae exhibit high overlap frequencies, contributing to their dense genomic organization. Notable examples include the human immunodeficiency virus (HIV-1), where the tat and genes overlap, enabling coordinated regulation of viral transcription and RNA export through multifunctional proteins. In severe acute respiratory syndrome coronavirus 2 (), the ORF1a and ORF1b open reading frames overlap via a -1 mechanism, producing polyproteins that are cleaved into 16 nonstructural proteins essential for replication. Recent discoveries in the 2020s have identified additional novel overlapping genes in coronaviruses, such as ORF3d in , which emerges from an alternative frame and may enhance viral fitness. These overlaps confer adaptive advantages in viruses, facilitating rapid evolution by enabling de novo gene creation without genome expansion and promoting immune evasion through multifunctional proteins that suppress host defenses. For instance, in asymmetric overlaps, the newer frame often evolves under positive selection to encode accessory proteins that boost pathogenicity, as seen in various RNA viruses where overlaps stabilize under purifying selection post-origin. This strategy allows viruses to maintain genetic stability while adapting to host pressures, with overlaps arising primarily through overprinting of existing genes.

In Prokaryotes

Overlapping genes are a common feature in prokaryotic genomes, with approximately one-third of all genes participating in overlaps across diverse bacterial and archaeal species. These overlaps frequently occur at boundaries, facilitating coordinated regulation and expression. In bacterial genomes, such as those of , short overlaps of 4–50 base pairs are prevalent, often involving essential operons for metabolic pathways. A notable example is the menaquinone biosynthesis operon in E. coli, where three consecutive genes exhibit short stop-start overlaps that promote translational coupling, ensuring stoichiometric production of enzymes in the electron transport chain. Similarly, in Bacillus subtilis, ribosomal protein genes like rpsO and neighboring loci display overlaps that support ribosome assembly by linking translation initiation of downstream genes to upstream termination. These configurations are particularly abundant in minimal genomes, such as those of Mycoplasma pneumoniae, where overlaps constitute up to 10% of gene pairs, aiding genome compaction while maintaining essential functions. Functionally, these overlaps enable coordinate expression in operons critical for metabolic pathways, such as and ribosomal biogenesis, by allowing re-initiation without dedicated intergenic regions. In compact prokaryotic genomes, this arrangement imposes evolutionary constraints, favoring short overlaps to balance mutation rates and regulatory precision. Advanced proteomic and have revealed previously undetected long overlaps, such as a 603 bp nested overlap in the E. coli ompA gene with pH-regulated function. These studies highlight overlaps' role in enhancing expression efficiency under nutrient limitation, particularly in fast-growing .

In Eukaryotes

Overlapping genes are considerably rarer in eukaryotic genomes compared to prokaryotes and viruses, comprising less than 1% of protein-coding genes in humans when considering strictly translated overlapping reading frames, though broader definitions including antisense overlaps can reach up to 10% of genes in human and mouse genomes. These overlaps frequently occur within untranslated regions (UTRs), introns, or through alternative splicing, allowing for dual or multiple protein production from a single locus without extensive coding sequence overlap. A prominent example is the human CDKN2A locus, which encodes the tumor suppressors p16^INK4a and p14^ARF through alternative promoters, splicing, and partially overlapping reading frames, enabling distinct cell cycle regulation functions. This configuration is conserved across mammals and exemplifies how eukaryotic overlaps integrate with splicing machinery to enhance regulatory complexity in larger genomes. Additional examples include nested arrangements, such as the ribosomal protein RPL36A, which harbors an alternative overlapping (altORF) termed Alt-RPL36 within its coding sequence; this altORF produces a short that modulates the PI3K-AKT-mTOR signaling pathway, influencing cellular responses to stress and nutrient availability. In unicellular eukaryotes like (), altORFs are also prevalent, with revealing dozens of functional upstream or overlapping ORFs in genes involved in stress responses, such as those within the HSP82 chaperone transcript, contributing to diversity without disrupting the primary protein. These cases highlight how eukaryotic overlaps often rely on non-AUG initiation codons or internal ribosome entry to enable translation of secondary products. Recent advances in proteogenomics have uncovered previously overlooked eukaryotic overlapping genes, exemplified by the 2020 discovery of POLGARF, a 260-amino-acid protein encoded by a conserved CUG-initiated overlapping ORF within the human POLG mRNA, which encodes the mitochondrial DNA polymerase gamma; this overlap, detected via and , suggests roles in extracellular signaling and evolved around 160 million years ago in placental mammals. Similar proteogenomic approaches in and beyond, including CRISPR-based functional screens, have identified hundreds of altORFs and overlapping genes across eukaryotic species. In plants, emerging studies using large-scale transcriptomic and analyses have begun to reveal potential overlapping transcripts in species like , particularly cis-natural antisense pairs regulated by small RNAs, though comprehensive functional validation remains ongoing. Despite these insights, the functions of most eukaryotic overlapping genes remain understudied, with many altORFs and secondary products lacking clear phenotypic roles beyond preliminary associations with stress adaptation or signaling. Overlaps like those in and RPL36A are increasingly linked to diseases, including various cancers, where disruptions in dual-gene expression contribute to tumorigenesis through loss of tumor suppression or pathway dysregulation. This scarcity of knowledge underscores the need for integrated multi-omics approaches to elucidate their contributions to eukaryotic biology and pathology.

Biological Functions

Genome Compaction and Efficiency

Overlapping genes enable organisms to encode multiple proteins from shared sequences, thereby maximizing coding capacity within constrained genomic space. This mechanism is particularly vital for viruses and other genome size-limited entities, where every must contribute to essential functions. For instance, φX174 utilizes extensive overlaps across all three reading frames to produce 11 proteins from its compact 5,386-base-pair , demonstrating how overlaps expand protein output without proportional genome enlargement. In terms of , overlapping genes significantly boost coding density. Engineered minimal further illustrate this benefit; the synthetic bacterium JCVI-syn3.0, with its 531-kilobase-pair encoding 473 genes, relies on streamlined to achieve viability while minimizing , highlighting overlaps' in balancing and functionality. The biogenesis of these genes frequently involves polycistronic transcripts in prokaryotes and viruses, where a single mRNA carries multiple open reading frames, facilitating coordinated translation and further enhancing . Translational coupling in such transcripts ensures that the expression of downstream genes is linked to upstream ones, reducing regulatory overhead and promoting rapid in resource-limited environments.

Regulatory and Protective Roles

Overlapping genes play crucial roles in regulating through mechanisms such as translational , where the translation of an upstream (ORF) influences the efficiency of downstream gene , often repressing it to fine-tune protein levels. In , natural antisense RNAs derived from overlapping genes act as regulatory elements by base-pairing with target mRNAs, leading to degradation or translational inhibition that modulates stress responses and metabolic pathways. Eukaryotic overlapping genes can employ alternative start sites within shared sequences to generate diverse protein isoforms, enhancing functional versatility without expanding . Antisense-mediated silencing is another key regulatory function, where transcripts from overlapping genes form double-stranded RNA hybrids that recruit chromatin-modifying complexes or RNA interference machinery to suppress sense gene expression. For instance, in head-to-head overlapping configurations, RNA-DNA interactions from one transcript can block promoter access for the partner gene, providing a layer of transcriptional control observed in mammalian genomes. Non-coding antisense transcripts overlapping protein-coding genes also serve as sponges for microRNAs, indirectly stabilizing target mRNAs and creating feedback loops that sustain during development. In protective roles, overlapping buffer against loss-of-function by creating dual-essential configurations, where disruption of one impairs the overlapping partner, thereby purging deleterious variants and maintaining genomic integrity in high-mutation environments like viruses. In begomoviruses, such overlaps in essential like AC1 reduce mutation accumulation rates, providing evolutionary robustness against error-prone replication. Viral examples include potyviruses, where overlapping essential such as P3 and the polyprotein-encoded PIPO ensure that lethal to one frame are constrained by pleiotropic effects on the other, preserving . Bacterial overlapping genes contribute to anti-phage defense through toxin-antitoxin systems encoded as genes-within-genes, where the ORF overlaps the , enabling rapid activation upon to abort phage replication via host cell toxicity. This nested arrangement allows precise spatiotemporal control, as the neutralizes the under normal conditions but permits dominance during phage invasion, enhancing survival. A prominent example of regulatory overlap is found in HIV-1, where the tat and genes overlap in multiple reading frames, coordinating viral and ; mutations in the shared region disrupt Rev-mediated export of unspliced RNAs, promoting transcriptional silencing and persistence. Recent studies in 2025 have leveraged to design synthetic overlapping genes for precise regulatory control, demonstrating that deep generative models can create functional overlaps in bacterial genomes that couple translation of multiple outputs, offering tunable synthetic circuits for . These AI-engineered overlaps mimic natural regulatory logic while bypassing evolutionary constraints, enabling applications in .

Identification and Analysis Methods

Computational Methods

Computational methods for identifying and annotating overlapping genes rely on algorithmic approaches that scan genomic sequences for open reading (ORFs) in multiple , quantify overlaps, and assess functional potential through sequence conservation or bias analysis. These tools process sequences to predict candidate overlapping genes (OLGs), often integrating alignment-based searches and statistical models to distinguish true overlaps from artifacts. Key software includes OLGenie, which estimates purifying selection coefficients (dN/dS ratios) in potential OLGs using multiple sequence alignments to predict functional overlaps with low false-positive rates. OpenProt employs ORF prediction algorithms like to identify alternative ORFs (altORFs), including those overlapping genes, by scanning all reading and incorporating proteogenomic evidence for eukaryotic genomes. Additionally, GETORF, part of the suite, facilitates frame-specific ORF extraction across six reading , enabling initial detection of potential overlaps by identifying start and boundaries in non- frames. Algorithms underpinning these tools often utilize hidden Markov models (HMMs) for probabilistic ORF scanning, where states represent coding or non-coding regions, and transitions model frame shifts to tolerate overlaps in compact genomes like viruses. More recent advances incorporate . These methods extend traditional gene finders like Glimmer, which use HMMs trained on confirmed OLGs to improve sensitivity for overlapping predictions. A typical begins with input of assembled genomic sequences, followed by six-frame and of coding sequences () using tools like for detection or for profile-based searches to quantify overlap extent and conservation. Overlap regions are then scored for functionality, such as through selection analysis in OLGenie or bias detection in OpenProt, yielding annotated OLG candidates. However, these approaches can produce false positives in noisy or divergent data, such as low-coverage assemblies or highly variable viral sequences; integrating models mitigates this by filtering overlaps lacking organism-specific codon preferences. Experimental validation remains essential to confirm predicted OLGs.

Experimental Methods

Proteogenomics integrates mass spectrometry-based with genomic and transcriptomic data to detect peptides derived from alternative open reading frames (altORFs), including those overlapping annotated genes, thereby validating their in cells. In the , large-scale studies applied (MS/MS) to proteome samples, identifying novel peptides from unannotated regions that expanded the known by thousands of entries. For instance, a 2018 proteogenomics analysis of cell lines and tissues used MS/MS to discover coding regions in previously non-coding sequences, including overlapping altORFs, by matching spectra against custom databases derived from data. This approach has been pivotal in confirming the expression of small overlapping proteins in eukaryotes, with one 2024 study screening over 50,000 MS runs to identify more than 170,000 novel peptides, many from altORFs overlapping genes in tissues. Ribosome profiling, or Ribo-seq, maps ribosome-protected mRNA fragments to reveal actively translated regions, including overlapping frames that computational methods might overlook, providing direct evidence of altORF . This technique has been instrumental in genomes, where overlapping s are common; for example, Ribo-seq of SARS-CoV-2-infected cells in 2020 detected of and non-canonical ORFs, including the overlapping ORF3d within the N , with footprints accumulating at specific start sites. In eukaryotes, Ribo-seq variants have identified of upstream overlapping ORFs. These studies typically involve treating cells with inhibitors like , isolating ribosome-protected fragments, and sequencing them to quantify efficiency across frames. Functional assays confirm the biological roles of overlapping genes by perturbing one frame and observing impacts on the other, often using CRISPR-based knockouts or reporter gene fusions. CRISPR-Cas9 knockouts target specific altORFs to assess phenotypic effects, such as in a 2020 human genome-wide screen that disrupted overlapping genes and revealed dependencies in cancer cells, where loss of an altORF altered canonical protein function and cell viability. Reporter gene fusions, such as fusing overlapping ORF promoters or coding sequences to luciferase or GFP, quantify expression regulation; in bacterial systems, these have demonstrated how stop-start overlaps control translation efficiency, while in viruses like turnip yellow mosaic virus, dual-reporter constructs showed coordinated expression of nested overlapping ORFs. These assays highlight interdependencies, as mutating one overlapping gene often modulates the other's stability or activity. Recent advances in single-cell Ribo-seq, emerging around 2023, enable detection of altORF translation in individual eukaryotic , addressing heterogeneity in overlapping gene expression that bulk methods miss. Techniques like nanoRibo-seq, optimized for low-input samples, have revealed cell-type-specific translation of upstream altORFs in neuronal genes, with pausing patterns indicating regulatory roles. A 2021 Nature Reviews Genetics article on overlapping genes emphasized these methods' potential, citing examples from and viral systems where single-cell resolution uncovered dynamic altORF activity during stress or infection, paving the way for tissue-specific functional studies.

References

  1. [1]
    Overlapping genes in natural and engineered genomes
    Oct 5, 2021 · In this Review, we define a gene overlap in eukaryotes when at least one nucleotide is shared between the outermost boundaries of the primary ...
  2. [2]
    Overlapping Gene - an overview | ScienceDirect Topics
    Overlapping genes refer to segments of DNA that code for more than one gene product by utilizing different reading frames or initiation codons, and can also ...
  3. [3]
    Overlapping genes and the proteins they encode differ significantly ...
    Oct 19, 2018 · Overlapping genes, also called “dual-coding genes”, are regions of DNA or RNA that are translated in two different reading frames to yield two ...Missing: definition | Show results with:definition
  4. [4]
    Overlapping protein-coding genes in human genome and their ...
    Sep 16, 2019 · Overlapping genes are defined as chromosomal locations of two adjacent gene ... Molecular biology and evolution 32, 1748–1766, https://doi ...
  5. [5]
    Overlapping genes: a window on gene evolvability - PubMed Central
    Aug 27, 2014 · Overlapping genes (genes partially or entirely overlapping) represent a genomic feature that is shared widely across biological organisms ...
  6. [6]
    Same-strand overlapping genes in bacteria - Biology Direct
    Aug 21, 2008 · Overlapping-gene pairs, in which the overlap sequence is of length one to five bases (short overlaps), are abundant in phase 2, but rare in ...Reviewers' Comments · Reviewer's Report 2 · Reviewer's Report 3
  7. [7]
    Nucleotide sequence of bacteriophage φX174 DNA - Nature
    Feb 24, 1977 · A DNA sequence for the genome of bacteriophage φX174 of approximately 5,375 nucleotides has been determined using the rapid and simple 'plus ...Missing: first discovery
  8. [8]
    Frederick Sanger (1918–2013) - Cell Press
    In 1977, he devised an ingenious DNA sequencing method that has revolutionized ... ΦX174. In this study, the presence of overlapping genes was discovered.
  9. [9]
    Overlapping genes in natural and engineered genomes - PMC - NIH
    Oct 5, 2021 · The remaining two topologies occur between genes on opposite strands and are called convergent (→←) and divergent (←→) (Fig. 1b). ...
  10. [10]
    Overlapping genes in the human and mouse genomes
    Apr 14, 2008 · Based on the relative orientations of the genes involved in overlap, overlapping genes can be further classified into divergent (←→), convergent ...<|control11|><|separator|>
  11. [11]
    Classes of overlapping genes. OGC classification was based on the...
    Convergent overlaps involve the 3' termini of both genes, while divergent overlaps involve the 5' ends (UTR and/or CDS). Complete overlap occurs when the entire ...
  12. [12]
    The dark proteome: translation from noncanonical open reading ...
    Overlapping ORFs can encode extended or truncated variants of canonical proteins. In-frame overlapping ORFs, which present either as truncations, extensions, ...Missing: altORFs miRNAs
  13. [13]
    Exon and intron sharing in opposite direction-an undocumented ...
    Oct 5, 2021 · Overlapping genes share same genomic regions in parallel (sense) or anti-parallel (anti-sense) orientations. These gene pairs seem to occur ...Missing: altORFs | Show results with:altORFs
  14. [14]
    Origin, Evolution and Stability of Overlapping Genes in Viruses - MDPI
    Overlapping genes represent an unusual pattern of the genetic language [75,76], as two, or exceptionally three, reading frames may lie inside a single ...<|control11|><|separator|>
  15. [15]
    Using networks to analyze and visualize the distribution of ...
    Our results show that while the number of OvRFs increases with genome length, they tend to be shorter in longer genomes. The majority of overlaps involve +2 ...
  16. [16]
    Evidence for a novel overlapping coding sequence in POLG initiated ...
    Mar 6, 2020 · We provide evidence for a novel coding sequence, ORF-Y, that overlaps the POLG ORF. Ribosome profiling and mass spectrometry data show that ORF-Y is expressed.Missing: POLGARF | Show results with:POLGARF
  17. [17]
    Origin, Evolution and Stability of Overlapping Genes in Viruses
    Overprinting is a process in which critical nucleotide substitutions in a pre-existing gene can induce the expression of a novel protein by translation of an ...
  18. [18]
    Overlapping genes: a window on gene evolvability - BMC Genomics
    Aug 27, 2014 · Overlapping genes (genes partially or entirely overlapping) represent a genomic feature that is shared widely across biological organisms ...
  19. [19]
    Why genes overlap in viruses | Proceedings of the Royal Society B
    Jul 7, 2010 · The genomes of most virus species have overlapping genes—two or more proteins coded for by the same nucleotide sequence.Missing: primary | Show results with:primary<|control11|><|separator|>
  20. [20]
    Searching for frameshift evolutionary relationships between protein ...
    Nov 1, 1999 · The frameshift conditional probability matrix, Pfs, accounts for the conversion of a particular codon for amino acid i into amino acid j in a ...
  21. [21]
  22. [22]
    [PDF] Purifying and directional selection in overlapping prokaryotic genes
    TRENDS in Genetics Vol.18 No.5 May 2002 http ... Purifying and directional selection in overlapping prokaryotic genes. Igor B. Rogozin, Alexey N. ... content of ...
  23. [23]
    Properties of overlapping genes are conserved across microbial ...
    Here we show that overlapping genes are a consistent feature (approximately one-third of all genes) across all microbial genomes sequenced to date.
  24. [24]
    Initiation of Decay of Bacillus subtilis rpsO mRNA by ...
    rpsO mRNA, a small monocistronic mRNA that encodes ribosomal protein S15, was used to study aspects of mRNA decay initiation in Bacillus subtilis.Missing: rpsN | Show results with:rpsN
  25. [25]
    Evolution of Overlapping Genes: Comparative Genomics ... - J-Stage
    The genome of M. pneumoniae, on the other hand, contains 160 overlapping gene pairs. Many parts of genome sequences of these two species are homologous. There ...
  26. [26]
    A Novel pH-Regulated, Unusual 603 bp Overlapping Protein Coding ...
    Mar 19, 2020 · Two genes encoded by two different reading frames (ORFs) at the same DNA locus are termed “non-trivially overlapping genes” (OLGs) if the area ...
  27. [27]
    Overlapping genes in the human and mouse genomes - PMC
    About 10% of the genes under study are overlapping genes, the majority of which are different-strand overlaps. The majority of the same-strand overlaps are ...
  28. [28]
  29. [29]
  30. [30]
  31. [31]
  32. [32]
  33. [33]
    A high-resolution map of bacteriophage ϕX174 transcription
    Sequencing solved this mystery by showing that 8 of the 11 genes in ϕX174 overlap, thereby increasing the available genetic encoding capacity beyond the 5386 nt ...
  34. [34]
    Design and synthesis of a minimal bacterial genome
    ### Summary of Overlapping Genes and Related Metrics in JCVI-syn3.0
  35. [35]
    Synthetic translational coupling element for multiplexed signal ...
    Nov 11, 2024 · Here, we introduce a modular synthetic translational coupling element (synTCE) and integrate this design with de novo designed riboregulators, toehold switches.
  36. [36]
    Natural antisense RNAs as mRNA regulatory elements in bacteria
    Jul 28, 2016 · This review focuses on known cases of antisense RNA control in prokaryotes and provides an overview of some natural RNA-based mechanisms that bacteria use to ...
  37. [37]
    Genome-wide natural antisense transcription: coupling its regulation ...
    The regulatory mechanisms by which NATs act are diverse, as are the means to control their expression. Here, we review the current understanding of NAT function ...
  38. [38]
    Mechanism of expression regulation of head-to-head overlapping ...
    Mar 8, 2025 · The transcripts of overlapping genes can regulate transcription at the level of RNA–DNA and RNA–RNA interactions. Examples of the first one are ...
  39. [39]
    miRNA‐dependent gene silencing involving Ago2‐mediated ...
    Sep 30, 2011 · This study provides the first evidence for non‐coding antisense transcripts as functional miRNA targets, and a novel regulatory mechanism ...<|separator|>
  40. [40]
    Gene Overlapping as a Modulator of Begomovirus Evolution - NIH
    Feb 4, 2022 · ... buffering effects of gene overlapping on the accumulation of mutations on AC1. ... Mutation and RNA viruses. Trends Ecol. Evol. 2008;23:188 ...Missing: protective dual-
  41. [41]
    An overlapping essential gene in the Potyviridae - PNAS
    Mutations that knock out expression of the PIPO protein in Turnip mosaic potyvirus but leave the polyprotein amino acid sequence unaltered are lethal to the ...Missing: protective buffering
  42. [42]
    Toxic antiphage defense proteins inhibited by intragenic antitoxin ...
    Here, we report that these proteins are toxin–antitoxin systems, comprised of genes-within-genes, that combat phage infection.<|separator|>
  43. [43]
    Functional Segregation of Overlapping Genes in HIV - ScienceDirect
    Dec 15, 2016 · For instance, high polymerase error rates favor short genomes thereby decreasing the probability of catastrophic mutations, while the viral ...Missing: protective roles
  44. [44]
    Mir125b-2 imprinted in human but not mouse brain regulates ...
    Mar 14, 2023 · Our findings demonstrate MIR125B2 imprinted in human but not mouse brain, mediated learning, memory, and anxiety, regulated excitability and synaptic ...
  45. [45]
    Design of overlapping genes using deep generative models of ...
    May 7, 2025 · The pairwise product of the two amino acid probability vectors at the masked overlapping positions gives a joint probability matrix, which is ...Missing: viable | Show results with:viable
  46. [46]
    Overlapping Genes, Unfolding Insights: Synthetic Biology Meets AI
    May 22, 2025 · Synthetic overlapping genes can encode distinct, functional protein folds, revealing new possibilities for compact genetic circuit design.
  47. [47]
    OLGenie: Estimating Natural Selection to Predict Functional ...
    Apr 3, 2020 · OLGenie can be used to study known OLGs and to predict new OLGs in genome annotation. Software and example data are freely available.
  48. [48]
    OpenProt 2021: deeper functional annotation of the coding potential ...
    Nov 12, 2020 · Using such algorithms, OpenProt aims to insert ORFs shorter than 30 codons while avoiding spurious annotations. Furthermore, new tools and ...
  49. [49]
    Discovery of coding regions in the human genome by integrated ...
    Mar 2, 2018 · Proteogenomics enable the discovery of novel peptides (from unannotated genomic protein-coding loci) and single amino acid variant peptides ...
  50. [50]
  51. [51]
    A Massive Proteogenomic Screen Identifies Thousands of Novel ...
    Using a proteogenomic approach, we screened 50,000 mass spectrometry runs to identify novel proteins translated from five million transcripts in the GTEx ...Missing: altORFs | Show results with:altORFs
  52. [52]
    The coding capacity of SARS-CoV-2 - Nature
    Sep 9, 2020 · Two overlapping ORFs, ORF1a and ORF1b, are translated from the positive-strand genomic RNA and generate continuous polypeptides, which are ...
  53. [53]
    Dynamically evolving novel overlapping gene as a factor in ... - eLife
    Oct 1, 2020 · A novel, overlapping, putatively functional gene in SARS-CoV-2, ORF3d, is absent from close relatives of SARS-CoV-2 and may have contributed ...
  54. [54]
  55. [55]
  56. [56]
    Expression of the Two Nested Overlapping Reading Frames of ... - NIH
    In paired reporter constructs, the 5′ 217 nucleotides harboring the UTR and the first 43 or 41 codons of the two overlapping TYMV open reading frames (ORFs), ...
  57. [57]
    Development of nanoRibo-seq enables study of regulated&nbsp
    Aug 24, 2023 · nanoRibo-seq shows extensive translation of upstream open reading frames in key synaptic and axonal genes. Froberg et al., 2023, Cell Reports 42 ...Missing: altORFs | Show results with:altORFs