Fact-checked by Grok 2 weeks ago

Reading frame

In molecular biology, a reading frame is one of the three possible ways to partition a linear nucleotide sequence from DNA or messenger RNA (mRNA) into consecutive, non-overlapping triplets known as codons, where each codon specifies an amino acid or a stop signal during protein translation. These frames arise because the sequence can begin grouping at the first, second, or third nucleotide position, potentially yielding entirely different amino acid sequences from the same nucleotides; for double-stranded DNA, there are six possible reading frames (three per strand). The biologically relevant frame is typically established by the start codon AUG, which initiates translation and sets the phase for subsequent codons until a stop codon (UAA, UAG, or UGA in mRNA) is reached. A continuous stretch of codons without an intervening within a defined reading frame constitutes an open reading frame (ORF), which represents a potential protein-coding region in a . In prokaryotes, often begins at the first AUG in the correct frame, while in eukaryotes, additional regulatory elements like the 5' cap and Kozak sequence help ensure accurate frame selection. Reading frames are crucial for in bioinformatics, where computational tools scan genomic sequences for long ORFs as indicators of functional genes. Disruptions to the reading frame, such as insertions or deletions of not in multiples of three, cause frameshift mutations that shift the codon grouping downstream, often introducing premature stop codons and producing truncated or aberrant proteins. These mutations are a common cause of genetic disorders and underscore the precision required in translational fidelity to maintain protein function.

Definition and Basics

Core Concept

A reading frame is defined as a method of dividing the sequence of in DNA or mRNA into consecutive, non-overlapping triplets known as codons, with three possible frames arising from different starting positions in the sequence. This partitioning ensures that the genetic information is read in groups of three , reflecting the triplet structure of the . The existence of three reading frames stems from the triplet nature of the code: translation can begin at the first nucleotide (frame 1), the second (frame 2), or the third (frame 3), with each shift altering the grouping of subsequent nucleotides. For instance, in the short DNA sequence ATGCGTACG:
  • Frame 1 reads as ATG-CGT-ACG
  • Frame 2 reads as TGC-GTA-CG
  • Frame 3 reads as GCG-TAC-G
These frames link to amino acids via the genetic code during translation, but only one is typically used in a given mRNA. The concept of the reading frame emerged in the early 1960s amid efforts to decipher the , particularly through experiments on frameshift mutations by , , and colleagues. Their 1961 work demonstrated how insertions or deletions could shift the frame, providing key evidence for the triplet code.

Role in Translation

During translation, ribosomes process mature mRNA transcripts in the by reading the sequence from the 5' to 3' direction in groups of three , known as codons, within a specific reading frame to synthesize proteins according to the . This process initiates when the small ribosomal subunit, along with initiation factors, binds to the mRNA: in eukaryotes, near the 5' cap and scans to the AUG; in prokaryotes, directly via base-pairing of the Shine-Dalgarno sequence with the 16S rRNA. The AUG sets the reading frame by pairing with a special initiator tRNA carrying (in prokaryotes) or (in eukaryotes). The large ribosomal subunit then joins to form the complete , positioning the initiator tRNA in the and beginning elongation, where subsequent codons dictate the addition of to the growing polypeptide chain. Maintaining the correct reading frame is essential for preserving the protein's primary sequence, as any misalignment disrupts the codon groupings and leads to a garbled product that is typically nonfunctional or prematurely terminated. For instance, in a hypothetical gene's mRNA segment AUG-UUA-GAA-UUC (translating to Met-Leu-Glu-Phe in the correct frame, forming a functional protein ), a +1 shift would regroup it as UGU-UAG-AAU-UC, yielding Cys followed by a , leading to a truncated protein that abolishes the protein's . Such frame ensures that the encoded protein adopts its proper three-dimensional structure and performs its cellular role without errors accumulating from the initiation point onward. At the cellular level, the and tRNAs enforce reading frame maintenance through precise molecular interactions during elongation. The 's three tRNA-binding sites—A (aminoacyl), P (peptidyl), and E (exit)—accommodate tRNAs whose anticodons base-pair strictly with consecutive mRNA codons, with translocation catalyzed by EF-G (in prokaryotes) or (in eukaryotes) advancing the mRNA by exactly three nucleotides per cycle to prevent slippage. This coordinated mechanism, supported by the 's peptidyl transferase center, links sequentially while upholding the frame established at , thereby safeguarding translational accuracy across the entire .

The Genetic Code

Codon Structure

In , a codon is defined as a sequence of exactly three that serves as the fundamental unit of genetic information during protein synthesis. The triplet nature of the was experimentally established through frameshift experiments in T4, which showed that insertions or deletions of in multiples of three restore , confirming the code's triplet, non-overlapping . Studies using synthetic polynucleotides in cell-free systems helped assign specific codons to . Within a given reading frame, codons are read sequentially from the (mRNA), starting from a defined point, ensuring that the genetic message is translated without gaps or ambiguities in alignment. The triplet nature of the codon is nearly across all known organisms, from to eukaryotes. When the was first elucidated in the , it was found to be identical in phylogenetically diverse lineages such as metazoa, fungi, and . This universality underscores the shared evolutionary origin of the genetic machinery, with minor exceptions in certain organelles and microbes representing derived variations rather than fundamental differences. A key feature of codon structure is its non-overlapping arrangement, where each codon is read independently, with adjacent codons sharing no and abutting directly to form a continuous chain along the mRNA. This was demonstrated through frameshift experiments in bacteriophage T4, which showed that insertions or deletions of in multiples of three restore function, confirming the code's triplet, non-overlapping organization. For instance, consider a hypothetical 9- mRNA sequence: AUGCCGGUA. In the first reading frame, it divides into three codons: , CCG, and , each comprising distinct triplets without overlap. The reading frame thus acts as the organizational framework that partitions the sequence into these discrete codon units for accurate .

Frame Dependency

The choice of reading frame fundamentally determines how a nucleotide sequence is interpreted during , as it dictates the grouping of into codons. For a given (mRNA) sequence, shifting the starting position by one or two alters the triplet boundaries, resulting in completely different codon sets and, consequently, distinct amino acid sequences in the resulting polypeptide. This frame-specific decoding ensures that only the correct frame, typically established by the , produces the functional protein, while alternative frames often yield non-functional or aberrant products. The standard genetic code, which is nearly universal across organisms, comprises 64 possible triplets (codons) derived from the four nucleotides (A, U, G, C), with 61 codons specifying one of the 20 amino acids and the remaining three (UAA, UAG, UGA) acting as stop signals that terminate translation. A change in reading frame can drastically alter codon identities; for example, the sequence AUGA is read as AUG (encoding methionine) in frame 1, but as UGA (a stop codon) in frame 2, potentially halting protein synthesis prematurely. Such shifts demonstrate how even a single nucleotide offset can redirect the decoding process, emphasizing the precision required in ribosomal reading. The degeneracy of the genetic code—where multiple codons (often differing in the third position) encode the same amino acid—can mitigate some effects of frame shifts by allowing recoding within synonymous groups, thereby partially preserving protein function in certain contexts. However, shifts more frequently produce non-synonymous codons, leading to amino acid substitutions or premature stops that disrupt protein structure and activity. This interplay between degeneracy and frame sensitivity contributes to the code's overall robustness against translational errors, as evidenced by evolutionary optimizations that minimize deleterious outcomes from such shifts. To illustrate, consider the mRNA sequence GGGAAACCC. In reading frame 1, it is parsed as GGG-AAA-CCC, translating to glycine-lysine-proline. In frame 2, beginning at the second , it becomes GGA-AAC-CCC, yielding glycine-asparagine-proline instead. These variations underscore how frame dependency can transform a seemingly identical sequence into proteins with different functional properties.

Reading Frames in Sequences

Three Forward Frames

In , the three forward reading frames refer to the three possible ways to partition a DNA or sequence on the sense ( into consecutive triplets, or codons, read in the 5' to 3' direction, beginning at position 1 (frame +1), position 2 (frame +2), or position 3 (frame +3). These frames are essential for identifying potential start points in genomic sequences, as the correct frame determines the sequence of any encoded protein. To analyze these frames, one can manually group into sets of three starting from each offset position, translating each codon using the standard to reveal potential protein sequences. Computationally, tools such as NCBI's ORF Finder scan the sequence across all three forward frames, translating from start codons (typically ATG) until a is encountered, to predict protein-coding regions without prior . This approach is widely used in genome assembly and pipelines to detect coding potential in uncharacterized DNA. The three forward frames overlap extensively, with adjacent frames sharing up to two per codon boundary; for instance, the second and third of a codon in frame +1 align with the first and second of the corresponding codon in frame +2. In a long of length L, each frame accommodates approximately L/3 complete codons, covering nearly the entire modulo 3, though the exact number varies slightly by starting position and length. Consider a hypothetical 12-nucleotide forward DNA sequence: 5'-ATGCGTACGCTA-3'. This can be parsed as follows: Such parsing highlights how shifts in frame yield entirely different codon sequences from the same nucleotides, underscoring the frame's role in translation specificity.

Reverse Complement Frames

The reverse complement of a DNA sequence is formed by first reversing the order of its nucleotides and then substituting each base with its complementary partner: adenine (A) pairs with thymine (T), and guanine (G) pairs with cytosine (C). This process yields the sequence of the antiparallel strand read in the conventional 5' to 3' direction, which is essential for analyzing the antisense strand in double-stranded DNA. Like the forward strand, the reverse complement is partitioned into three reading frames, each offset by one nucleotide, allowing translation to begin at different starting positions. These frames, often labeled as -1, -2, and -3, enable the identification of potential protein-coding regions on the opposite strand. In prokaryotic genomes, the reverse complement frames are particularly relevant due to the prevalence of overlapping genes, where the same DNA segment encodes multiple proteins by utilizing different reading frames on complementary strands. Such overlaps contribute to genomic compactness, with studies showing that approximately one-third of genes in microbial genomes overlap, often involving frames from both strands. This arrangement is less common in eukaryotes but underscores the need to examine all six reading frames (three forward and three reverse) for comprehensive gene annotation in bacteria. Genes in both prokaryotes and eukaryotes can be transcribed from either the or antisense strand, depending on the orientation of regulatory elements like promoters, which dictates the template strand used by . Consequently, analyzing reading frames on the reverse complement is necessary to capture genes oriented in the opposite direction, ensuring no potential is overlooked during interrogation. Bioinformatics tools facilitate the generation and analysis of reverse complement frames; for example, the NCBI algorithm computes the reverse complement internally and translates query sequences in all six frames for database searches, aiding in the detection of homologous proteins. The reverse frames are notated as -1 (starting from the first of the reverse complement), -2 (second nucleotide), and -3 (third nucleotide), providing a standardized way to distinguish them from the forward frames (+1, +2, +3). Dedicated reverse complement converters, such as those integrated into platforms, automate this process for manual verification. To illustrate, consider the forward DNA sequence 5'-ATGCCGTAGCTA-3' (12 nucleotides). Its reverse complement is calculated by complementing each base (A↔T, G↔C) and reversing the order, yielding 5'-TAGCTACGG CAT-3' (noting the 12th base as T for completeness in pairing). The three reverse complement frames are then:
  • Frame -1: Starting at position 1:
    TAG CTA CGG CAT
    Codons: TAG (), CTA (), CGG (), CAT ().
  • Frame -2: Starting at position 2:
    AGC TAC GGC AT-
    Codons: AGC (serine), TAC (), GGC (); incomplete at end.
  • Frame -3: Starting at position 3:
    GCT ACG GCA T--
    Codons: GCT (), ACG (), GCA (); incomplete.
This example highlights how offsets in the reverse complement can yield distinct codon sequences, potentially revealing alternative coding regions when compared to the forward frames.

Open Reading Frames

Identification Criteria

An open reading frame (ORF) is defined as a continuous sequence of nucleotide triplets, known as codons, that begins with an initiator codon—typically AUG in messenger RNA (mRNA), corresponding to ATG in DNA—and terminates at one of the three stop codons (UAA, UAG, or UGA), with no intervening stop codons within the same reading frame. This definition ensures the sequence can potentially be translated into a polypeptide without premature interruption. To distinguish biologically relevant ORFs from those arising by chance, length thresholds are commonly applied during identification. In eukaryotic genomes, ORFs exceeding 100 codons are typically deemed significant, as shorter sequences are more likely to occur randomly. Statistical models further refine this by calculating the expected frequency of ORFs of various lengths in non-coding or random DNA, allowing researchers to set significance cutoffs based on probabilistic expectations. Computational algorithms facilitate ORF detection by systematically scanning sequences across multiple reading frames. The ORF Finder tool from the National Center for Biotechnology Information (NCBI) exemplifies this approach, analyzing both forward and reverse strands while incorporating parameters such as minimum ORF length, genetic code selection, and strand specificity to identify candidate frames. These tools treat the six possible reading frames of a double-stranded DNA molecule as search spaces for potential ORFs. A practical example illustrates ORF identification: consider the DNA sequence ATGAAATTTGCGTAA. In the forward reading frame starting at position 1, the codons are ATG (Met), AAA (Lys), TTT (Phe), GCG (), TAA (stop). The ORF spans 15 nucleotides from the to the end of the , comprising 5 triplets, but the is not translated, resulting in a 4-amino-acid (M-K-F-A). The encoded length is the number of codons from start to the one before stop (4 codons).

Functional Implications

Open reading frames (ORFs) serve as primary indicators of potential protein-coding genes during genome , particularly in prokaryotes where polycistronic operons allow multiple ORFs within a single reading frame to encode distinct functional proteins. In bacterial genomes, such as that of mycoides, bioinformatic identification of ORFs has enabled the annotation of hundreds of candidate genes, facilitating the design and synthesis of minimal genomes with verified essential functions. This approach contrasts with eukaryotic annotation, where introns complicate ORF prediction, but in prokaryotes, the absence of splicing allows direct mapping of ORFs to translated regions, often revealing overlapping genes that enhance coding density. Functional ORFs exhibit strong evolutionary due to selective pressure on their encoded proteins, maintaining integrity across related to preserve biochemical roles. In contrast, pseudogenes arise from duplicated or retrotransposed where disruptive mutations, such as premature stop codons, fragment ORFs, rendering them non-functional and subject to drift rather than . This distinction aids in distinguishing active genes from relics; for instance, reveals that intact ORFs in protein-coding loci show higher substitution rates only in synonymous positions, while ORFs accumulate mutations uniformly. In applications, ORFs play a crucial role in by enabling the discovery of novel genes from uncultured microbial communities, where de novo assembly identifies thousands of previously unknown protein families based on ORF clustering. For example, global surveys have uncovered over 100,000 novel metagenome protein families (NMPFs) derived from ORFs, expanding functional annotations in diverse environments like microbiomes.

Frameshift Mutations

Causes and Mechanisms

Frameshift mutations arise from the insertion or deletion of in , where the number of affected is not a multiple of three, thereby shifting the reading frame for all downstream codons. This alteration disrupts the triplet alignment essential for accurate , as the relies on non-overlapping codons read in a fixed frame from the . Common causes of frameshift mutations include slipped-strand mispairing during , particularly in regions with repetitive sequences such as microsatellites, where the nascent strand temporarily dissociates and realigns out of register, leading to additions or deletions. Errors in processes, such as imperfect mismatch repair following replication slippage, can also propagate these mutations by failing to correct the misalignment. Additionally, exposure to certain mutagens like acridines, which intercalate between DNA base pairs and distort the helix, promotes insertions or deletions during replication or repair synthesis. Frameshift mutations are classified by their net effect, such as +1 shifts from single insertions or -1 shifts from deletions, which change the reading frame from the mutation site onward. Multiple frameshifts, for instance two -1 shifts or a combination yielding a net multiple of three, can potentially restore the original frame but result in an altered amino acid sequence in the intervening region. At the molecular level, consider a single nucleotide deletion in the coding sequence; if the original sequence is ATG-CGT-TAC (encoding methionine-arginine-tyrosine), deletion of the first G yields ATC-GTT-AC, which is read as isoleucine-valine (with an incomplete codon), thus garbling the downstream protein sequence until a is encountered. This mechanism exemplifies how even a small propagates errors throughout the transcript due to the rigid triplet structure of the .

Effects on Protein Synthesis

Frameshift mutations disrupt the reading frame during , altering all downstream codons and typically leading to the synthesis of a protein with an entirely different sequence from the mutation site onward. This shift often introduces a premature shortly after the , resulting in a truncated polypeptide that is usually non-functional or degraded by cellular mechanisms such as . In humans, frameshift mutations are associated with severe phenotypic outcomes, including various genetic diseases due to the production of aberrant proteins. For instance, in , frameshift mutations such as a one-nucleotide deletion or a two-nucleotide insertion in 7 of the CFTR gene lead to non-functional chloride channels, contributing to the disease's respiratory and digestive symptoms. Similarly, frameshift mutations in other genes, like those causing Tay-Sachs disease, result in enzyme deficiencies that accumulate toxic substrates, leading to neurological deterioration. In , frameshift mutations can cause loss of antibiotic resistance by inactivating resistance genes; for example, frameshift mutations in genes encoding can inactivate the enzyme, rendering susceptible to antibiotics like . Cells have evolved rare suppression mechanisms to mitigate the effects of frameshift and restore the correct . Frameshift suppressor tRNAs, which possess altered anticodon loops to accommodate the shifted , can occasionally the during , allowing partial restoration of the protein sequence. Additionally, second-site elsewhere in the can compensate by re-establishing the original frame, though such events are infrequent and often context-specific. These suppression strategies highlight the robustness of translational fidelity but occur at low efficiency, typically insufficient to fully rescue protein function in most cases. A illustrative example of a -1 frameshift mutation involves a hypothetical gene segment where the wild-type sequence encodes a functional peptide. Consider the original mRNA sequence (with codons separated for clarity):
AUG CCC GGG UUU AAA UGA
Met Pro Gly Phe Lys Stop
This translates to a short peptide: Met-Pro-Gly-Phe-Lys. A -1 frameshift due to deletion of one nucleotide (e.g., the second C in the second codon) shifts the reading frame:
AUG CC GGU UUA AAU GA...
Met Pro Val Leu Asn ...
However, the altered frame often encounters a premature stop codon soon after, such as if the sequence becomes UUA AAU GAA (Leu-Asn-Glu), but in many cases like this, it truncates the protein at an early point, yielding only Met--Val and rendering it non-functional due to loss of essential domains. This demonstrates how even a single deletion can abolish protein activity by changing the sequence and introducing early termination.

References

  1. [1]
    Three reading frames - Educating with the Genome Browser
    20 different amino acid variations are derived from only four bases. Three different peptides can be formed from any sequence depending on the reading frame.Missing: definition | Show results with:definition
  2. [2]
    The Reading Frame, Codons and Anticodons
    The AUG start codon establishes the beginning of the reading frame on a mRNA. The ribosome must follow this reading frame to build the correct protein.
  3. [3]
    Open Reading Frame - National Human Genome Research Institute
    An open reading frame is a portion of a DNA molecule that, when translated into amino acids, contains no stop codons.
  4. [4]
    [PDF] The Genetic Code & Translation - Projects at Harvard
    ... sequence, the start codon sets the reading frame for downstream codons. The protein- coding sequence is therefore also known as an open reading frame.
  5. [5]
    What kinds of gene variants are possible?: MedlinePlus Genetics
    Nov 4, 2021 · A reading frame consists of groups of three nucleotides that each code for one amino acid . A frameshift variant occurs when there is an ...
  6. [6]
    Reading Frame - an overview | ScienceDirect Topics
    A reading frame is defined as a method of dividing nucleotide sequences (RNA or DNA) into consecutive, nonoverlapping triplets known as codons, ...
  7. [7]
    Understanding a Genome Sequence - NCBI - NIH
    The coding regions of genes are open reading frames ... Each strand has three reading frames, depending on which nucleotide is chosen as the starting position.
  8. [8]
    From RNA to Protein - Molecular Biology of the Cell - NCBI Bookshelf
    In principle, an RNA sequence can be translated in any one of three different reading frames, depending on where the decoding process begins (Figure 6-51).
  9. [9]
    Reading Frame - an overview | ScienceDirect Topics
    The reading frame rule states that the number of exons deleted is not the primary determinant of disease severity.
  10. [10]
    The dependence of cell-free protein synthesis in E. coli upon ... - PNAS
    The dependence of cell-free protein synthesis in E. coli upon naturally occurring or synthetic polyribonucleotides. Marshall W. Nirenberg and J. Heinrich ...
  11. [11]
    General Nature of the Genetic Code for Proteins
    General Nature of the Genetic Code for Proteins. F. H. C. CRICK,; LESLIE BARNETT,; S. BRENNER & … R. J. WATTS- ...
  12. [12]
    The origin of the genetic code - ScienceDirect.com
    The general features of the genetic code are described. It is considered that originally only a few amino acids were coded, but that most of the possible ...Missing: paper | Show results with:paper
  13. [13]
    How do Cells Read Genes? - Learn Genetics Utah
    Cells read genes by grouping DNA into codons (3-nucleotide bases), each specifying an amino acid, using the Universal Genetic Code. They read messenger RNA, ...Missing: definition | Show results with:definition
  14. [14]
    Genetic Codes - NCBI
    Table 11 is used for Bacteria, Archaea, prokaryotic viruses and chloroplast proteins. As in the standard code, initiation is most efficient at AUG. In addition, ...
  15. [15]
    On the Origin of Frameshift-Robustness of the Standard Genetic Code
    May 27, 2021 · For frameshift errors, we considered +1 (the reading frame is moved by one nucleotide toward the 3′) frameshifts and refer to the resultant MS ...Results · Frameshift-Robustness Is... · Random Genetic CodesMissing: interpretation | Show results with:interpretation<|control11|><|separator|>
  16. [16]
    BLAST QuickStart - Comparative Genomics - NCBI Bookshelf - NIH
    ” The translations are performed in the three forward as well as the three reverse reading frames so that no possible translation is missed. 5.1. Available ...
  17. [17]
    ORFfinder Home - NCBI - NIH
    ORF finder searches for open reading frames (ORFs) in the DNA sequence you enter. The program returns the range of each ORF, along with its protein translation.Missing: forward | Show results with:forward
  18. [18]
    An Integrative Method for Identifying the Over-Annotated Protein ...
    Sep 8, 2011 · Where, N{+0}, N{+1} and N{+2} denotes the total number of trinucleotides in the three forward reading frames, respectively. To differentiate ...
  19. [19]
    A Computer Scientist's Dictionary for Genomics
    The reverse complement of a DNA sequence is formed by reversing the letters, interchanging A and T and interchanging C and G. Thus the reverse complement of ...
  20. [20]
    Properties of overlapping genes are conserved across microbial ...
    Here we show that overlapping genes are a consistent feature (approximately one-third of all genes) across all microbial genomes sequenced to date.
  21. [21]
    Large gene overlaps in prokaryotic genomes: result of functional ...
    Jul 15, 2008 · Overlapping genes among prokaryotes represented around 17% (173,663 overlapping pairs) out of the total gene pairs contained in 338 microbial ...Missing: six | Show results with:six
  22. [22]
    From DNA to RNA - Molecular Biology of the Cell - NCBI Bookshelf
    Since DNA is double-stranded, two different RNA molecules could in principle be transcribed from any gene, using each of the two DNA strands as a template.
  23. [23]
    Reverse Complement Converter - QIAGEN
    What is a reverse complement in DNA? A reverse complement is a sequence of DNA that is complementary to the original sequence and read in the opposite direction ...<|control11|><|separator|>
  24. [24]
    Reverse Complement - Bioinformatics.org
    Reverse Complement converts a DNA sequence into its reverse, complement, or reverse-complement counterpart. You may want to work with the reverse-complement ...
  25. [25]
    Open Reading Frames - MeSH - NCBI - NIH
    An open reading frame is a sequence of nucleotide triplets read as codons specifying amino acids, starting with an initiator codon and ending with a stop codon.
  26. [26]
    The Definition of Open Reading Frame Revisited - PubMed
    The term open reading frame (ORF) is of central importance to gene finding. Surprisingly, at least three definitions are in use.
  27. [27]
    question of size: the eukaryotic proteome and the problems in ...
    ... minimum length cut-off point of 100 codons used in the original ORF ... ) Open-reading-frame sequence tags (OSTs) support the existence of at least ...
  28. [28]
    Definitive demonstration by synthesis of genome annotation ... - PNAS
    Using bioinformatic tools, we first identified 315 potential open reading frames (ORFs) within the genome, including the 11 established essential genes and 82 ...
  29. [29]
    Overlapping genes in natural and engineered genomes
    Oct 5, 2021 · In prokaryotic genomes, 84% of CDS overlaps are unidirectional (→→) and produced through start codon or stop codon loss, resulting in one member ...
  30. [30]
    Distinguishing protein-coding and noncoding genes in the human ...
    Putative protein-coding genes are identified based on computational analysis of genomic data—typically, by the presence of an open-reading frame (ORF) exceeding ...
  31. [31]
    The physical and evolutionary energy landscapes of ... - PNAS
    May 13, 2024 · Pseudogenes are DNA sequences that previously encoded protein sequences but are no longer translated due to sequence degradation.
  32. [32]
    Unraveling the functional dark matter through global metagenomics
    Oct 11, 2023 · In total, we identified 106,198 families with at least 100 members that will be referred to as novel metagenome protein families (NMPFs) (Table ...
  33. [33]
    Incomplete influenza A virus genomes occur frequently but ... - Nature
    Aug 6, 2019 · Segmentation of viral genomes into multiple RNAs creates the potential for replication of incomplete viral genomes (IVGs).
  34. [34]
    Functional analysis of gapped microbial genomes: Amino acid ...
    The 95 steps in the required predicted functions for T. ferrooxidans do not have assigned ORFs, which could encode the corresponding enzyme or structural gene.
  35. [35]
    Frameshift Mutation - National Human Genome Research Institute
    A frameshift mutation in a gene refers to the insertion or deletion of nucleotide bases in numbers that are not multiples of three.
  36. [36]
    Mutation, Repair and Recombination - Genomes - NCBI Bookshelf
    Insertions and deletions are often called frameshift mutations because when one occurs within a coding region it can result in a shift in the reading frame ...
  37. [37]
    Slipped-Strand Mispairing Can Function as a Phase Variation ... - NIH
    One of the mechanisms that can result in phase variation is slipped-strand mispairing (SSM), which is a process that produces mispairing between the mother and ...
  38. [38]
  39. [39]
    Mutagen-induced phytotoxicity in maize seed germination ... - Nature
    Sep 19, 2018 · At the molecular level, acridines have been shown to cause both addition- and deletion-type frameshift mutations. Acridine orange (AO) is a ...
  40. [40]
    Genetic Diversity: Frameshift Mechanisms Alter Coding of a Gene ...
    Frameshifting events, occurring in short runs of pyrimidines (mainly C residues) in the repeats, give rise to mutations which may provide a mechanism for escape ...
  41. [41]
  42. [42]
    Two frameshift mutations in the cystic fibrosis gene - PMC - NIH
    We have identified in exon 7 two frameshift mutations, one caused by a two-nucleotide insertion and the other caused by a one-nucleotide deletion.
  43. [43]
    Transient Silencing of Antibiotic Resistance by Mutation Represents ...
    Oct 30, 2019 · SARM resulted from diverse mutational events but most commonly through frameshift mutation of resistance determinants as a result of point ...
  44. [44]
    A New Model for Phenotypic Suppression of Frameshift Mutations by ...
    Ribosomes read messenger RNAs in sequential nonoverlapping triplet codons without the need for any punctuation to identify the reading frame (Crick et al. 1961) ...
  45. [45]
    Insights into genome recoding from the mechanism of a classic +1 ...
    Jan 12, 2021 · In bacteria, +1-frameshifting tRNAs that suppress single-nucleotide insertion mutations that shift the translational reading frame to the +1- ...
  46. [46]
    Frameshift suppressor mutations outside the anticodon in yeast ...
    Abstract. Extragenic suppressors of +1 frameshift mutations in proline codons map in genes encoding two major proline tRNA isoacceptors.<|control11|><|separator|>
  47. [47]
    Types of mutations - Understanding Evolution
    In frameshifts, a similar error occurs at the DNA level, causing the codons to be parsed incorrectly. This usually generates truncated proteins that are as ...