Fact-checked by Grok 2 weeks ago

LINE1

LINE-1 (L1), or Long Interspersed Nuclear Element-1, is an autonomous non-long terminal repeat (non-LTR) that comprises approximately 17% of the and represents the only active class of such elements capable of mobilizing in the . Full-length LINE-1 elements are roughly 6 kilobases in length and two proteins essential for their replication: ORF1p, a that chaperones the LINE-1 , and ORF2p, a multifunctional with endonuclease and domains that facilitate genomic insertion. Through a "copy-and-paste" mechanism involving transcription into , ribonucleoprotein complex formation, and target-primed reverse transcription, LINE-1 elements can insert new copies into the genome, thereby driving and but also risking deleterious mutations. The contains around 500,000 LINE-1 copies, but the vast majority are 5' truncated, rearranged, or otherwise inactivated by mutations, leaving only approximately 280 full-length, retrotransposition-competent elements (as of 2025). These active LINE-1s not only propagate themselves but also mobilize non-autonomous elements like Alu and SVA retrotransposons, amplifying their collective impact on ~40% of the genome. LINE-1 expression is tightly regulated in cells by epigenetic silencing, including at CpG islands in the promoter and modifications, as well as post-transcriptional mechanisms involving piwi-interacting RNAs (piRNAs), , and host antiviral factors like APOBEC3 proteins. Deregulation of LINE-1 activity, often through global hypomethylation, is a hallmark of many cancers, where increased retrotransposition contributes to genomic instability, activation, and tumor heterogeneity. For instance, somatic LINE-1 insertions have been documented in genes like in , promoting tumorigenesis, while elevated levels of LINE-1-encoded ORF1p serve as a for epithelial malignancies such as , , and ovarian cancers. Beyond cancer, LINE-1 insertions are implicated in neurological disorders and aging, underscoring their dual role as both evolutionary drivers and potential disease contributors.

Molecular Structure

5' Untranslated Region

The (5' UTR) of human LINE-1 (L1) elements spans approximately 900 base pairs and serves as an internal promoter that drives transcription of the full-length L1 . This region is essential for initiating retrotransposition by enabling autonomous expression independent of external promoters. Unlike typical eukaryotic mRNAs, the L1 5' UTR integrates promoter functionality with regulatory elements that influence both sense-strand transcription for retrotransposition and antisense transcription potentially involved in regulatory roles. The 5' UTR exhibits a bipartite structure, comprising tandemly repeated monomers (typically 2-6 copies of ~200 bp units in humans) followed by a non-repeated sequence. This organization supports bidirectional promoter activity: the upstream elements primarily direct sense transcription, while downstream motifs enable antisense promoter function, producing transcripts that may repress L1 activity or influence nearby . Key transcription factor binding sites, including multiple YY1 motifs located near the transcription start site, are critical for precise and enhancer-like effects on L1 expression. YY1 binding enhances core promoter efficiency, particularly in the context of CpG-rich sequences that are susceptible to epigenetic regulation. During retrotransposition, newly inserted L1 copies are frequently 5' truncated, often losing substantial portions of the UTR due to incomplete reverse transcription. Such truncations the promoter architecture, drastically reducing transcriptional activity and rendering most insertions transcriptionally inactive, which limits L1 proliferation in the . Full-length insertions retain promoter competence, but even partial losses of upstream monomers can diminish promoter strength by up to 90% in reporter assays. A 2025 review highlights that m6A modifications in the 5' UTR enhance L1 RNA stability and promote its export via interactions with YTHDC1, with potential contributions from secondary structures in the monomers. These features underscore the 5' UTR's multifaceted role beyond promotion, integrating post-transcriptional control to support L1 mobility.

Open Reading Frame 1

The 1 (ORF1) of LINE-1 retrotransposons spans approximately 1 kb and encodes a 338-amino-acid protein known as ORF1p, which exhibits -binding and chaperone activities essential for retrotransposition. This protein is translated from the bicistronic LINE-1 mRNA and plays a key role in forming ribonucleoprotein (RNP) complexes by binding to its own LINE-1 transcript with high affinity and without sequence specificity. ORF1p consists of three main domains: an N-terminal coiled-coil domain that mediates trimerization, a central RNA recognition motif (RRM) responsible for initial binding, and a C-terminal domain (CTD) that facilitates additional interactions and oligomerization. The protein is initially expressed as monomers in the but undergoes rapid oligomerization upon binding to , leading to the assembly of discrete, cytoplasmic RNP particles that package LINE-1 for subsequent mobilization. In the , ORF1p is primarily derived from approximately 80-100 active LINE-1 loci, reflecting the limited number of full-length, retrotransposition-competent elements capable of producing functional protein. Recent analyses of post-mortem tissue have revealed steady-state ORF1p levels that are predominantly neuronal, with about 37-48% of neurons expressing the protein across regions like the cingulate and frontal , and expression increasing with age in neurons. These findings underscore ORF1p's tissue-specific distribution and potential role in neuronal processes. Biochemical assays demonstrate that ORF1p functions as a chaperone, promoting the annealing and strand exchange of while exhibiting helicase-like activity to unwind secondary structures, thereby facilitating RNA remodeling without ATP dependence. This chaperone capability is critical for the structural dynamics of LINE-1 RNPs. In collaboration with the ORF2p protein, ORF1p supports the overall retrotransposition process by maintaining RNA integrity during particle formation.

Open Reading Frame 2

The open reading frame 2 (ORF2) of LINE-1 spans approximately 3.8 kb and encodes a 1,279-amino-acid multifunctional protein (ORF2p) with a molecular weight of about 150 kDa, which is indispensable for the element's retrotransposition. This protein integrates endonuclease and reverse transcriptase activities within a single polypeptide, enabling the conversion of LINE-1 RNA into genomic DNA inserts. Full-length ORF2p is produced through an unconventional translation mechanism at the ORF1-ORF2 junction, where ribosomes reinitiate at the first in-frame methionine codon of ORF2 following termination in the 63-nucleotide inter-ORF spacer. ORF2p comprises distinct structural domains that underpin its enzymatic roles. The N-terminal endonuclease () domain (residues ~1–239), resembling APE-like endonucleases, includes two zinc-binding motifs that facilitate DNA recognition and cleavage. This is followed by a central cysteine-rich domain (~residues 240–492) of unclear function but implicated in stability, and a C-terminal reverse transcriptase () domain (~residues 493–1,037) featuring a classic right-hand fold with seven conserved motifs (0, 1–7) essential for polymerization. A carboxy-terminal segment (~residues 1,038–1,279) contains a CCHC zinc-knuckle motif and contributes to binding and retrotransposition efficiency, though its precise role remains under investigation. The EN domain initiates retrotransposition by generating staggered single-strand nicks in target DNA at a consensus sequence of 5'-TT/AAAA-3', producing a 3'-hydroxyl group that serves as a primer. The RT domain then performs target-primed reverse transcription (TPRT), using the nicked DNA end to reverse-transcribe the LINE-1 RNA template in a process that integrates the new copy directly into the genome. These activities are coordinated within the ORF2p structure, with the protein exhibiting cis-preference for binding its own encoding RNA to form a functional ribonucleoprotein complex, aided briefly by ORF1p in assembly. Cryo-electron microscopy structures of the human LINE-1 TPRT complex, resolved in 2025 at resolutions up to 3.2 Å, have elucidated the dynamic conformational changes in ORF2p during integration. These structures capture ORF2p in complex with substrate RNA and target DNA from the factor VIII gene, revealing how the protein bends and unwinds the DNA duplex to expose the cleavage site, remodels the target for primer insertion, and forms a secure "primer grip" in the RT active site to initiate synthesis. Notably, the EN domain nicks the second DNA strand either before or concurrently with first-strand reverse transcription, contributing to variability in target site duplications.

3' Untranslated Region

The 3' untranslated region (3' UTR) of LINE-1 retrotransposons spans approximately 200 base pairs immediately downstream of 2 (ORF2). This non-coding sequence features a weak polyadenylation signal, typically an AATAAA hexamer, which often leads to inefficient cleavage and during transcription. As a result, LINE-1 transcripts frequently undergo into downstream genomic regions, producing extended mRNAs with variable poly(A) tails averaging 60 in length. These structural characteristics are essential for the element's retrotransposition efficiency, as the poly(A) tract serves as a for ORF2 protein and facilitates reverse transcription initiation. The 3' UTR contains sequence motifs that interact with Alu short interspersed nuclear elements (), promoting the formation of composite retrotransposons such as LINE-1-Alu chimeras. These interactions occur through RNA-level associations, where Alu transcripts can be co-mobilized by the LINE-1 retrotransposition machinery, often utilizing the 3' UTR's poly(A) tract for stabilization and integration. Such chimeras contribute to genomic diversity by linking Alu sequences to the 3' terminus of LINE-1 insertions, with recent comparative genomic analyses identifying over 100 such events across mammalian lineages. This mechanism underscores the 3' UTR's role in facilitating non-autonomous retrotransposition of dependent elements like Alu. A key function of the 3' UTR involves 3' , where the weak polyadenylation signal allows transcription to extend into adjacent genomic DNA, generating chimeric transcripts that incorporate downstream sequences. Upon retrotransposition, these transcripts insert the captured DNA at new loci, creating hybrid elements that propagate non-LINE-1 material. This process is responsible for approximately 10% of observed LINE-1 insertions carrying adjacent genomic DNA, as evidenced by genome-wide surveys of retrotransposon polymorphisms. Such transductions have evolutionary implications, enabling the dispersal of regulatory elements and exons across the genome. Recent studies from 2023 to 2025 have highlighted the role of s in targeting LINE-1 transcripts via the piRNA pathway to modulate transposon silencing and . These s, often derived from clusters containing repetitive sequences similar to those in the 3' UTR, are processed in cells to guide proteins to nascent transcripts, reinforcing epigenetic repression. Comparative analyses across species reveal contributions to transposon control with implications for integrity and disease susceptibility. This emerging role positions the 3' UTR not only as a structural but also as a dynamic element in small RNA-mediated defense mechanisms.

Genomic Distribution and Propagation

Copy Number and Activity

LINE-1 elements constitute approximately 17% of the , with an estimated 500,000 to 1 million copies dispersed throughout, the majority of which are 5'-truncated or harbor inactivating mutations that preclude autonomous retrotransposition. Of these, around 4,000 full-length copies exist, defined as retaining the ~6 structure with intact open reading frames, though only 80 to 100—termed "" loci—are deemed retrotransposition-competent based on and empirical assays. These active elements predominantly belong to the human-specific L1Hs subfamily, which emerged and began amplifying approximately 1 to 2 million years ago. Due to incomplete reverse transcription during propagation via target-primed reverse transcription, new LINE-1 insertions typically average ~900 bp in length. The 2022 Telomere-to-Telomere (T2T) Consortium's gapless assembly of the haploid (T2T-CHM13) has refined these estimates by resolving previously unassembled repetitive regions, identifying 1,787 L1Hs copies wherein 80.2% exhibit 5' . This complete reference highlights the precise genomic landscape of LINE-1, confirming the prevalence of truncated forms while underscoring the subset of intact elements capable of ongoing activity. Inter-individual variation in LINE-1 content arises from ongoing retrotransposition, with estimates indicating roughly 1 new insertion per 100 to 250 live births, thereby introducing 1 to 10 polymorphic sites across a population per generation on average.

Retrotransposition

LINE-1 (L1) elements propagate via target-primed reverse transcription (TPRT), a -intermediate copy-and-paste that integrates new copies into the . L1 is transcribed in the from an internal promoter and exported to the , where it assembles with the L1-encoded proteins ORF1p and ORF2p to form ribonucleoprotein (RNP) particles. These RNPs localize primarily in the but gain nuclear access, often during when the breaks down, enabling interaction with genomic DNA. In TPRT, the endonuclease (EN) domain of ORF2p introduces a single-strand nick in the target genomic DNA at a consensus sequence of 5'-TT/AAAA-3', generating a free 3'-hydroxyl (OH) group. This 3'-OH serves as the primer for reverse transcription by the RNA-dependent DNA polymerase activity of the ORF2p reverse transcriptase (RT) domain, which synthesizes complementary DNA (cDNA) starting from the poly(A) tail of the L1 RNA template. Concurrently, the RNase H activity of ORF2p degrades the RNA template as cDNA synthesis proceeds, displacing the downstream genomic DNA and facilitating integration. The second DNA strand is likely cleaved and joined later, possibly by host non-homologous end-joining repair pathways. Most new L1 insertions are 5'-truncated due to incomplete reverse transcription, often retaining only the 3' portion of the element. A 2025 study proposes that this 5' truncation acts as a protective for integrity, limiting the propagation of potentially deleterious full-length insertions by reducing their transcriptional activity and mobility. L1 insertions exhibit a bias toward AT-rich, gene-poor regions of the , reflecting the EN cleavage preference for AT-rich sites and post-insertion selection against gene disruptions. Insertions can also occur at DNA double-strand breaks independently of EN activity, integrating via host repair mechanisms. Retrotransposition frequency is approximately 1 event per 100 cells, contributing to heritable variation, while it is strongly inhibited in cells through transcriptional and post-transcriptional controls.

Regulation of Activity

Transcriptional Regulation

The transcriptional regulation of LINE-1 (L1) retrotransposons is initiated by an internal promoter embedded within the (UTR), which drives the synthesis of full-length L1 essential for retrotransposition. This promoter, spanning approximately the first 900 base pairs of the 5' UTR, functions as a II-dependent element and is responsive to host cellular that bind specific motifs to modulate initiation efficiency. Key among these is the transcription factor YY1, which binds to a conserved site near the transcription start site to facilitate accurate positioning of the transcriptional machinery and enhance promoter activity. Other factors, such as and , also interact with the 5' UTR to influence L1 expression levels in a context-dependent manner. L1 transcriptional activity exhibits strong developmental stage specificity, with high expression observed in embryonic cells and induced pluripotent cells (iPSCs), where it supports pluripotency maintenance and early processes, but markedly reduced levels in differentiated tissues due to enhanced repressive mechanisms. In human iPSCs and cerebral derived from them, recent analyses have identified thousands of active hominoid-specific L1 loci, whose expression is driven by open states and correlates with neural ; perturbation of these L1s via interference disrupts early neural and reduces size, highlighting their regulatory role in brain development. Chromatin accessibility plays a critical role in L1 promoter function, with active loci characterized by open configurations permissive to transcription factor binding, whereas repressive histone modifications, such as , deposit at L1 elements to silence expression in somatic cells by compacting and limiting promoter access. enrichment at L1 promoters correlates with low transcriptional output in differentiated lineages, serving as a barrier to ectopic activation. In active L1 loci, sense-oriented transcription from the internal promoter predominates, occurring several-fold higher than antisense transcription, which is generally limited and often arises from bidirectional promoter activity. These transcriptional controls are reinforced downstream by epigenetic modifications that maintain long-term silencing in stable cell states.

Post-Transcriptional and Epigenetic Control

Epigenetic silencing of LINE-1 (L1) elements primarily occurs through at CpG islands within the (UTR), which acts as the element's internal promoter and suppresses its transcriptional activity. This methylation is established by DNA methyltransferases DNMT3A and DNMT3B, which target CpG-rich sequences to maintain repression across cell divisions, preventing aberrant L1 expression in differentiated cells. High methylation levels (>75% at key CpG sites) correlate with transcriptional inactivity, while demethylation enables promoter accessibility and potential L1 activation. In addition to DNA modifications, histone-based epigenetic control involves KRAB zinc-finger proteins (KRAB-ZFPs) that recruit the co-repressor KAP1, which in turn activates the SETDB1 to deposit repressive marks on L1 in cells. This pathway provides a layer of formation that reinforces silencing independently of , particularly for evolutionarily younger L1 subfamilies prone to mobilization. SETDB1-mediated trimethylation is essential for stable repression, as its disruption leads to L1 derepression and genomic instability in non-germline tissues. Post-transcriptional regulation in the germline relies heavily on the piRNA pathway, where PIWI proteins loaded with piRNAs recognize and cleave L1 transcripts through a ping-pong amplification cycle. In this mechanism, primary antisense piRNAs guide PIWI to slice sense L1 RNA, generating 5' fragments that serve as templates for secondary piRNAs bound to proteins like Aubergine, amplifying the response and directing degradation of L1 mRNA to limit retrotransposition. This slicer-dependent process ensures robust silencing during gametogenesis, protecting the genome from insertional damage. APOBEC3 proteins, such as APOBEC3A and APOBEC3C, further inhibit L1 retrotransposition post-transcriptionally by deaminating cytidines to uridines in L1 transcripts, which disrupts stability and impairs reverse transcription during the mobilization process. This editing introduces hypermutations that render the RNA non-functional for cDNA synthesis, providing a deamination-dependent barrier particularly effective against active L1 elements. APOBEC3A's activity has been observed in immune cells, extending L1 control beyond the . Somatic reactivation of L1 can occur in states of global hypomethylation, often mediated by ten-eleven translocation () enzymes that oxidize to , facilitating active demethylation and promoter derepression. and , in particular, contribute to this process during cellular or stress responses, leading to increased L1 transcription in non-proliferative contexts like neural progenitors. A 2025 review highlights how such TET-driven dynamics integrate L1 regulation into developmental processes, influencing networks without full retrotransposition.

Biological Roles

Contribution to Genome Evolution

LINE-1 retrotransposons have profoundly influenced eukaryotic through episodic amplification bursts, particularly in mammals over the past approximately 100 million years, which have significantly expanded sizes. These bursts align with the diversification of mammals, where LINE-1 elements proliferated alongside other transposable elements, contributing to an "accordion" model of genome dynamics involving gains and losses of repetitive DNA. In mammals, LINE-1 sequences alone account for a substantial portion of this expansion, with their activity shaping genome architecture across lineages. A key mechanism by which LINE-1 drives innovation is through shuffling and the creation of novel genes, often via insertions near or within exons that rearrange coding sequences. Experimental evidence demonstrates that LINE-1 retrotransposition can capture and relocate exons, generating chimeric transcripts with potential new functions. This process has contributed to the modular evolution of genes, allowing for functional diversification without relying solely on point mutations. LINE-1 also indirectly amplifies genomic diversity by supplying reverse transcriptase for the mobilization of non-autonomous short interspersed nuclear elements (SINEs), such as the ~1 million Alu elements in the , which comprise about 11% of total DNA. These Alu insertions, dependent on LINE-1 machinery, have further fueled evolutionary changes by altering structure and providing raw material for recombination. In the , LINE-1-derived sequences constitute roughly 20% of the total, underscoring their pervasive impact. Recent studies (2023–2025) have identified hominoid-specific LINE-1 integrants within neurodevelopmental genes, suggesting these recent insertions promote regulatory novelty in brain evolution. While the majority of LINE-1 insertions are deleterious or selectively neutral—often silenced or purged to mitigate genomic instability—some have played adaptive roles by enhancing and facilitating species-specific adaptations. This balance reflects LINE-1's dual nature as both a parasitic force and an evolutionary engine, with rare beneficial integrations fixed in populations to drive innovation.

Influence on Gene Expression and Development

Active LINE-1 (L1) elements exhibit heightened expression during early embryogenesis, particularly in preimplantation stages, where they contribute to and developmental progression before being epigenetically silenced post-implantation to prevent genomic instability. In mouse embryos, L1 transcription peaks at the 2-cell stage, facilitating global epigenetic and the activation of zygotic expression, with interference in this process leading to arrest in preimplantation development. Similarly, in human embryos, L1 activity is elevated from the to the 8-cell stage, correlating with decreased and supporting the transition to totipotency, after which progressive silencing via and modifications occurs around implantation. This temporal regulation ensures L1's transient role in early development without long-term disruption. Young, evolutionarily recent L1 elements can function as enhancer-like sequences in embryonic cells (ESCs), particularly influencing genes associated with pluripotency. In ESCs, the 5' untranslated regions (UTRs) of young L1s, marked by the ELL3, act as cis-regulatory enhancers that promote the of core pluripotency factors such as Nanog and Oct4, thereby maintaining naive pluripotency states. This enhancer activity is epigenetically modulated, with treatment enhancing L1 expression and improving (iPSC) reprogramming efficiency by facilitating epigenetic remodeling. Additionally, antisense transcription initiated from L1 promoters can interfere with nearby host gene expression, producing chimeric transcripts or overlapping RNAs that disrupt normal transcriptional and splicing of adjacent genes. For instance, the L1 antisense promoter drives bidirectional transcription, leading to sense-antisense overlaps that attenuate expression of cellular genes in a position-dependent manner. Recent studies have elucidated L1's role in human pluripotency dynamics and using iPSCs and organoids. In human iPSCs and cerebral organoids, thousands of hominoid-specific L1 integrants are actively transcribed, with their suppression via interference impairing the exit from pluripotency and delaying neuronal by dysregulating genes involved in neural progenitor maintenance. This L1-mediated transcriptional control is cis-acting, where L1-derived s influence nearby loci to coordinate the transition from pluripotent to neuroectodermal states, highlighting a conserved mechanism across species. Furthermore, L1 ribonucleoprotein (RNP) particles, containing ORF1p and L1 , localize to cytoplasmic granules, where they interact with host mRNAs and RNA-binding proteins to modulate mRNA stability and translation under conditions. ORF1p's association with granule components, such as G3BP1, sequesters L1 RNPs and potentially stabilizes or degrades interacting mRNAs, linking L1 activity to post-transcriptional gene regulation during developmental responses.

Associations with Disease

Insertional Mutagenesis in Genetic Disorders

Insertional mutagenesis by LINE-1 (L1) retrotransposons occurs primarily through de novo insertions in the or early embryonic stages, leading to heritable genetic disorders by disrupting critical genes. These events are estimated to happen at a frequency of approximately 1 in 100 births, though only a subset result in pathogenic outcomes due to their into functional genomic regions. Such insertions often introduce premature stop codons, cause aberrant splicing, or alter , manifesting as monogenic diseases. Over the past decades, around 100 cases of disease-causing L1 insertions have been documented, highlighting their role in rare but impactful mutations. Seminal examples include L1 insertions into the gene (F8), responsible for severe hemophilia A, first identified in the late 1980s in unrelated patients where truncated L1 sequences integrated into 14, disrupting coagulation factor production. Similarly, full-length L1 insertions in intron 14 of the gene (RB1) have been linked to familial retinoblastoma, promoting abnormal splicing and loss of tumor suppressor function in affected individuals and their relatives. These cases underscore how L1-mediated disruptions in key developmental or housekeeping genes can lead to severe, inherited phenotypes. In addition to purely events, early embryonic L1 insertions can result in , where only a portion of cells carry the , contributing to isolated or variable expressivity in disorders. For instance, mosaic L1 integrations have been implicated in sporadic cases of neurofibromatosis type 1 and , where partial tissue involvement leads to milder or atypical presentations compared to uniform mutations. A 2023 review emphasizes the hallmarks of these mutagenic events, including target site duplications and poly-A tail variations, which distinguish pathogenic L1 insertions from polymorphic ones. Detection of these insertions has advanced with long-read sequencing technologies, such as PacBio and Oxford Nanopore, which enable identification of full-length L1 elements (>6 kb) that short-read methods often overlook due to repetitive sequences and structural complexity. This approach has facilitated the characterization of previously undetected events in clinical , improving of heritable disorders.

Deregulation in Cancer

Global hypomethylation of LINE-1 (L1) elements, observed as a hallmark in the majority of human cancers, leads to their reactivation and increased retrotransposition activity, promoting genomic instability in tumor cells. This epigenetic deregulation, which normally suppresses L1 through methylation-based silencing, results in elevated expression of L1-encoded proteins such as ORF1p and ORF2p in various tumor types. For instance, ORF1p is upregulated in approximately half of human cancers, serving as a marker of active L1 mobilization. Similarly, ORF2p, which provides endonuclease and functions essential for retrotransposition, shows enhanced levels in epithelial cancers including colon, , and tumors. Somatic L1 insertions frequently disrupt key regulatory genes, contributing to oncogenesis by inactivating tumor suppressors or altering function. Notable examples include insertions into the gene in colorectal cancers, which truncate its protein product and promote tumorigenesis, and into PTEN in various solid tumors, leading to loss of its phosphatase activity and enhanced PI3K signaling. These events can also generate chimeric transcripts, such as L1-MET fusions, that drive aberrant and cellular transformation. In a 2025 pan-cancer multi-omic analysis, the TotalReCall algorithm was developed to detect locus-specific L1 retrotransposition events across thousands of tumor samples, revealing strong correlations between L1 transcriptional activity and heterogeneous insertion efficiencies that exacerbate tumor evolution. Circulating L1 fragments in cell-free DNA (cfDNA) have emerged as noninvasive biomarkers for early cancer detection, with 2025 studies demonstrating their utility in multi-cancer screening through hypomethylation patterns indicative of active retrotransposition. L1 hypomethylation and activity correlate with poor prognosis in specific malignancies, including reduced overall survival in colorectal and lung cancers. Therapeutically, targeting L1 reverse transcriptase with inhibitors, such as repurposed antiretrovirals, has shown promise in suppressing retrotransposition and tumor growth in preclinical models of colorectal and other cancers.

Implications in Neurological Conditions

LINE-1 (L1) retrotransposons have been implicated in various neurological conditions through mechanisms such as retrotransposition, leading to genomic mosaicism, and deregulation of L1 expression, which can disrupt neuronal function and contribute to disease pathology. In the , L1 activity is particularly relevant during and in mature neurons, where it may influence regulation and cellular heterogeneity, but aberrant activity is associated with neurodevelopmental and neurodegenerative disorders. In , postmortem analyses of patient brains have revealed increased L1 copy numbers, particularly in neurons from the , suggesting elevated somatic retrotransposition that may contribute to synaptic and circuit dysfunction. This neuronal-specific increase contrasts with stable L1 levels in non-neuronal cells, indicating a role in brain-specific pathology; induced pluripotent stem cell-derived neurons from schizophrenia patients also exhibit higher L1 activity. Hypomethylation of L1 elements is a key feature in Aicardi-Goutières syndrome (AGS), an autoinflammatory caused by mutations in genes like TREX1, RNASEH2, and SAMHD1 that normally repress L1 activity. These mutations lead to genome-wide DNA hypomethylation, accumulation of L1-derived nucleic acids, and activation of innate immune responses, resulting in interferon-driven and neurological symptoms such as and developmental delay. Somatic L1 insertions in neurons generate genomic mosaicism that has been linked to neurodevelopmental disorders including autism spectrum disorder () and . In ASD brains, deep sequencing has identified mosaic L1 insertions in cortical neurons, potentially altering and contributing to neuronal diversity or dysfunction; similar somatic events occur in neural progenitors, amplifying variability during brain development. In epilepsy, L1-mediated mosaicism in focal cortical dysplasia lesions disrupts neuronal signaling, with insertions near epilepsy-associated genes exacerbating seizure susceptibility. Recent studies have highlighted neuron-predominant expression of L1-encoded ORF1p protein in the , with levels increasing in aging and potentially disrupting through interactions with proteins involved in cGMP signaling and synaptic transmission. In models, ORF1p is expressed in up to 91% of frontal neurons under steady-state conditions, and its elevation correlates with altered , which may underlie cognitive impairments in neurological conditions. L1 activity contributes to (ALS) and (FTD) by forming a feedback loop with , where L1 derepression promotes TDP-43 aggregation and vice versa. In TDP-43-deficient models, increased L1 retrotransposition leads to RNA stress and neuronal death, mimicking ALS/FTD neurodegeneration; conversely, L1 transcripts can seed TDP-43 mislocalization and inclusion formation in motor neurons. In retinal diseases such as (), or L1 insertions disrupt key genes like , leading to photoreceptor degeneration. Targeted sequencing in RP families has identified large chimeric L1 insertions in RP1 exons, segregating with disease and causing frameshift mutations that impair microtubule stability in rod cells. L1 retrotransposition occurs actively in neural progenitor cells during brain development, generating mosaicism that influences neuronal differentiation and complexity. In human neural progenitors, L1 insertions regulate accessibility and exit from pluripotency, with dysregulation potentially contributing to neurodevelopmental vulnerabilities in later neurological conditions. Recent research has revealed that LINE-1 (L1) retrotransposons become activated during aging through declining , particularly hypomethylation of L1 promoters, which serves as an epigenetic of aging and correlates with chronological age across tissues. This age-associated L1 derepression contributes to chronic low-grade inflammation, known as inflammaging, by promoting the release of pro-inflammatory cytokines and sustaining the (SASP) in aging cells. In mammalian models, L1 patterns at specific loci track age progression and are linked to the incidence of age-related pathologies, highlighting L1's role in driving systemic inflammatory responses that exacerbate tissue decline. A 2025 study demonstrated that L1 retrotransposition directly contributes to and tissue dysfunction by generating cytoplasmic cDNA that activates /type I pathway, leading to persistent and SASP maintenance in senescent cells. This process is amplified in late-stage , where L1 transcription increases 4- to 5-fold, causing DNA damage, genomic instability, and replicative stress that impair tissue homeostasis in aging organisms. inhibitors, such as lamivudine, have shown promise in suppressing L1-driven responses and reducing age-associated in preclinical models, suggesting potential interventions for senescence-related disorders. L1 elements interact with the through evasion and activation mechanisms, including sensing of L1 RNA and cDNA by the cGAS-STING pathway, which triggers antiviral type I responses as part of innate immunity. Conversely, APOBEC3 family proteins inhibit L1 retrotransposition by deaminating L1 RNA or cDNA, preventing and limiting immune activation, with differential efficacy among APOBEC3 isoforms (e.g., APOBEC3A to APOBEC3H). In autoimmune diseases like systemic lupus erythematosus (SLE) and primary Sjögren's syndrome (SS), aberrant L1 expression in affected tissues, such as kidneys and salivary glands, exposes endogenous nucleic acids that stimulate plasmacytoid dendritic cells via TLR7/8, driving type I production and . L1 promoter hypomethylation in these conditions correlates with elevated L1 mRNA and -alpha levels, underscoring L1's contribution to nucleic acid-mediated immune dysregulation. During the , reports from 2020 to 2023 identified L1 hypomethylation in peripheral blood of infected patients, particularly those with moderate to severe disease, leading to activation and heightened via pathways like cGAS-STING. This epigenetic alteration persisted for at least three months post-recovery, correlating with disease severity and promoting prolonged immune overactivation, which exacerbates storms in severe cases. Targeting L1 activity, such as through inhibition, emerges as a potential therapeutic to mitigate in severe and related long-term sequelae.