Fact-checked by Grok 2 weeks ago

Mutational signatures

Mutational signatures are the characteristic patterns of mutations observed in cancer genomes, representing the cumulative imprints left by specific mutational processes that operate during tumorigenesis, including those driven by environmental mutagens, replication errors, or defects in mechanisms. These signatures manifest as distinct combinations of base substitutions—such as C>T transitions—within defined trinucleotide sequence contexts, enabling the classification of up to 96 possible types based on the six main substitution classes and the flanking bases. The concept of mutational signatures emerged from comprehensive analyses of whole-genome sequencing data, with the first systematic cataloguing in identifying 21 such signatures across multiple cancer types by applying algorithms to millions of mutations from over 7,000 tumors. This approach revealed varying mutation burdens, from as low as 0.001 mutations per megabase to over 400 in cancers linked to or light exposure, and associated specific signatures with etiologies like Signature 4 (tobacco-induced) and Signature 7 (UV-induced). Subsequent advancements, including refined computational tools like SigProfiler, have expanded the repertoire, extracting 67 single-base substitution () signatures, 11 doublet-base substitution () signatures, and 17 insertion/deletion () signatures from over 23,000 cancer samples in 2020, providing higher resolution through whole-genome data and better separation of overlapping processes. Today, mutational signatures are cataloged in authoritative databases like COSMIC (version 3.4, updated October 2023), which curates reference sets of high-confidence signatures across (96 contexts), DBS (78 types), ID (83 types), copy number, structural variants, and even RNA-level changes to track mutational processes in human cancers. Their significance extends beyond etiology, informing clinical applications such as identifying patients with deficiency via Signature 3 for targeted therapies like , and revealing age-related signatures like SBS1 that accumulate spontaneously over a lifetime. Ongoing research continues to uncover new signatures tied to therapies (e.g., platinum chemotherapy) and non-cancer contexts, underscoring their role in dissecting the genomic landscape of disease.

Introduction and general concepts

Definition and overview

Mutational signatures are characteristic patterns of in cancer genomes that reflect the cumulative effects of specific mutational processes, including DNA damage from endogenous or exogenous sources and errors in . These patterns manifest as combinations of types, such as single base substitutions (SBS), small insertions and deletions (indels), and larger structural variants, each shaped by the underlying biological mechanisms acting over time. By analyzing the relative frequencies of these types within trinucleotide contexts or other genomic features, signatures provide a genomic of the processes driving . The concept of mutational signatures emerged from large-scale whole-genome sequencing efforts, with the first comprehensive identification occurring in 2013 through the analysis of 507 cancer samples across multiple tumor types. This pioneering work revealed over 20 distinct signatures, demonstrating that cancers accumulate mutations in predictable patterns rather than randomly, revolutionizing the study of somatic evolution. These signatures hold critical importance for oncology, as they enable the tracing of mutational causes back to specific etiologies, such as aging, environmental exposures, or therapeutic interventions, thereby enhancing understanding of cancer development and informing strategies for prevention and treatment. For instance, Signature 1 (SBS1), often described as a clock-like signature, arises from the spontaneous deamination of 5-methylcytosine to thymine, predominantly producing C>T transitions at CpG sites and correlating with patient age.

Underlying mechanisms

Mutational processes in the arise from a combination of endogenous and exogenous sources that introduce DNA damage, which, if not properly repaired, results in permanent mutations. Endogenous includes spontaneous chemical alterations such as base deamination (e.g., to uracil, leading to C>T transitions, especially at CpG sites), hydrolysis creating abasic sites, and oxidative damage from (ROS) like hydroxyl radicals that generate lesions such as (8-oxoG), causing G:C to T:A transversions. Replication errors during also contribute, with polymerases occasionally inserting incorrect bases or slipping at repetitive sequences, at rates of approximately 10^{-6} to 10^{-8} errors per nucleotide per . Exogenous factors encompass environmental agents like , which produces ROS and double-strand breaks (DSBs), (UV) light forming cyclobutane (CPDs) and (6-4) photoproducts that lead to C>T or CC>TT mutations, and chemicals such as alkylating agents that add methyl groups to bases, resulting in G:C to A:T transitions. Cells counteract this damage through multiple DNA repair pathways, each specialized for specific lesion types. (BER) addresses small, non-helix-distorting base modifications, such as deaminated or oxidized bases: remove the damaged base, creating an abasic site that is incised by AP endonuclease (APE1), followed by insertion via β (POLβ) and ligation. (NER) handles bulky, helix-distorting adducts like UV-induced dimers or chemical adducts; it involves damage recognition by XPC-RAD23B, unwinding by TFIIH, excision of a 24-32 oligonucleotide by XPF-ERCC1 and XPG, and resynthesis by polymerases δ/ε. Mismatch repair (MMR) corrects base-base or insertion/deletion mismatches from replication, using MutSα (MSH2-MSH6) or MutSβ for recognition, exonuclease 1 (EXO1) for strand-specific excision, and resynthesis by POLδ. For DSBs, (HR) employs a sister chromatid template for accurate repair: the MRN complex resects ends, RAD51 forms a filament to invade the homolog, and restores the sequence. In contrast, (NHEJ) rapidly ligates DSB ends without a template, involving Ku70/80 binding, activation for end processing, and ligation by LIG4-XRCC4, often introducing small insertions or deletions. Failure or deficiency in these repair pathways allows damage to persist, directly linking specific lesions to mutation types. Unrepaired deamination or oxidation in BER-deficient contexts yields base substitutions, such as C>T from deamination or G to T from 8-oxoG mispairing with during replication. NER defects leave bulky lesions that, upon replication, cause base substitutions like C:G to T:A from UV photoproducts via error-prone translesion synthesis. MMR inefficiency permits replication slippage in microsatellites, generating small insertions or deletions (indels) of 1-5 bases. DSBs unresolved by HR or erroneously joined by NHEJ produce structural variants (SVs), including deletions, duplications, inversions, or translocations, often with microhomology at junctions indicating NHEJ involvement. An additional layer of complexity is transcriptional strand bias, where mutations accumulate preferentially on the non-transcribed strand due to asymmetry in repair efficiency. Transcription-coupled NER (TC-NER), a subpathway of NER, prioritizes repair of lesions on the transcribed (template) strand by recruiting repair factors when stalls at damage sites, involving CSA and CSB proteins; this protects actively transcribed genes but leaves the non-transcribed strand more susceptible to . Consequently, processes like oxidative damage or certain replication errors exhibit higher mutation rates on the non-transcribed strand, influencing the overall mutational landscape.

Genomic data and mutation types

Base substitutions

Base substitutions, also known as single base substitutions (SBS) or single nucleotide variants (SNVs), refer to mutations where one nucleotide base in the DNA sequence is replaced by another. These changes can involve any of the four bases (A, C, G, T) substituting for the original, resulting in six possible substitution types when accounting for strand symmetry: C>A, C>G, C>T, T>A, T>C, and T>G. In mutational signature analysis, SBS mutations are classified based on their trinucleotide to capture the immediate sequence environment influencing the mutation. This involves the mutated base flanked by one on each side (5' and 3'), yielding 4 × 4 × 6 = 96 possible combinations, often represented as 96-channel vectors where each channel corresponds to the relative frequency of a specific in its . The Catalogue of Somatic Mutations in Cancer (COSMIC) employs this 96-channel framework as the standard for cataloging signatures across cancer genomes. SBS mutations predominate among point mutations in cancer, comprising approximately 90% of such events, with insertions and deletions (indels) making up the remainder at about 10% frequency relative to substitutions. This prevalence underscores their central role in signature extraction from whole-genome sequencing data. Representative examples of SBS contexts include C>T transitions occurring at CpG dinucleotides, which arise from spontaneous of , a process reflected in signatures like COSMIC SBS1. Another prominent context is C>T or C>G substitutions at TpC motifs, characteristic of cytidine deaminase activity and captured in signatures such as SBS2 and SBS13. These contexts highlight how sequence-specific patterns enable the dissection of underlying mutational processes without delving into their mechanistic details.

Insertions, deletions, and structural variants

Insertions and deletions (indels) refer to small-scale genomic alterations involving the addition or removal of 1 to 50 base pairs, distinct from point mutations like base substitutions. In mutational signature analysis, indels are classified into 83 distinct types (ID1 to ID83) based on factors such as event length, the specific nucleotides affected, presence in repetitive sequences, and microhomology at breakpoints, as defined by the . These signatures capture patterns arising from replication errors, repair deficiencies, or other processes, with extraction typically performed using on catalogs from whole-genome sequencing data. For instance, ID2 predominantly features 1-base pair deletions of in mononucleotide poly-T repeats, attributed to polymerase slippage during of the template strand. Structural variants (SVs) encompass larger genomic rearrangements exceeding 1 kilobase, including deletions, tandem duplications, inversions, and translocations that disrupt structure. COSMIC catalogs 12 rearrangement signatures (RS1 to RS12), derived from patterns in over 10,000 whole-genome sequences, classifying SVs into 32 subtypes by size categories (1-10 kb, 10-100 kb, 100 kb-1 Mb, 1-10 Mb, >10 Mb), orientation, clustering (e.g., tandem or dispersed), and complexity. These signatures reflect underlying mutational processes, such as breakage-fusion-bridge cycles or replication-based mechanisms, and are validated using multiple callers like , Lumpy, and Delly to ensure accuracy. Analyzing and signatures presents unique challenges compared to single base substitutions, primarily due to their lower frequency in most tumor genomes—indels occur at rates 10-100 times lower than substitutions, while SVs vary widely but often number in the dozens per sample—necessitating larger cohorts for robust . Accurate detection requires high-confidence variant calling pipelines to distinguish true events from artifacts, particularly for clustered SVs or short indels in repetitive regions, and the lack of a universally intuitive classification system complicates . Despite these hurdles, such analyses link indel and SV patterns to processes like deficiency, enhancing their utility in identifying tumor-specific mutagenic landscapes when integrated with larger datasets.

Tumor mutation catalogs

Tumor mutation catalogs are compiled from large-scale genomic datasets of cancer samples to enable the extraction and analysis of mutational signatures. Primary data sources include whole-genome sequencing (WGS) and whole-exome sequencing (WES) efforts from consortia such as (TCGA) and the Cancer Genome (ICGC). WGS provides comprehensive coverage of both coding and non-coding regions, capturing the full spectrum of mutations essential for accurate signature profiling, as the majority of somatic mutations occur outside exonic areas. In contrast, WES focuses on protein-coding exons but limits insight into non-coding mutational processes, though it remains widely used due to lower cost and higher depth. These datasets form the backbone of catalogs, with TCGA and ICGC collectively providing mutation profiles from over 15,000 tumors across diverse cancer types. Catalog construction begins with variant calling on aligned sequencing reads from tumor-normal pairs to identify mutations. Tools such as MuTect for single nucleotide variants and Strelka for small indels perform this by comparing tumor and matched normal genomes to distinguish alterations from variants and sequencing artifacts. Subsequent filtering steps remove low-confidence calls based on , strand bias, and quality scores, ensuring high specificity. Mutations are then annotated with their trinucleotide contexts (e.g., the 5' and 3' flanking bases) to generate 96-channel profiles for base substitutions, facilitating signature analysis. This process yields structured catalogs of validated mutations, often aggregated into public repositories like the NCI Genomic Data Commons or ICGC data portal for community access. Tumor mutation burden (TMB), defined as the total number of somatic mutations per megabase of interrogated, serves as a broad for overall genomic in these catalogs. Typically calculated from nonsynonymous variants in exonic regions via WES, TMB correlates with neoantigen load and response but lacks granularity on underlying processes. Mutational signatures, derived from cataloged mutations, offer process-specific insights by deconvolving contributions from distinct etiologies, such as environmental exposures or replication errors, beyond mere TMB quantification. Robust requires large cohort sizes to overcome and achieve statistical , with thousands of samples needed to reliably identify rare or tissue-specific patterns. Current catalogs from TCGA and ICGC encompass over 10,000 tumors, enabling pan-cancer analyses that reveal conserved and novel signatures across histologies. For instance, the Pan-Cancer Analysis of Whole Genomes (PCAWG) consortium analyzed 2,658 WGS samples, demonstrating how scaled catalogs enhance signature resolution.

Endogenous mutational processes

Age-related and spontaneous mutagenesis represents a fundamental endogenous process contributing to the accumulation of somatic mutations over time, independent of external exposures. These processes manifest primarily through three clock-like mutational signatures: SBS1, SBS5, and SBS18. SBS1 is characterized by C>T transitions predominantly at CpG dinucleotides, arising from the spontaneous deamination of 5-methylcytosine (5mC) to thymine, which creates a G:T mismatch if unrepaired. This signature accumulates linearly with chronological age in both normal and cancerous tissues across diverse cell types, reflecting ongoing endogenous DNA damage. SBS5 features a broader spectrum of base substitutions, including T>C and A>G transitions, and also exhibits a strong positive correlation with age, though its precise etiology remains under investigation. SBS18 is characterized by C>A transversions at TpCpT contexts, likely resulting from oxidative DNA damage by reactive oxygen species (ROS). The underlying mechanisms of these signatures involve intrinsic cellular processes that generate DNA lesions during normal metabolism. For SBS1, the deamination of 5mC occurs spontaneously and is exacerbated by incomplete repair via base excision repair pathways, leading to persistent mismatches during replication. SBS5 is a clock-like signature that accumulates independently of cell divisions, possibly due to endogenous damage such as reactive oxygen species (ROS), though its precise etiology remains under investigation. Oxidative stress, a byproduct of cellular respiration, contributes to spontaneous mutagenesis by producing reactive oxygen species (ROS) that oxidize DNA bases, creating abasic sites or modified nucleotides that fuel signatures such as SBS18 and potentially SBS5 if not fully repaired. These processes are particularly pronounced in post-mitotic tissues, such as neurons and oligodendrocytes, where reduced proliferative activity limits dilution of mutations but allows accumulation from unrepaired damage over extended lifespans. Evidence for these signatures underscores their universality and utility as molecular clocks. SBS1, SBS5, and SBS18 are detected in virtually all tissues, both and neoplastic, with mutation burdens increasing predictably with age—typically by dozens of events per year across the . In proliferative tissues, SBS1 correlate with the estimated number of divisions, serving as a proxy for mitotic history. This clock-like behavior enables estimation of lifetime cell divisions, highlighting how spontaneous drives baseline genomic instability that can predispose to oncogenesis without external influences.

APOBEC cytidine deaminase activity

APOBEC cytidine deaminases are a family of enzymes primarily involved in innate antiviral defense by editing genomes through cytidine deamination, converting (C) to uracil (U) in single-stranded DNA. In cancer, dysregulated activity, particularly from APOBEC3A and APOBEC3B, leads to off-target in host genomic DNA, contributing to hypermutation and tumor evolution. This process generates characteristic mutational signatures dominated by C>T and C>G transitions at TCW trinucleotide motifs (where W is A or T), reflecting the enzymes' preference for deaminating cytosines flanked by . The primary APOBEC-associated signatures are SBS2 and SBS13. SBS2 is characterized by C>T mutations at TCW contexts, arising when incorporates adenine opposite unrepaired uracil, resulting in a C-to-T transition after . In contrast, SBS13 features C>G transversions at the same motifs, occurring when removes the uracil to create an abasic site, followed by error-prone translesion synthesis (e.g., by polymerase REV1) that inserts opposite the lesion. These signatures often co-occur and are linked to episodic bursts of , such as kataegis, potentially triggered by , viral infections, or replication stress. SBS2 and SBS13 are enriched in multiple cancer types, notably and cancers, where they account for a substantial fraction of somatic mutations—up to 25% in some tumors and similarly high in non-small cell . 3B upregulation, often due to copy number gains or transcriptional activation, correlates with signature prevalence, while germline deletions in the 3A/B locus modulate in and cancers. A related but distinct pattern, captured in SBS12, shows similarities in involvement but differs in context preferences and is observed at lower levels in liver cancers, though its precise link to remains under investigation. Clinically, signatures contribute to elevated (TMB), often exceeding 10 mutations per megabase, which enhances neoantigen presentation and correlates with improved responses to inhibitors, such as PD-1/ blockade, in and cancers. For instance, tumors with dominant APOBEC activity show prolonged in cohorts, independent of other signatures. However, this hypermutator can also drive aggressive evolution and resistance to targeted therapies.

DNA repair deficiencies

DNA repair deficiencies in key pathways, such as (HR), (MMR), and (NHEJ), generate distinct mutational signatures by allowing unrepaired or error-prone repair of DNA damage to persist in the genome. These deficiencies often arise from germline or somatic in repair genes, leading to increased mutation rates and specific patterns observable in cancer genomes. Homologous recombination deficiency (HRD), commonly caused by mutations in or , is characterized by single base substitution signature SBS3, which features a balanced profile across trinucleotide contexts but is enriched in , ovarian, and pancreatic cancers. Indel signature ID8, involving deletions of 10 or more s with 1 microhomology, and rearrangement signature RS3, marked by small tandem duplications and clustered structural variants, also reflect HRD due to reliance on alternative error-prone repair mechanisms like . These signatures are prevalent in BRCA1/2-mutated tumors, where HRD leads to genomic instability, including and copy number alterations, and serve as biomarkers for sensitivity to in ovarian and cancers. Mismatch repair deficiency (MMRd), often from mutations in MLH1, MSH2, MSH6, or PMS2, produces signatures SBS6, SBS15, SBS20, and SBS26, all linked to (MSI) and a bias toward transversions at specific motifs. These patterns arise from unrepaired replication errors and polymerase slippage, resulting in hypermutation and elevated , particularly in colorectal, endometrial, and gastric cancers. -high tumors with MMRd signatures predict enhanced efficacy of inhibitors, such as , due to increased neoantigen load and . Errors in (NHEJ), the primary pathway for repairing double-strand breaks without a template, contribute to signature SBS17, characterized by C>T and C>A substitutions, often alongside rearrangements featuring blunt-end joins. NHEJ deficiencies, such as those from LIG4 or XRCC4 mutations, promote imprecise ligation, leading to small insertions or deletions at break sites, though these signatures are less common and sometimes co-occur with other repair defects.

Exogenous and environmental exposures

Ultraviolet radiation

Ultraviolet (UV) radiation from sun exposure is a primary environmental responsible for inducing distinct mutational signatures in cancers, particularly those of the skin. These signatures reflect DNA damage primarily caused by UVB wavelengths (280–315 nm), which penetrate the and generate photoproducts such as cyclobutane (CPDs) and 6-4 photoproducts. These lesions, if unrepaired, lead to characteristic base substitutions during replication or translesion synthesis. The most prominent UV-associated signature is SBS7 (including subtypes SBS7a–d), characterized by C>T transitions at dipyrimidine sites, such as TC>TT or CC>TT, resulting from CPD-induced mispairing where deaminates to uracil, pairing with instead of . SBS7 is highly prevalent in sun-exposed skin cancers, with the highest contributions observed in melanomas, where it can account for over 80% of substitutions in some tumors. The signature's intensity is amplified by deficiencies in (NER), the primary pathway for removing UV photoproducts; in melanomas, reduced NER activity leads to persistence of these lesions and elevated mutation rates compared to other skin cancers like . The contribution of UV signatures exhibits a dose-response relationship with cumulative lifetime exposure, as evidenced by higher SBS7 loads in tumors from individuals with chronic sun exposure, such as outdoor workers or those in sunny climates, correlating positively with mutation burden in epidermal samples. This pattern supports UV's role in both and progression of cancers, with signature prevalence increasing exponentially with age and exposure intensity in normal tissues.

Alkylating agents and chemotherapy

Alkylating agents, a class of chemotherapeutic drugs, introduce alkyl groups onto DNA bases, primarily guanine at the O6 position, leading to the formation of O6-methylguanine (O6-meG) adducts. These adducts cause mispairing with thymine during DNA replication, resulting in C>T transitions if unrepaired. This process is particularly prominent when O6-alkylguanine-DNA alkyltransferase (MGMT) or mismatch repair (MMR) pathways are deficient, amplifying mutagenesis. Base excision repair (BER) plays a role in repairing some alkylated lesions but is detailed elsewhere. Signature 11 (SBS11) exemplifies this damage, featuring C>T and C>A substitutions enriched at TpC trinucleotides (e.g., TCC, TCA, TCT contexts). It is strongly associated with exposure to temozolomide (TMZ), an S_N1-type alkylator used in glioblastoma treatment, where it appears in nearly all hypermutated post-treatment tumors. Similarly, procarbazine, another alkylator employed in regimens for anaplastic gliomas and Hodgkin lymphoma, induces SBS11-like patterns, as observed in experimental models of bone marrow mutagenesis. Recent studies (as of 2024) have shown procarbazine induces higher mutation burdens and novel signatures in Hodgkin lymphoma. SBS11's persistence in tumor genomes serves as a biomarker of prior alkylating chemotherapy, aiding in reconstructing patient treatment histories and assessing secondary malignancy risks. Signature 9 (SBS9), characterized by T>C transitions at TCC trinucleotides, arises from error-prone replication by η (pol η), a translesion synthesis enzyme involved in bypassing DNA lesions including those from alkyl . In lymphomas treated with alkylators like bendamustine, SBS9 is observed alongside other signatures, reflecting pol η's role in tolerating replication stress post-adduct formation, though it may also contribute to in lymphoid cells. This signature's presence in post-chemotherapy samples highlights how therapeutic alkylators can inadvertently promote mutagenesis via TLS polymerases. Therapeutically, these signatures inform glioma and lymphoma management, where TMZ or procarbazine efficacy correlates with MGMT status, but unrepaired adducts drive resistance and relapse through hypermutation. Detecting SBS11 or SBS9 in relapsed tumors indicates alkylator exposure, guiding avoidance of similar agents to prevent further genomic instability.

Tobacco and other carcinogens

Tobacco smoke is a major source of environmental carcinogens that induce specific mutational signatures in human cancers, particularly in tissues directly exposed such as the lung and bladder. Among these, signature 4 (SBS4) is characterized by C>A transversions predominantly at CpCpT trinucleotide contexts and is attributed to DNA adducts formed by benzopyrene diol epoxide (BPDE), a metabolite of the polycyclic aromatic hydrocarbon benzopyrene present in cigarette smoke. This signature is prominently observed in lung squamous cell carcinomas and adenocarcinomas from smokers, where it correlates with cumulative smoking exposure measured in pack-years, contributing significantly to the overall mutation burden in these tumors. Experimental models exposed to tobacco mutagens replicate the SBS4 profile, confirming its causal link to smoking-related DNA damage. In addition to inhaled , smokeless forms like generate distinct mutational patterns. Signature 29 (SBS29), featuring C>A mutations in YCY motifs (where Y denotes ), is associated with tobacco chewing and is frequently detected in oral cavity and pharyngeal cancers among users in regions with high prevalence of this habit. Unlike SBS4, SBS29 shows a different distribution of C>A preferences, reflecting variations in the chemical composition and local exposure from products. Beyond , other environmental carcinogens produce analogous strand-biased signatures through formation. , a found in certain traditional remedies, induces signature 22 (SBS22), marked by A>T transversions at TW dinucleotide contexts (W = A or T), and is etiologically linked to upper urinary tract carcinomas in exposed populations, such as those in Balkan nephropathy-endemic areas or users of aristolochic acid-containing medicines. Similarly, , a produced by fungi contaminating foodstuffs like peanuts and corn in humid climates, causes signature 24 (SBS24) with transversions enriched at TpCN trinucleotide contexts, predominantly in hepatocellular carcinomas from high-exposure regions in and . Epidemiological analyses reveal that the contribution of SBS4 to tumor mutation burdens decreases with time since , as quitting halts further accumulation of tobacco-induced mutations while the signature persists as a record of prior exposure. This temporal decline underscores the preventive benefit of cessation in reducing ongoing genotoxic damage from these carcinogens.

Analysis methods and resources

Signature extraction and attribution

Signature extraction involves identifying the underlying mutational processes from catalogs of somatic mutations across tumor samples, typically represented as a where rows correspond to mutation types and columns to samples. The most widely adopted method is (NMF), which decomposes this V (of dimensions m \times n, where m is the number of mutation types and n the number of samples) into two non-negative matrices: W (of dimensions n \times k, representing the activity weights of k signatures across samples) and S (of dimensions k \times m, representing the signature profiles themselves), such that V \approx W S. This factorization assumes that the observed mutations result from a of independent mutational processes, with the goal of minimizing the while ensuring non-negativity to reflect biological plausibility. Seminal work established NMF as the foundation for de novo extraction of signatures from large cohorts, enabling the discovery of processes like spontaneous and UV-induced damage. Attribution assigns contributions from a predefined set of known signatures (e.g., from the COSMIC database) to individual samples or cohorts, often using (NNLS) regression to fit the observed spectrum as a weighted sum of reference profiles. is commonly employed to measure how well the reconstructed spectrum matches the observed data, with thresholds (e.g., >0.90) indicating reliable fits. Tools like SigProfiler facilitate both extraction and attribution; its SigProfilerExtractor module uses hierarchical NMF with to determine the optimal number of signatures and provides high accuracy in recovering known profiles from simulated data. Similarly, deconstructSigs employs NNLS for rapid attribution in single samples, allowing decomposition into COSMIC signatures and visualization of contributions, which has proven effective for identifying repair deficiencies like /2 mutations. For subclonal analysis, SciClone clusters mutations by variant allele frequencies to infer clonal architecture, enabling attribution of signatures to specific subclones and tracking evolutionary changes in process activity across tumor regions. Key challenges include , particularly in samples with low burdens (<100 mutations), where NMF may extract spurious signatures due to noise, leading to unstable decompositions. Multi-process overlap further complicates attribution, as correlated signatures (e.g., SBS1 and SBS5, both age-related) can yield ambiguous weights, reducing resolution in distinguishing co-occurring etiologies. Validation strategies mitigate these issues; bootstrap resampling, which repeatedly factorizes subsampled matrices to estimate signature stability, helps assess robustness and select the number of components via metrics like cophenetic correlation. Extracted or attributed signatures are benchmarked against the COSMIC v3.3 reference set, which includes 49 single-base substitution (SBS), 58 indel (ID), and other signatures derived from thousands of tumors, ensuring consistency with validated profiles.

Databases and computational tools

The Catalogue of Somatic Mutations in Cancer (COSMIC) maintains a comprehensive database of reference mutational signatures, curated from large-scale genomic datasets and peer-reviewed studies. In its latest version 3.4 (released October 2023 as part of COSMIC v98), the catalog includes 86 single-base substitution (SBS) signatures defined across 96 trinucleotide contexts, 11 doublet-base substitution (DBS) signatures across 78 types, 58 insertion/deletion (ID) signatures categorized into 83 feature types, and 12 rearrangement (RS) signatures based on structural variant patterns. These signatures are assigned etiologies where possible, such as age-related deamination for SBS1 or UV exposure for SBS7, drawing from experimental validations and clinical associations to aid in interpreting mutational processes. The catalog was initially derived using non-negative matrix factorization on the Pan-Cancer Analysis of Whole Genomes (PCAWG) dataset and has been iteratively updated with new signatures validated against thousands of tumor samples. The PCAWG dataset serves as a foundational benchmark resource for mutational signature analysis, encompassing whole-genome sequencing data from over 2,600 primary tumors across 38 cancer types, paired with normal tissues. This consortium effort, published in 2020, provides standardized mutation catalogs that enabled the extraction and validation of the core , including SBS, ID, and RS profiles, and remains widely used for testing signature attribution methods due to its scale and diversity. Researchers access PCAWG data through portals like the , facilitating reproducible analyses of signature contributions in pan-cancer contexts. Several computational tools support the analysis and visualization of mutational signatures, primarily implemented as R packages for integration with genomic workflows. The Yet Another Package for Signature Analysis (YAPSA) enables supervised decomposition of mutation catalogs into known signatures, incorporating confidence intervals for exposures and stratified analyses by genomic features, with built-in support for COSMIC and PCAWG reference sets. Similarly, sigfit applies Bayesian inference to fit and extract signatures from count data, offering flexible modeling of exposures and handling of catalog uncertainties, which improves robustness in low-mutation samples. MutationalPatterns provides comprehensive functions for generating 96-context SBS spectra, detecting strand biases, and attributing signatures via non-negative matrix factorization, with visualization capabilities such as heatmaps to display mutation type frequencies across samples. Recent updates to these resources, post-2022, have expanded COSMIC's experimental signatures section to integrate non-cancer data from model organisms and controlled exposures, enhancing etiology assignments for processes like oxidative damage or replication errors observed in both normal and tumor genomes. This inclusion supports broader applications in germline and somatic mutagenesis studies, while tools like and have incorporated these extended catalogs for more accurate cross-context analyses.

Applications and emerging areas

Role in cancer etiology and therapy

Mutational signatures provide critical insights into cancer etiology by serving as molecular footprints of underlying mutagenic processes, enabling the retrospective identification of causative exposures or defects. For example, signature SBS4, dominated by C>A transversions at CpCpA trinucleotides, is a hallmark of and has been detected in , head and neck, and bladder cancers, correlating with cumulative exposure levels and aiding in etiological classification beyond self-reported history. Similarly, homologous recombination deficiency (HRD) signatures, such as those reflecting /2-associated patterns (e.g., SBS3), trace or somatic defects in , which are prevalent in up to 50% of high-grade serous ovarian cancers and inform hereditary risk assessment. In terms of , specific signatures predict tumor aggressiveness, recurrence risk, and overall survival across cancer types. -related signatures (SBS2 and SBS13), arising from cytidine deaminase activity, are associated with worse outcomes in , ovarian, and non-small cell cancers due to their role in accelerating subclonal and immune evasion, with high APOBEC burden linked to reduced in . () signatures, characterized by elevated insertions/deletions at homopolymer repeats (e.g., ID1, ID2), indicate mismatch repair deficiency and correlate with favorable in but heterogeneous outcomes in others, often tied to hypermutated profiles that influence metastatic potential. Mutational signatures facilitate therapeutic decision-making through patient stratification and monitoring of treatment effects. HRD signatures predict sensitivity to poly(ADP-ribose) polymerase (PARP) inhibitors, as evidenced by improved progression-free survival in ovarian cancer patients with high HRD scores (e.g., via loss of heterozygosity or signature analysis), even in BRCA-wild-type cases, guiding targeted therapy selection. MSI signatures serve as biomarkers for immune checkpoint inhibitor response, with MSI-high tumors showing objective response rates exceeding 40% to PD-1 blockade in pan-cancer cohorts, including endometrial and colorectal cancers. Additionally, therapy-induced signatures like SBS11, featuring T>C transitions from temozolomide alkylation, emerge post-chemotherapy in gliomas and other tumors, signaling potential resistance mechanisms such as MGMT promoter methylation and informing adjustments to avoid hypermutator phenotypes. In precision oncology clinical trials, mutational signatures are increasingly integrated for trial enrichment and companion diagnostics. For instance, HRD signature testing has been pivotal in trials like PAOLA-1 for , where patients with HRD-positive profiles (detected via genomic scarring or substitution patterns) exhibited doubled with plus combinations, supporting FDA approvals for signature-based eligibility. This approach extends to basket trials evaluating in HRD-enriched non-s, enhancing therapeutic precision while minimizing off-target toxicities.

Non-cancer applications and future directions

Mutational signatures have been observed in normal human tissues, where they accumulate gradually with age due to endogenous processes. The signature SBS1, characterized by C>T transitions at CpG sites resulting from spontaneous of , dominates the mutational landscape in healthy cells such as neurons and stem cells across various organs. In aging brains, SBS1 mutations increase linearly with chronological age, reaching thousands of variants per by late adulthood, providing insights into baseline genomic instability. These patterns extend to applications in , where SBS1 and related clock-like signatures help reconstruct the timing and drivers of over millennia, distinguishing endogenous from exogenous processes in and . In non-cancer diseases, particularly inherited deficiencies, mutational signatures reveal heightened susceptibility to specific mutagens. For instance, in (XP), a disorder impairing , UV-induced signatures (SBS7 and SBS7a) predominate even in non-malignant skin cells exposed to , leading to accelerated mutation accumulation and serving as biomarkers for disease progression and environmental sensitivity. Similarly, viral integrations contribute distinct signatures in chronic infections; (HBV) integration generates rearrangement signatures (e.g., RS1) in liver cells, extending beyond oncogenesis to model persistent inflammatory damage in non-cancerous tissues during viral persistence. Emerging research has expanded the catalog of mutational signatures, with the COSMIC database incorporating additional profiles in its version 3.4 update (October 2023), which established 49 single-base substitution signatures, enhancing resolution for subtle processes. Advances in DNA damage mapping, including high-resolution techniques for alkylation and oxidative lesions, now enable improved high-resolution mapping of mutational burdens, facilitating early detection of genotoxic exposures in vivo. Artificial intelligence approaches, such as deep neural networks for signature decomposition, are predicting novel signatures by analyzing complex mutation spectra, improving de novo discovery in heterogeneous datasets. Recent advances include tools like joint inference methods for indels and substitutions (2025) and refined fitting algorithms such as RESOLVE (2025), improving accuracy in heterogeneous datasets. Looking forward, integration of mutational signatures with promises granular resolution of heterogeneity in normal and diseased tissues, allowing reconstruction of clone-specific processes at the cellular level. Non-human applications are gaining traction in , where signatures from model organisms exposed to pollutants delineate chemical-specific , aiding regulatory assessments of ecological risks.

History and development

The concept of mutational signatures began to take shape with advances in next-generation sequencing in the early 2010s, enabling the analysis of entire cancer genomes. The first systematic extraction of mutational signatures occurred in 2012, when researchers analyzed whole-genome sequences from 21 breast cancers and used (NMF) to identify five distinct (SNV) signatures reflecting underlying mutational processes. This approach was expanded in through a pan-cancer study of over 7,000 tumors, which cataloged 21 mutational signatures across various cancer types, linking some to environmental factors like and UV exposure. Subsequent work in 2015 further explored the etiologies, associating signatures with , , and other exposures using data from thousands of cancers. The field advanced significantly with the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium's 2020 publication, which analyzed over 23,000 cancer genomes and refined the catalog to include 49 single-base substitution signatures, along with doublet substitutions and indels, improving resolution and attribution. The Catalogue of Somatic Mutations in Cancer (COSMIC) database began integrating these signatures in 2013 and has since become the primary repository, with version 3.4 released in October 2023 incorporating expanded sets for SBS, DBS, ID, and other variant types. Post-2020 developments include experimental validation using models like organoids to link signatures to specific mutagens, and projects such as Cancer ' Mutographs (launched 2020, reporting in 2024-2025) identifying novel signatures from normal tissues and global cohorts. As of 2025, ongoing research continues to uncover therapy-induced and rare signatures, enhancing applications in precision oncology.

References

  1. [1]
    Signatures of mutational processes in human cancer | Nature
    Aug 14, 2013 · Each mutational signature is the imprint left on the cancer genome by a mutational process that may include one or more DNA damage and/or DNA ...
  2. [2]
    The repertoire of mutational signatures in human cancer | Nature
    Feb 5, 2020 · This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer.
  3. [3]
    COSMIC | Mutational Signatures
    ### Summary of Mutational Signatures Catalogue (v3.4, October 2023)
  4. [4]
    Signatures of mutational processes in human cancer - PMC - NIH
    We recently developed an algorithm to extract mutational signatures from catalogues of somatic mutations and applied it to 21 breast cancer whole genome ...
  5. [5]
    Understanding the origins of human cancer - Science
    Dec 4, 2015 · However, some of the identified mutational signatures were associated with age of cancer diagnosis, tobacco smoking, exposure to ultraviolet ...
  6. [6]
  7. [7]
  8. [8]
  9. [9]
  10. [10]
  11. [11]
  12. [12]
  13. [13]
  14. [14]
  15. [15]
  16. [16]
  17. [17]
  18. [18]
  19. [19]
    Mutational strand asymmetries in cancer genomes reveal ... - NIH
    Having established that we can observe and separate transcriptional and replicative strand asymmetries in two well-understood mutational processes, we performed ...
  20. [20]
    SBS - Mutational Signatures - COSMIC
    SBS1. Proposed Aetiology Spontaneous deamination of 5-methylcytosine (clock-like signature) · SBS2. Proposed Aetiology Activity of APOBEC family of cytidine ...
  21. [21]
    ID - Mutational Signatures - COSMIC
    COSMIC Mutational Signatures is a catalogue of curated reference mutational signatures. Curation covers aetiology, acceptance criteria, tissue distribution ...
  22. [22]
    SV - Mutational Signatures - COSMIC
    Structural variations (SVs) are large genomic changes typically exceeding 1kb in length, which impact the arrangement of the genome. Types of SVs include large ...
  23. [23]
  24. [24]
    5. Indel signature analysis - Bioconductor
    Jan 28, 2019 · A prerequisite for an analysis of Indel mutational signatures is a consistent classification system of Indels. This is non-trivial and critically depends on ...3 Pcawg Indel Signatures · 5.1 Loading Example Data · 5.3 Lcd Analysis
  25. [25]
    Joint inference of mutational signatures from indels and single ...
    Jul 3, 2025 · Many of these mutations have unclear impact, making it difficult to ascertain HRd just based on the presence of known pathogenic variants in a ...
  26. [26]
    Whole-genome sequencing is more powerful than whole-exome ...
    Although currently more expensive, WGS is more powerful than WES for detecting potential disease-causing mutations within WES regions, particularly those due ...
  27. [27]
    a database for deciphering mutational signatures in human cancers
    Nov 14, 2017 · The current set of mutational signatures in mSignatureDB is based on an analysis of 15 780 tumors across 73 TCGA/ICGC cancer projects. Our ...
  28. [28]
    A review of somatic single nucleotide variant calling algorithms for ...
    A collection of variant calling pipelines have been developed with different underlying models, filters, input data requirements, and targeted applications.
  29. [29]
    Tumor Mutational Burden (TMB) as a Predictive Biomarker in Solid ...
    Tumor mutational burden (TMB) is broadly defined as the number of somatic mutations per megabase of interrogated genomic sequence.
  30. [30]
    Mutational signature analyses in multi-child families reveal sources ...
    Nov 6, 2024 · We exploited mutational signatures from both cancer and non-cancer datasets to provide a comprehensive picture of the mechanisms involved in the ...
  31. [31]
    Deficiency of replication-independent DNA mismatch repair drives a ...
    Nov 3, 2021 · Deficiency of replication-independent DNA mismatch repair drives a 5-methylcytosine deamination mutational signature in cancer. Hu Fang https ...
  32. [32]
    The genomics of oxidative DNA damage, repair, and resulting ... - NIH
    Jan 7, 2020 · This review addresses what we currently know about the genome distribution of oxidative DNA damage, repair intermediates, and mutations.
  33. [33]
    Somatic mutation rates scale with lifespan across mammals - Nature
    Apr 13, 2022 · The somatic mutation rate per year (mutations per genome per year) ... mutations occur during cell division (Supplementary Table 7). We ...
  34. [34]
    Disentangling sources of clock-like mutations in germline and soma
    SBS1 is dominated by cytosine transitions in the CpG context and likely arises from the spontaneous deamination of methylated cytosines [6, 17]. The rate of ...<|control11|><|separator|>
  35. [35]
  36. [36]
    Mechanisms of APOBEC3 mutagenesis in human cancer cells | Nature
    Jul 20, 2022 · DNA glycosylases in APOBEC3 mutagenesis. SBS2 is characterized by C>T mutations, whereas SBS13 consists of C>G and C>A mutations. Processing of ...
  37. [37]
    SBS2 - Mutational Signatures - COSMIC
    SBS2 mutations may be generated directly by DNA replication across uracil or by error prone polymerases replicating across abasic sites generated by base ...
  38. [38]
    SBS13 - Mutational Signatures - COSMIC
    SBS13 is usually found in the same samples as SBS2. It has been proposed that activation of AID/APOBEC cytidine deaminases in cancer may be due to previous ...
  39. [39]
    An APOBEC cytidine deaminase mutagenesis pattern is ... - PubMed
    We show here that throughout cancer genomes APOBEC-mediated mutagenesis is pervasive and correlates with APOBEC mRNA levels.
  40. [40]
    APOBEC Mutational Signature and Tumor Mutational Burden as ...
    Mar 6, 2022 · Tumors harboring APOBEC mutational signature have been shown to have a higher TMB (16). Previously reported data also suggests that presence of ...Abstract · Introduction · Results · Discussion
  41. [41]
    APOBEC Mutational Signatures in Hormone Receptor–Positive ...
    Oct 31, 2022 · High TMB (TMB-H), which is associated with generation of neoantigens, has emerged as a biomarker for response to immune checkpoint inhibitors ( ...Introduction · Prevalence Of Apobec... · Discussion
  42. [42]
    SBS3 - Mutational Signatures - COSMIC
    SBS3 is strongly associated with germline and somatic BRCA1 and BRCA2 mutations and BRCA1 promoter methylation in breast, pancreatic, and ovarian cancers.
  43. [43]
    Mutational Signature 3 Detected from Clinical Panel Sequencing is ...
    Nov 1, 2022 · We find that Sig3 as detected by SigMA is positively associated with improved progression-free survival and objective responses.
  44. [44]
    SBS15 - Mutational Signatures - COSMIC
    SBS15 is one of seven mutational signatures associated with defective DNA mismatch repair and microsatellite instability (MSI)
  45. [45]
    Immunotherapy efficacy on mismatch repair-deficient colorectal cancer
    Compared to pMMR/MSS CRCs, dMMR/MSI CRCs typically have increased tumor mutational burden (TMB), lower response rate to 5-fluorouracil-based chemotherapy, ...
  46. [46]
    SBS17a - Mutational Signatures - COSMIC
    SBS17a is a mutational signature, part of SBS17, with unknown aetiology, and is associated with SBS17b. It is split into SBS17a and SBS17b.Missing: homologous end joining NHEJ
  47. [47]
    SBS7a - Mutational Signatures - COSMIC
    SBS7a may possibly be the consequence of just one of the two major known UV photoproducts, cyclobutane pyrimidine dimers or 6-4 photoproducts. However ...Missing: radiation | Show results with:radiation
  48. [48]
  49. [49]
    Repair of UVB-induced DNA damage is reduced in melanoma due ...
    Sep 20, 2016 · Reduced GGR in melanoma may contribute to the UV mutation spectrum of the melanoma ... NER deficiency is a key feature of melanoma. In contrast to ...
  50. [50]
  51. [51]
    Somatic mutations in facial skin from countries of contrasting skin ...
    Aug 3, 2023 · Keratinocyte cancer risk increases with an individual's cumulative ultraviolet (UV) exposure that depends on age, outdoor work, sunbathing, use ...
  52. [52]
    Biochemical reconstitution of temozolomide-induced mutational ...
    Sep 4, 2025 · Single-base substitution signature 11 (SBS11) has been linked to TMZ-treated cancer samples (24, 25). The vast majority (99%) of TMZ-treated ...Results · Replicative Dna Polymerase... · Hpol η Suppresses...Missing: procarbazine | Show results with:procarbazine
  53. [53]
    SBS11 - Mutational Signatures - COSMIC
    SBS11 exhibits a mutational pattern resembling that of alkylating agents. Patient histories indicate an association between previous treatment with the ...Missing: procarbazine | Show results with:procarbazine
  54. [54]
    SBS9 - Mutational Signatures - COSMIC
    Mutational profile using the conventional 96 mutation type classification. This classification is based on the six substitution subtypes.Missing: alkylating agents η
  55. [55]
    Detection of early seeding of Richter transformation in chronic ...
    Aug 11, 2022 · ... SBS9 (attributed to polymerase eta) and SBS18 (possibly damage by reactive oxygen species)); three had been only found in DLBCL (SBS2 and ...
  56. [56]
    DNA polymerase η mutational signatures are found in a variety ... - NIH
    We explored whether the mutational signature of pol η could be found in datasets of human somatic mutations derived from normal and cancer cells.
  57. [57]
    SBS4 - Mutational Signatures - COSMIC
    SBS4 is a mutational signature associated with tobacco smoking, likely due to direct DNA damage from tobacco smoke mutagens.Missing: pattern | Show results with:pattern
  58. [58]
    SBS29 - Mutational Signatures - COSMIC
    SBS29 has been found in cancer samples from individuals with a tobacco chewing habit. Comments. The pattern of C>A mutations in SBS29 appears different from ...
  59. [59]
    Tobacco exposure and somatic mutations in normal human ... - NIH
    Jul 29, 2020 · Thus, tobacco smoking increases mutation burden, cell-to-cell heterogeneity and driver mutations, but quitting promotes replenishment of ...<|control11|><|separator|>
  60. [60]
    deconstructSigs: delineating mutational processes in single tumors ...
    Feb 22, 2016 · deconstructSigs identifies samples with DNA repair deficiencies and reveals distinct and dynamic mutational processes molding the cancer genome.
  61. [61]
    SciClone: Inferring Clonal Architecture and Tracking the Spatial and ...
    We present a method for identifying the fraction of cells containing specific mutations, clustering them into subclonal populations, and tracking the changes in ...
  62. [62]
    Mutational signature assignment heterogeneity is widespread and ...
    Sep 22, 2023 · We demonstrate that this ensemble approach improves the accuracy of SBS3 assignment, a marker of homologous recombination deficiency (HRD), ...
  63. [63]
    A comprehensive comparison of tools for fitting mutational signatures
    Nov 2, 2024 · We fill this gap by comprehensively evaluating twelve signature fitting tools on synthetic mutational catalogs with empirically driven signature weights.
  64. [64]
    MSA: reproducible mutational signature attribution with confidence ...
    Nov 4, 2021 · ... replication infidelity or DNA enzymatic editing [1, 2]. Currently, on the order of 100 signatures have been discovered in human cancer, but ...
  65. [65]
    Pan-cancer analysis of whole genomes - Nature
    Feb 5, 2020 · Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan- ...
  66. [66]
    YAPSA - Bioconductor
    This package provides functions and routines for supervised analyses of mutational signatures (i.e., the signatures have to be known, cf. L. Alexandrov et al., ...
  67. [67]
    Analysis of mutational signatures with yet another package for ...
    We here present the R package yet another package for signature analysis (YAPSA) to deconvolute the contributions of mutational signatures to tumor genomes.
  68. [68]
    kgori/sigfit: Flexible Bayesian inference of mutational signatures
    sigfit is an R package to estimate signatures of mutational processes and their activities on mutation count data. Starting from a set of single-nucleotide ...
  69. [69]
    sigfit: flexible Bayesian inference of mutational signatures - bioRxiv
    Jan 17, 2020 · Here we present sigfit, an R package for mutational signature analysis that applies Bayesian inference to perform fitting and extraction of signatures from ...
  70. [70]
    MutationalPatterns - Bioconductor
    No information is available for this page. · Learn why
  71. [71]
    MutationalPatterns: comprehensive genome-wide analysis of ...
    Apr 25, 2018 · Consistently, transcriptional strand bias analysis of the mutation catalogues detects a strong bias for Signature A (Fig. 3a, b), confirming ...
  72. [72]
    Dissection of Cancer Mutational Signatures with Individual ... - NIH
    Mar 28, 2023 · The mutational signatures arising from benzo[a]pyrene and norharmane were similar to human lung cancer signatures attributed to tobacco smoking.
  73. [73]
    Homologous recombination deficiency (HRD) testing landscape
    Feb 21, 2025 · A mutational signature reveals alterations underlying deficient homologous recombination repair in breast cancer. Nat Genet. 2017;49:1476–86 ...
  74. [74]
    APOBEC mutational signature predicts prognosis and ... - NIH
    Mar 31, 2023 · Our review provides new insights into using the APOBEC3 mutation to predict prognosis and improve the immunotherapy response for future applications.Missing: TpC | Show results with:TpC
  75. [75]
  76. [76]
    A panel-based mutational signature of homologous recombination ...
    SigMA-predicted HRD may better identify patients likely to benefit from olaparib as compared to the current FDA label.
  77. [77]
    Tumor mutational burden is predictive of response to immune ...
    Microsatellite instability (MSI) is a biomarker for response to immune checkpoint inhibitors (ICPIs). PD-1 inhibitors in metastatic colorectal carcinoma ...
  78. [78]
    Mutational signatures are markers of drug sensitivity of cancer cells
    Mutational signatures can reveal the impact of mutagenic processes in cancer, including exposure to therapy.Missing: procarbazine | Show results with:procarbazine
  79. [79]
    Mutational Signatures in Cancer: Laboratory Considerations and ...
    This review aims to describe the scientific principles, laboratory considerations, and emerging clinical applications of mutational signatures in human cancer.Missing: AAT | Show results with:AAT
  80. [80]
    Somatic mutations in aging and disease - PMC - NIH
    Somatic mutational signatures can provide information on specific mutational conditions in a tissue at a given age, such as DNA replication infidelity, ...
  81. [81]
    The Dynamics of Somatic Mutagenesis During Life in Humans
    Dec 16, 2021 · Most somatic mutations in stem cells of normal tissues are characterized by different contributions of mutational signatures SBS1 and SBS5, but ...
  82. [82]
    Signatures of Mutational Processes in Human DNA Evolution | bioRxiv
    Jan 9, 2021 · At least four mutational processes are responsible for human genetic variations. One process is European-specific and no longer active.
  83. [83]
    Genomic mutation landscape of skin cancers from DNA repair ...
    May 4, 2023 · To better understand how the NER deficiency modulates the protein-damaging effect of UV irradiation we grouped protein-damaging mutations into ...Missing: amplifies | Show results with:amplifies
  84. [84]
    The landscape of viral associations in human cancers - Nature
    Feb 5, 2020 · We extensively characterized the known and novel viral associations by integrating driver mutations, mutational signatures, gene expression ...
  85. [85]
    [PDF] cosmicsig: Mutational Signatures from COSMIC (Catalogue ... - CRAN
    The cosmicsig package contains mutational signature profiles from COSMIC, including single base substitutions, doublet base substitutions, and small insertions ...
  86. [86]
    Quantification and Mapping of Alkylation in the Human Genome ...
    Feb 22, 2023 · Evaluating DNA damage profiles with mutational signatures requires the availability of single nucleotide resolution DNA damage maps, and ...Figure 2 · Figure 3 · Bpde Damage Patterns Share...
  87. [87]
    Mutational signature decomposition with deep neural networks ...
    Dec 8, 2023 · We present a novel approach to signature decomposition using artificial neural networks that addresses these problems.Results · Signet Algorithm · Methods
  88. [88]
    Single-cell genomic variation induced by mutational processes in ...
    Oct 26, 2022 · Single-cell WGS can readily decompose clone-specific and cellular genomic events, enabling the calculation of copy number alteration (CNA) and ...
  89. [89]
    The mutational signature profile of known and suspected human ...
    Cellular and Molecular Pathology Branch, Division of National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, ...