Fact-checked by Grok 2 weeks ago

Fold change

Fold change is a fundamental metric in biomedical and statistical research used to quantify the relative magnitude of differences between two groups or conditions, typically calculated as the of the in one group to that in another, such as versus . Originating in studies around the early 2000s, it has become widely applied across fields—including , , and —to identify differentially expressed genes, proteins, or other variables by highlighting proportional changes rather than absolute ones. For instance, a fold change of 2 indicates that the quantity in the experimental group is twice that of the reference group, while values below 1 signify decreases; to handle bidirectional changes and improve interpretability, it is often transformed into a , such as log₂(fold change), where positive values denote upregulation and negative values downregulation. In practice, fold change is computed using various methods to aggregate data from multiple samples, including the of ratios, , or , with the latter two preferred for their robustness against outliers, unequal variances, or skewed distributions common in . It plays a central role in differential analysis pipelines, such as those for or data, where it is combined with statistical tests (e.g., t-tests or moderated approaches) to rank features in volcano plots—graphical representations plotting log₂() against (e.g., -log₁₀()). This aids in selecting biologically relevant subsets for downstream applications like models or pathway enrichment analysis. Despite its intuitiveness and prevalence—evidenced by over 10,000 entries by 2024—fold change has limitations, including sensitivity to sample size, noise in high-dimensional data, and lack of inherent , necessitating integration with p-values or corrections to avoid false positives. Researchers are encouraged to report the exact calculation method for , as inconsistencies can lead to variable results across studies. Beyond , the concept extends to other domains like (e.g., dose-response ratios) and (e.g., multiples), underscoring its versatility as a relative measure in comparative analyses.

Definition and Basic Concepts

Standard Definition

Fold change quantifies the by which a changes from an initial value, such as a or , to a final value, such as a or experimental , typically expressed as the final divided by the initial. This multiplicative measure captures proportional alterations, making it suitable for comparing changes across varying scales in scientific contexts. For instance, if a biological increases from 10 to 20, the fold change is 2, signifying it has doubled; if it decreases from 20 to 10, the fold change is 0.5, indicating it has been halved. In contrast to absolute change, which computes the arithmetic difference (e.g., +10 or -10 in the examples above), fold change emphasizes relative proportions, enabling scale-invariant assessments that are particularly valuable when initial values differ widely, as often occurs in experimental . Logarithmic transformations of fold change, such as log₂(fold change), are sometimes used to symmetrize increases and decreases around zero, though the standard form remains the direct ratio.

Alternative Formulations

While the standard fold change is typically expressed as the ratio of final to initial values, a related measure known as is defined as (final - initial)/initial. This measure is equivalent to the standard minus 1 and is useful for expressing the change as a proportion of the original value. A related variant is percentage change, calculated as $100 \times (final - initial)/initial, which scales the by 100 for more intuitive reporting in everyday contexts. For instance, a doubling from an initial value corresponds to a 100% increase. To illustrate, consider values changing from 10 to 15: the standard fold change is 1.5, the relative change is 0.5, and the percentage change is 50%. In contrast to the multiplicative nature of the standard , they provide a more direct assessment of proportional shifts relative to the .

Mathematical Properties

Calculation Methods

The fold change is calculated as the of the final to the initial , expressed as \text{fold change} = \frac{\text{final}}{\text{initial}}. This direct division provides a measure of , where values greater than 1 indicate an increase and those less than 1 indicate a decrease. When the initial value is zero, direct division results in undefined or infinite fold changes, which is common in count-based data such as RNA sequencing reads. To handle this, a small pseudocount \epsilon is added to both the numerator and denominator, yielding \text{fold change} = \frac{\text{final} + \epsilon}{\text{initial} + \epsilon}, where \epsilon is typically set to 1 or a value specific to the data scale, such as the minimum detectable count. This approach stabilizes computations while approximating the true ratio without introducing substantial bias for non-zero values. For experiments with replicates, normalization precedes fold change computation to account for variability. Replicate expression values are first averaged—often using the for ratio-based data, as it better preserves multiplicative relationships—before calculating the fold change between group means. This step reduces noise and ensures the ratio reflects central tendencies rather than outliers. Computationally, fold changes are efficiently calculated in batch using statistical software that implements linear models or Bayesian shrinkage. In , the limma package fits moderated linear models to normalized data, deriving fold changes as contrasts between conditions while incorporating empirical Bayes adjustments for stability across multiple features. Similar algorithmic steps in libraries, such as those in or scanpy, involve data normalization, model fitting, and ratio extraction, emphasizing vectorized operations for large datasets. For edge cases where the initial value is near zero even after pseudocount addition, results are sometimes reported qualitatively as "greater than N-fold" (e.g., >10-fold), where N is derived from the detection threshold or pseudocount limit, to convey magnitude without implying infinity.

Logarithmic Transformations

Logarithmic transformation of fold changes addresses the multiplicative nature of ratios by converting them to an additive scale, facilitating statistical analysis and interpretation. The log fold change is computed as \log \left( \frac{\text{final}}{\text{initial}} \right) = \log (\text{final}) - \log (\text{initial}), where the logarithm is typically base 2 (\log_2) in biological contexts for intuitive interpretation of doublings or halvings— for instance, \log_2(2) = 1 indicates a twofold up-regulation, while \log_2(0.5) = -1 indicates a twofold down-regulation. This transformation yields several key mathematical properties that enhance its utility. On the log scale, fold changes exhibit around zero: positive values denote increases (up-regulation), and negative values denote equivalent proportional decreases (down-regulation), avoiding the asymmetry of raw ratios where a twofold increase (2) and twofold decrease (0.5) do not balance. Additionally, the log scale converts multiplicative effects to additive ones, enabling straightforward averaging, variance stabilization, and application of linear models for differential analysis across multiple conditions or replicates. A fundamental derivation underscores the appropriateness of averaging on the log scale: the arithmetic mean of log fold changes equals the logarithm of the geometric mean of the corresponding raw fold changes, given by \frac{1}{n} \sum_{i=1}^n \log (FC_i) = \log \left( \prod_{i=1}^n FC_i^{1/n} \right), where FC_i is the i-th fold change. This property is particularly valuable in multi-condition experiments, as it provides a multiplicatively interpretable summary of changes without toward extreme ratios. For example, consider a 4-fold increase (\log_2 = 2) and a 2-fold decrease (\log_2 = -1); their on the log scale is \frac{2 + (-1)}{2} = 0.5, corresponding to \log_2(1.414) \approx 0.5 or a net 1.4-fold increase via the . This contrasts with averaging raw fold changes (4 and 0.5), which yields 2.25—an arithmetically misleading result that overstates the net effect.

Applications

In and

In and , fold change serves as a fundamental metric for quantifying differences in levels between biological conditions, such as treated versus control samples or diseased versus healthy tissues. This measure is particularly prominent in the analysis of data from s and RNA sequencing (), where it assesses the magnitude of differential expression for thousands of genes simultaneously. The concept gained widespread adoption in the with the advent of technology, which enabled high-throughput monitoring of patterns, and played a key role in studies following the around 2000. In microarray experiments, fold change is calculated as the ratio of normalized expression intensities between two conditions, often using (cDNA) probes spotted on arrays to detect relative mRNA abundances. Similarly, in , it derives from normalized read counts, capturing changes in transcript levels across conditions like drug treatment in cell lines. A common threshold for identifying biologically relevant differential expression is a fold change greater than 2 (upregulation) or less than 0.5 (downregulation), as these cutoffs highlight genes with substantial magnitude shifts while filtering noise. For instance, log-transformed fold changes are frequently employed in these analyses to symmetrize up- and downregulation and facilitate statistical modeling. A representative application occurs in , where a exhibiting a 3-fold upregulation in tumor compared to may signal a potential driving proliferation. Such findings, derived from or profiles, guide hypothesis generation for oncogenic pathways, as seen in studies of where upregulated genes like show consistent fold changes across cohorts. Fold change integrates with statistical metrics like p-values in visualization tools such as volcano plots, where it assesses expression magnitude on the x-axis (often as log2 fold change) while p-values evaluate significance on the y-axis, prioritizing genes with both large effect sizes and low error probabilities. This combination underscores fold change's role in emphasizing biologically impactful differences beyond mere statistical detection.

In Other Scientific Fields

In , fold change is commonly used to quantify differences in drug potency, particularly through metrics like the half maximal inhibitory concentration (IC50), where a 10-fold decrease in IC50 value between compounds indicates a 10-fold increase in potency. For instance, in studies of antiretroviral , fold changes in IC50 relative to wild-type viruses help assess how reduce , with values often exceeding 10-fold signaling significant clinical implications. In , fold change describes relative growth in indicators such as (GDP), highlighting expansion over time periods. Emerging markets have shown substantial increases; for example, Brazil's nominal GDP rose from approximately $237 billion in 1980 to $1.44 trillion in 2020, representing about a 6-fold growth. This metric aids analysts in comparing economic trajectories across regions, such as the collective rise in emerging and developing economies' share of global GDP from 25% in 2000 to 45% by the mid-2020s, driven by compounded annual growth rates. In physics and engineering, particularly , fold change quantifies variations, such as in filters to suppress unwanted frequencies. For example, anti-aliasing filters in analog-to-digital converters are often designed for 60 attenuation, equivalent to a 1,000-fold reduction in signal to prevent folding. In wireless communications, materials like brick can cause 40 attenuation in millimeter-wave signals, corresponding to a 100-fold decrease, which informs system design for propagation losses. In climate science, fold change measures relative shifts in atmospheric concentrations, such as (CO2). From pre-industrial levels of about 280 parts per million (ppm) to approximately 425 ppm as of 2025, this represents a 1.5-fold increase, underscoring human-induced enhancements beyond natural variability over millennia.

Interpretation and Limitations

Statistical Significance

Assessing the of an observed fold change requires evaluating whether the difference in expression or measurement levels between conditions is likely due to chance, rather than a true biological effect. Fold change alone is insufficient for this determination, as it does not account for variability in the data; instead, statistical tests such as the t-test or analysis of variance (ANOVA) are applied to log-transformed data to compute p-values that indicate the probability of observing the difference under the of no change. This approach uses the log fold change, typically log2-transformed, as the variable for testing, which normalizes the data and stabilizes variance across scales. Fold change serves as a measure of biological , quantifying the magnitude of change, but its interpretation is strengthened by combining it with , such as the 95% around the log2 fold change estimate. These intervals provide a range within which the true likely lies, helping to assess precision and reliability; for instance, a narrow around a log2 fold change of 1 (corresponding to a 2-fold increase) supports a more robust finding than a wide one. In practice, a 2-fold change accompanied by a less than 0.05 from a t-test on replicates may indicate , but small sample sizes can inflate false positive rates by increasing the likelihood of spurious detections due to noise. In high-throughput experiments, such as those in , multiple testing across thousands of features necessitates correction to control the overall error rate. The (FDR) correction, particularly the Benjamini-Hochberg procedure, is widely recommended to adjust p-values and limit the proportion of false positives among significant results, ensuring that only a controlled (e.g., 5%) of declared changes are expected to be erroneous. This method is particularly valuable when fold changes are modest, as it balances discovery power with reliability in large-scale analyses.

Common Pitfalls and Considerations

One common pitfall in interpreting fold changes is ignoring the expression scale, where large relative changes from low levels often reflect technical noise rather than biological . For instance, a 10-fold increase from 0.1 to 1 unit may arise from variation in low-count data, whereas a 1.1-fold change from 100 to 110 units represents a more substantial absolute shift despite the modest . Fold changes are inherently context-dependent and directional, with positive values indicating upregulation and negative values (in log scale) signaling downregulation, but their biological varies by experimental system and expression levels. To enhance clarity, fold changes should always be reported alongside values or normalized counts, avoiding isolated reliance on ratios that can mislead without this context. A for reliable interpretation, particularly in small-sample studies, involves using moderated fold changes within Bayesian frameworks, such as limma's , which shrinks estimates toward a to stabilize variability across genes and reduce false positives from noisy . Early analyses frequently overlooked batch effects, leading to inflated or spurious fold changes; post-2010 standards have emphasized techniques to mitigate these technical artifacts, improving in high-throughput .

References

  1. [1]
    Revisiting Fold-Change Calculation: Preference for Median or ...
    Fold change provides an intuitive measure of the magnitude of the difference between groups. Using l o g 2 allows direct interpretation of how many times ...
  2. [2]
    [PDF] Introduction to Statistics for Differential Gene Expression Analysis
    • Thus, we often take the log of fold change values for data science purposes = log fold change (LFC). ➢ A negative LFC indicates lower overall values in ...Missing: definition | Show results with:definition
  3. [3]
    [PDF] A comparison of fold-change and the t-statistic for microarray data ...
    Feb 1, 2007 · Simulations suggest that whether fold-change or a modified t-statistic results in more accurate gene lists depends on whether one is interested ...
  4. [4]
    Fold-change detection in biological systems - ScienceDirect.com
    Fold-change detection (FCD) means a dynamic response determined by relative changes in input, not absolute changes. · FCD is found in diverse cellular signaling ...Missing: statistics | Show results with:statistics
  5. [5]
    Section PF.2 – Absolute and Relative Change – College Mathematics
    Relative change gives a percent change. If the relative change is positive this is a percent increase. If the relative change is negative this is a percent ...Missing: formula | Show results with:formula
  6. [6]
    None
    ### Summary of Percentage Change Calculation and Use in Economics
  7. [7]
    How to Calculate Fold Change in Excel - Bricks
    Feb 12, 2025 · Fold change is calculated in Excel using the formula: Final Value / Initial Value. For example, =B2/A2.
  8. [8]
    Moderated estimation of fold change and dispersion for RNA-seq ...
    Dec 5, 2014 · We present DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and ...Missing: formulations percentage
  9. [9]
    Estimating pseudocounts and fold changes for digital expression ...
    We build on a statistical framework for fold changes, where pseudocounts correspond to the parameters of the prior distribution used for Bayesian inference of ...
  10. [10]
    Naught all zeros in sequence count data are the same - PMC - NIH
    Last, we designed a Pseudo-Count (PC) model that does not model any ZGPs, but rather avoids numerical issues with zeros by adding a fixed positive pseudo-count ...
  11. [11]
    limma User's Guide - Bioconductor
    No information is available for this page. · Learn why
  12. [12]
    limma powers differential expression analyses for RNA-sequencing ...
    limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments.
  13. [13]
    Quantitative monitoring of gene expression patterns with a ... - PubMed
    Quantitative monitoring of gene expression patterns with a complementary DNA microarray ... M Schena , D Shalon, R W Davis, P O Brown. Affiliation. 1 ...
  14. [14]
    How to choose log2FC thresholds for DGE analysis - biostatsquid.com
    Common values are: |log₂FC| ≥ 1 (2-fold change) – most common, good balance; |log₂FC| ≥ 0.5 (1.4-fold change) – more sensitive, catches subtle changes; |log₂FC| ...
  15. [15]
    A pan-cancer transcriptomic study showing tumor specific alterations ...
    Jul 1, 2021 · The color bar on the right represents the log-fold changes; red shows genes upregulated in tumors and blue shows genes downregulated in tumors ...
  16. [16]
    Gene expression trend changes in breast cancer populations over ...
    We determined that three oncogenes, PD-L2, ETV5, and MTOR and 113 long intergenic non-coding RNAs (lincRNAs) were constantly up-regulated, whereas two oncogenes ...Missing: upregulation | Show results with:upregulation
  17. [17]
    an interactive tool for pathway guided visualization of differential ...
    A common tool for visualizing this type of data is the volcano plot where each feature is plotted as the log2 transformed fold change along the x-axis and the ...
  18. [18]
    Within the fold: assessing differential expression measures and ...
    'Fold-change' cutoffs have been widely used in microarray assays to identify genes that are differentially expressed between query and reference samples.<|separator|>
  19. [19]
    Dose–response curve slope is a missing dimension in the analysis ...
    Apr 18, 2011 · IC50 is used as a measure of the potency of antiretroviral drugs. Resistance is typically expressed as the fold change in IC50 relative to WT.Missing: IC50 | Show results with:IC50
  20. [20]
  21. [21]
    Emerging and Developing Economies in the Twenty-First Century
    Jul 10, 2025 · These economies now account for about 45 percent of global GDP, up from about 25 percent in 2000, a trend driven by robust collective growth in ...
  22. [22]
    Fundamentals of Sigma-Delta ADCs
    FIR Anti-aliasing Filter. • 60dB attenuation is required to suppress folding noise by system simulations. • Third order FIR filter is chosen and provides 80 ...Missing: change | Show results with:change
  23. [23]
    Carbon dioxide now more than 50% higher than pre-industrial levels
    Jun 3, 2022 · Prior to the Industrial Revolution, CO2 levels were consistently around 280 ppm for almost 6,000 years of human civilization. Since then, humans ...
  24. [24]
    Trends in CO2 - NOAA Global Monitoring Laboratory
    Sep 5, 2025 · August 2024: 422.99 ppm ... The last five complete years of the Mauna Loa CO2 record plus the current year are shown in the first graph.Data · Last 1 Year · Last Month · Global
  25. [25]
    Should We Abandon the t-Test in the Analysis of Gene Expression ...
    A first approach to identify differentially expressed genes is known as the Fold-Change estimation (FC). It evaluates the average log-ratio between two ...<|control11|><|separator|>
  26. [26]
    [PDF] Tests for Fold Change of Two Means (Log-Normal Data) - NCSS
    State the statistical hypotheses in terms of the fold change (ratio) of the means. 2. Transform this into hypotheses about a difference by taking logarithms.
  27. [27]
    Statistical tests for differential expression in cDNA microarray ...
    Genes with large fold-change values will lie outside a pair of vertical threshold lines.
  28. [28]
    Topconfects: a package for confident effect sizes in differential ...
    Mar 28, 2019 · We propose a method of ranking by confidence bounds on the log fold change, based on the previously developed TREAT test.
  29. [29]
    Testing significance relative to a fold-change threshold is a TREAT
    The biological significance of a given fold-change is likely to depend on the gene and on the experimental context. On the other hand, it is reasonable to ...
  30. [30]
    Statistical significance for genomewide studies - PNAS
    Here we propose an approach to measuring statistical significance in these genomewide studies based on the concept of the false discovery rate.Missing: size | Show results with:size
  31. [31]
    Clipper: p-value-free FDR control on high-throughput data from two ...
    Oct 11, 2021 · The most widely used criterion to ensure the analysis reliability is the false discovery rate (FDR), which is primarily controlled based on p- ...
  32. [32]
    When Two-Fold Is Not Enough: Quantifying Uncertainty in Low-Copy ...
    Our empirical data highlight that at low template concentrations, stochastic sampling dominates technical noise, impairing the reliability of even large fold ...<|separator|>
  33. [33]
    limma powers differential expression analyses for RNA-sequencing ...
    For each coefficient in the linear model or contrast, empirical Bayes moderated t-statistics and their associated P-values are generally used to assess the ...
  34. [34]
    Tackling the widespread and critical impact of batch effects in high ...
    Sep 14, 2010 · After normalization, these global differences are greatly reduced (FIG. 1b), and the sTCC study properly normalized the data. However, it is ...Missing: post- | Show results with:post-