Fact-checked by Grok 2 weeks ago

Complex traits

Complex traits, also referred to as polygenic or quantitative traits, are phenotypic characteristics in organisms that arise from the combined and interactive effects of multiple genetic variants across the and various environmental influences, distinguishing them from simple Mendelian traits that are primarily controlled by variations at a single locus. These traits exhibit continuous variation rather than discrete categories and often display incomplete and variable expressivity, where the same genetic variants may produce different outcomes depending on genetic background and external conditions. The genetic architecture of complex traits is typically polygenic, involving numerous loci with small individual effects that collectively contribute to phenotypic variation, alongside gene-gene interactions () and gene-environment interactions. Heritability estimates for complex traits generally range from 40% to 70%, indicating a substantial but not exclusive genetic component, as environmental factors such as , , and exposures play critical roles in modulating outcomes. Early studies of complex traits faced significant challenges, including the "missing heritability" problem where identified genetic variants explained only a of observed variation, though advances in genomic technologies have since improved detection of these subtle effects. Common examples of complex traits include physical attributes like and , as well as behavioral and physiological features such as cognitive abilities and metabolic rates. In human health, many prevalent diseases qualify as complex traits, including , , , and certain cancers, where polygenic risk scores derived from genetic data can predict susceptibility but require integration with environmental data for accurate assessment. Understanding these traits is essential for fields like , as it enables the identification of at-risk individuals and the development of targeted interventions. To dissect complex traits, researchers employ approaches such as genome-wide association studies (GWAS), which scan millions of genetic variants in large populations to pinpoint associated loci, and systems genetics methods that model interactions across biological pathways. These techniques, often combined with computational modeling of metabolic and developmental networks, reveal how robustness mechanisms buffer traits against perturbations, facilitating predictions of phenotypic outcomes under varying conditions. Despite progress, ongoing challenges include capturing rare variants, non-coding regulatory elements, and dynamic environmental interactions to fully elucidate the omnigenic nature of many complex traits.

Introduction

Definition and Characteristics

Complex traits, also known as quantitative or polygenic traits, are phenotypic characteristics influenced by the combined effects of multiple genetic variants across numerous loci, along with environmental factors, rather than a single following patterns. These traits result from interactions including additive effects (where contribute independently), dominance effects (where one masks another), and epistatic effects (where one modifies the expression of another), leading to continuous or discrete phenotypes that do not segregate in simple ratios. Unlike Mendelian traits, which are typically controlled by a single locus with large effect and exhibit discrete phenotypes like presence or absence, complex traits involve many loci each with small effects, producing a spectrum of variation that blurs clear categorical boundaries. Key characteristics of complex traits include continuous variation, often approximating a in populations, as seen in traits like where individuals span a wide range of values rather than discrete classes. They also demonstrate incomplete , where not all individuals with a predisposing genetic background express the trait, and variable expressivity, where the trait's severity or form varies among those who do express it, largely due to gene-environment interactions and polygenic modifiers. This sensitivity to environmental influences, such as nutrition or lifestyle, further complicates prediction, as the emerges from the interplay of genetic liability and external conditions. For discrete complex traits, such as disease susceptibility, the liability model posits an underlying continuous liability distribution—shaped by polygenic and environmental factors—where individuals exceeding a certain manifest the , while those below do not, explaining the apparent binary outcomes from continuous genetic underpinnings. This model highlights how complex traits can appear categorical yet stem from quantitative genetic architecture, distinguishing them fundamentally from the deterministic patterns of .

Examples and Importance

Complex traits are exemplified by , a polygenic influenced by thousands of genetic variants, as demonstrated by large-scale genome-wide association studies (GWAS) identifying over 12,000 variants explaining a substantial portion of its . Similarly, (BMI), a measure of risk, involves common variants like those near the , with GWAS revealing hundreds of loci contributing to its variation. (IQ), reflecting cognitive ability, is another polygenic trait, where GWAS have pinpointed numerous loci associated with and cognitive performance as proxies. Susceptibility to diseases such as , involving over 280 genetic loci identified through consortium efforts, and , linked to variants in genes like TCF7L2 and hundreds of other loci, further illustrate complex traits in humans with polygenic underpinnings. In non-human organisms, complex traits include in , where quantitative trait loci (QTL) mapping and GWAS in have identified genomic regions controlling grain production under varying environments. Milk production in , such as , is governed by polygenic factors, with GWAS detecting variants associated with and traits to enhance efficiency. Behavioral traits in model organisms like , including aggression and geotaxis, exhibit complex genetic architectures, with QTL studies revealing multiple interacting loci influencing these phenotypes. These traits play a pivotal role in and , as polygenic variation enables populations to respond to environmental pressures, such as shifts in or diet that favor certain height or metabolic profiles. In agriculture, understanding complex traits supports and genomic selection, accelerating genetic gains for yield by up to 50% in compared to traditional methods. For human health, polygenic risk scores derived from these traits aid by stratifying risk, enabling early interventions for conditions like or . Economically, genomics-driven improvements in productivity have contributed to increases in animal-derived , while similar advances in crops support global yield enhancements. Societal implications of complex trait research include ethical concerns over polygenic risk scores, such as potential misuse in through embryo selection or in underwriting based on predicted susceptibility.

Historical Development

Early Observations

Early observations of complex traits, which exhibit continuous variation rather than discrete categories, date back to ancient times but gained scientific traction in the through naturalists' studies of patterns that deviated from simple blending or particulate models. , in his 1868 theory of , proposed that hereditary information was carried by from all parts of the body, which could blend during reproduction to explain the intermediate forms seen in offspring, such as variations in plant and animal heights or sizes that did not strictly follow parental extremes. This theory aimed to account for the apparent blending observed in many traits, like flower color dilution in crosses, but it struggled to explain the persistence of variation across generations without complete homogenization. By the late , biometricians began quantifying continuous variation in natural populations, highlighting traits influenced by multiple factors. , in his 1883 work Inquiries into Human Faculty and Its Development, coined the term "" to describe how offspring of exceptionally tall or short parents tended to revert toward the population mean in height, based on measurements of heights in 205 families, involving over 900 children, demonstrating that human stature followed a rather than discrete classes. This observation, further detailed in his 1889 Natural Inheritance, underscored the polygenic nature of such traits and their deviation from Mendelian ratios. Similarly, Walter Frank Raphael Weldon conducted early biometric studies in the 1890s, measuring shell sizes in shore crabs () to show continuous variation correlated with environmental factors, challenging the idea of inheritance as solely discrete units and emphasizing statistical approaches to non-Mendelian traits. These findings initially clashed with emerging Mendelian genetics, as early proponents resisted integrating continuous traits. , a key advocate of Mendel's particulate inheritance after its 1900 rediscovery, initially dismissed biometric models as overlooking discontinuous variations in his 1894 Materials for the Study of Variation, but by the early 1900s, he acknowledged that many traits, like or crop yields, involved complex interactions beyond simple 3:1 ratios. In 1909, introduced the concept of "" to distinguish stable heritable factors from observable phenotypes, providing a framework for understanding how multiple genotypes could produce continuous phenotypic distributions in traits like bean seed weight. These pre-20th-century insights laid the observational groundwork for recognizing complex traits, bridging empirical patterns with theoretical inheritance models.

Foundations of Quantitative Genetics

The foundations of quantitative genetics were established in the early 20th century through efforts to reconcile Mendelian inheritance with the continuous variation observed in complex traits, a challenge that biometrical approaches had previously addressed without genetic mechanisms. Ronald A. Fisher's seminal 1918 paper, "The Correlation Between Relatives on the Supposition of Mendelian Inheritance," provided a mathematical framework integrating Mendelian genetics with biometrics, demonstrating that polygenic inheritance at many loci could produce the normal distributions typical of quantitative traits. In this work, Fisher introduced the partitioning of phenotypic variance into genetic and environmental components, where the total phenotypic variance V_P is expressed as the sum of genetic variance V_G and environmental variance V_E, further subdividing V_G to distinguish additive effects from dominance and epistasis. He also formalized the concept of additive genetic variance, representing the variance due to the average effects of alleles, which underpins predictions of resemblance among relatives and response to selection. Central to Fisher's framework was the infinitesimal model, which assumes that complex traits are influenced by a large number of loci, each contributing small, equal effects, leading to a near-normal distribution of breeding values even under Mendelian segregation. This model facilitated the analysis of traits without identifying individual genes, emphasizing the cumulative action of polygenes. Building on this, developed path analysis in his 1921 paper "Correlation and Causation," introducing a graphical method to quantify causal relationships among variables in genetic systems, such as the paths from genotypes to phenotypes via environmental influences. Wright's approach allowed for the decomposition of correlations into direct and indirect effects, enhancing the understanding of how multiple factors contribute to trait variation in populations. In the , Jay L. Lush advanced these principles through his models, outlined in his influential book Animal Breeding Plans (1943), which applied quantitative genetic theory to practical improvement. Lush emphasized the use of estimates and selection indices based on additive genetic variance to predict breeding outcomes, integrating Fisher's and Wright's ideas into systematic programs for enhancing traits like milk yield and growth rate in and . His work demonstrated how partitioning variance could guide to maximize genetic gain while accounting for environmental interactions. The 1930s marked a key milestone with the evolutionary synthesis, where quantitative genetics merged with population genetics through works like Fisher's The Genetical Theory of Natural Selection (1930), Wright's "Evolution in Mendelian Populations" (1931), and J.B.S. Haldane's The Causes of Evolution (1932), formalizing how polygenic traits evolve under natural and artificial selection. Following World War II, quantitative genetics expanded rapidly in agricultural applications, particularly for improving crop yields—such as in wheat and maize breeding programs that doubled productivity through selection on polygenic traits—fueling the Green Revolution and global food security efforts.

Classification of Complex Traits

Quantitative Traits

Quantitative traits, also known as continuous traits, are phenotypic characteristics that exhibit a continuous range of variation within a , without distinct categories, and are typically measurable on a numerical scale. These traits result from the combined effects of multiple genetic loci, each contributing small incremental effects, along with environmental influences, leading to a smooth gradation of phenotypes. In natural populations, the distribution of such traits often approximates a (Gaussian) curve, reflecting the additive nature of polygenic inheritance as first theoretically established by Ronald A. Fisher in 1918. Statistically, quantitative traits are characterized by their population means and variances, which encapsulate both genetic and environmental contributions to variation. The bell-shaped distribution arises primarily from the summation of numerous small, across loci, producing a central tendency around the mean with tails representing extreme values. This polygenic architecture, where many genes of small effect predominate, explains why individual loci are difficult to resolve without advanced techniques, and why the trait's expression shows gradual shifts rather than abrupt changes. Prominent examples include human stature and , where measurements vary continuously among individuals due to polygenic and environmental factors. In agriculture, plant height and yield in breeding programs, such as those for crops like or , serve as key quantitative traits targeted for improvement through selection. The basic modeling of quantitative traits relies on the normal distribution to describe phenotypic variation, with the phenotypic value P expressed as P = G + E, where G represents the genotypic value (genetic contribution) and E the environmental deviation (non-genetic effects). This linear decomposition assumes no genotype-environment interaction for simplicity and underpins analyses of trait , though estimates (detailed elsewhere) quantify the relative genetic proportion.

Meristic and Threshold Traits

Meristic traits represent discrete, countable phenotypic variations that emerge from the cumulative effects of multiple genetic and environmental factors, typically manifesting as integer values rather than continuous measurements. These traits are analyzed using statistical distributions such as Poisson for unbounded counts or binomial for bounded ones, reflecting their underlying polygenic basis. Classic examples include the number of abdominal bristles in Drosophila melanogaster, a model system where bristle counts exhibit substantial additive genetic variation with heritabilities around 0.5. Similarly, the number of vertebrae in fish species like salmonids shows meristic variation influenced by developmental temperature and genetics, often varying by a few units across populations. In humans, total fingerprint ridge count serves as a well-studied meristic trait, polygenically inherited and relatively stable post-development, making it useful for genetic analyses. Threshold traits, in contrast, appear as binary or categorical outcomes—such as the presence or absence of a —but are conceptualized as arising from an underlying continuous scale where individuals exceeding a specific express the . This is normally distributed and shaped by polygenic plus environmental factors, with the determining the population . Prominent examples include nonsyndromic cleft lip with or without , traditionally viewed as a multifactorial where familial recurrence risks align with model predictions. follows a similar pattern, with disease onset occurring when pushes beyond a population-specific influenced by factors like age and lifestyle. risk is another key illustration, where the 's dichotomous diagnosis masks a continuous spectrum, supported by meta-analyses of family data fitting assumptions. The primary framework for modeling both meristic and threshold traits is the liability threshold model, originally proposed for analyzing disease incidence among relatives, which assumes a latent continuous aggregating contributions from multiple loci and environments. In practice, this involves links (assuming normal liability) or logit links (logistic approximation) to connect the unobserved liability to observed discrete outcomes, enabling estimation on the underlying scale via generalized linear mixed models. For meristic traits, extensions incorporate count-specific distributions while preserving the polygenic continuity, distinguishing them from purely continuous quantitative traits by their discretized expression.

Genetic and Environmental Components

Polygenic Inheritance

Polygenic inheritance describes the genetic basis of complex traits where phenotypic variation arises from the combined action of alleles at numerous genetic loci, each typically exerting a small effect on the trait. This mode of inheritance contrasts with monogenic traits controlled by a single locus and explains the continuous distribution of phenotypes observed in populations for traits such as , , and disease susceptibility. The foundational model posits that many genes contribute incrementally to the trait, leading to a bell-shaped curve of variation when environmental factors are uniform. The concept was pioneered by Ronald A. Fisher in his 1918 paper, which reconciled Mendelian segregation with the quantitative nature of heritable traits by demonstrating that multiple loci with additive effects could produce continuous variation mimicking blending inheritance. In this framework, allelic effects at different loci can be additive, where the total phenotypic value is the linear sum of individual contributions, or non-additive, encompassing dominance effects (interactions between alleles at the same locus) and (interactions between alleles at different loci), which introduce deviations from simple additivity and can modulate the overall trait expression. Mechanistically, polygenic inheritance involves variants across the , with both common alleles ( >1%) contributing small, widespread effects and rare variants ( <1%) potentially exerting larger but less frequent influences on trait variation. The distribution of these variants reflects evolutionary pressures, population history, and selection, such that common variants often tag shared genetic backgrounds in populations, while rare variants may arise de novo or through mutation and contribute to individual-specific risks. This spectrum underlies the polygenic architecture, where no single locus dominates, but collective effects drive heritability. Empirical evidence for polygenic inheritance is evident in family studies, such as those on human height, where dizygotic twin or full sibling pairs exhibit phenotypic correlations of approximately 0.40–0.45, indicating partial similarity beyond what single-gene Mendelian patterns would predict and consistent with shared polygenic backgrounds. This intermediate concordance reflects the probabilistic sharing of alleles across many loci, rather than complete or absent resemblance. Building on classical polygenic models, the omnigenic hypothesis proposes that trait variation involves not just a handful of causal genes but potentially all genes expressed in relevant cell types, due to dense regulatory networks where "core" genes directly impact the trait and "peripheral" genes influence it indirectly through trans-regulatory effects. This model, articulated by Boyle, Li, and Pritchard in 2017, accounts for the unexpectedly broad genomic signals in association studies and emphasizes pervasive connectivity in gene expression as a key mechanism amplifying polygenicity.

Heritability Estimation

Heritability quantifies the proportion of phenotypic variation in a population attributable to genetic differences among individuals. In quantitative genetics, phenotypic variance (V_P) is partitioned into genetic variance (V_G) and environmental variance (V_E), such that V_P = V_G + V_E. Broad-sense heritability (H^2) captures all genetic effects and is defined as H^2 = V_G / V_P, while narrow-sense heritability (h^2) specifically measures the contribution of additive genetic variance (V_A) and is given by h^2 = V_A / V_P. Narrow-sense heritability is particularly relevant for predicting response to selection in breeding programs, as it reflects transmissible genetic effects across generations. Estimation of h^2 often relies on examining phenotypic resemblance between relatives, such as through parent-offspring regression. In this approach, the regression slope (b) of offspring phenotype on the phenotype of a single parent provides an estimate of h^2 / 2, so h^2 = 2b; when using the mid-parent value (average of both parents), the slope directly estimates h^2. Heritability estimates are population-specific and context-dependent, varying with environmental conditions and not representing a fixed property of the trait itself. For instance, heritability estimates from for in adults typically range from 0.75 to 0.85, reflecting a substantial genetic component in well-nourished populations. Similarly, for , heritability estimates are high and stable across birth cohorts in the Netherlands, typically around 0.72-0.78, though in some populations, such as Finland, they have increased over time as improved nutrition reduces environmental variance. These methods assume additive genetic effects without significant gene-environment interactions (G×E), which can bias estimates if present. Heritability can thus fluctuate across generations or environments, as seen with height where better nutrition amplifies the relative contribution of genetics by minimizing environmental constraints on growth.

Environmental Influences

Environmental factors play a crucial role in the expression of complex traits, contributing to phenotypic variance through direct effects and interactions with genetic variants. Common environmental influences include nutrition, socioeconomic status, lifestyle choices (e.g., physical activity, smoking), and exposures (e.g., pollutants, infections), which can modulate trait outcomes independently or via G×E interactions. For example, in height, malnutrition during childhood can suppress genetic potential, while in type 2 diabetes, diet and obesity interact with polygenic risk to elevate susceptibility. These factors explain the remaining variance after genetic contributions and underscore why heritability is not fixed, decreasing in heterogeneous environments. Advances in studying G×E, such as through twin designs or longitudinal cohorts, highlight how environmental optimization can mitigate risks for complex diseases.

Methods for Identification and Mapping

Family and Twin Studies

Family and twin studies represent classical pedigree-based approaches in quantitative genetics, designed to partition the variance in complex traits between genetic and environmental influences without requiring molecular markers. These methods leverage the natural similarities and differences among family members, particularly twins, to infer heritability. The twin method originated in the 1920s, with German dermatologist pioneering the systematic comparison of monozygotic (MZ) and dizygotic (DZ) twins to assess genetic contributions to traits like skin conditions, as detailed in his 1924 book Die Zwillingspathologie. This approach has since become foundational, enabling researchers to control for shared environments while isolating additive genetic effects. The core design of twin studies contrasts MZ twins, who share nearly 100% of their genetic material, with DZ twins, who share on average 50%, akin to non-twin siblings. Concordance rates—the probability that both twins exhibit a trait if one does—typically exceed those in DZ pairs for complex traits, indicating genetic involvement. For instance, in , meta-analyses of European twin studies from 1963 to 1987 report MZ concordance rates of 48% compared to 17% in DZ twins, underscoring substantial heritability despite environmental factors. Adoption studies complement this by separating genetic from rearing environment effects; adoptees' resemblance to biological relatives over adoptive ones highlights heritable influences in behavioral traits. Analysis often employs Falconer's formula to estimate broad-sense heritability (h^2), calculated as h^2 = 2(r_{MZ} - r_{DZ}), where r_{MZ} and r_{DZ} are the phenotypic correlations for MZ and DZ twins, respectively; this doubles the difference to account for DZ twins sharing half the genetic variance of MZ pairs. Early applications included heritability estimates for blood pressure, with twin studies yielding around 50% for both systolic and diastolic measures in systematic reviews. Longitudinal twin designs further track trait stability over time, revealing that genetic factors increasingly explain variance in personality traits like extraversion from adolescence to adulthood, with correlations stabilizing at 0.30–0.65 across intervals. In modern behavioral genetics, family and twin studies extend beyond basic heritability to model gene-environment interactions and causal pathways, serving as quasi-experimental tools for traits like cognition and psychopathology. These methods remain vital for validating molecular findings and informing public health strategies, though they assume equal environments for MZ and DZ twins reared together.

QTL Mapping

Quantitative Trait Loci (QTL) refer to genomic regions that contribute to variation in a quantitative trait, identified through statistical analysis of linkage between genetic markers and phenotypic data in experimental populations. QTL mapping employs linkage analysis in controlled crosses, such as pedigrees or recombinant inbred lines, to localize these regions by detecting associations between marker genotypes and trait values. The foundational method for QTL mapping is interval mapping, introduced by Lander and Botstein in 1989, which scans the genome by testing for the presence of a QTL between pairs of flanking markers and calculates a likelihood ratio statistic expressed as a logarithm of odds (LOD) score. A LOD score exceeding 3 is commonly used as a significance threshold, corresponding to odds of 1000:1 in favor of a QTL at that position, though permutation tests are recommended for genome-wide adjustment. To address biases from multiple linked QTL, composite interval mapping extends this approach by incorporating additional markers as covariates to control for other QTL effects, as developed by Zeng in 1994. QTL mapping has been widely applied in plant and animal breeding programs to identify loci influencing economically important traits, such as yield components in maize, where multiple QTL for kernel number and ear length have been mapped in doubled haploid populations across environments. For instance, studies in maize have detected stable QTL for grain yield on chromosomes 1, 3, and 6, facilitating marker-assisted selection in breeding. However, the resolution of QTL mapping is typically limited to intervals of 10-20 centimorgans due to recombination constraints in mapping populations. Key limitations include low statistical power to detect QTL with small effect sizes, often below 10% of total variance, and the necessity for structured family designs, which restrict its use to biparental or experimental crosses rather than diverse natural populations.

Genome-Wide Association Studies (GWAS)

Genome-wide association studies (GWAS) are population-based approaches that systematically scan the genomes of unrelated individuals to identify common genetic variants associated with complex traits by testing millions of single nucleotide polymorphisms (SNPs) across the genome. These studies typically employ high-density SNP arrays to genotype hundreds of thousands to millions of markers, followed by statistical analysis using linear or logistic regression models to assess associations between each SNP and the trait of interest, with genome-wide significance conventionally defined as a p-value threshold of p < 5 \times 10^{-8} to account for multiple testing across approximately 1 million independent SNPs. This threshold balances the discovery of true associations while controlling the false positive rate, enabling the detection of variants with small effect sizes that contribute to polygenic traits. A pivotal advancement in GWAS came with the 2007 study by the Wellcome Trust Case Control Consortium (WTCCC), which analyzed over 14,000 cases across seven common diseases and 3,000 shared controls using the Affymetrix GeneChip 500K array, identifying novel susceptibility loci and demonstrating the feasibility of large-scale genotyping for complex traits. Since then, GWAS have proliferated, with the NHGRI-EBI GWAS Catalog now documenting associations for over 15,000 traits from nearly 7,000 publications as of 2024. These studies have mapped polygenic signals for diverse phenotypes, such as height, where meta-analyses have identified associations at more than 700 genomic loci, collectively explaining a substantial portion of the trait's heritability. Results are often visualized using Manhattan plots, which plot -\log_{10}(p-values) against genomic position to highlight peaks of significant associations amid the background noise of non-significant tests. Despite their success, GWAS face challenges including population stratification, where differences in ancestry between cases and controls can produce spurious associations, which is commonly corrected by including principal components derived from genome-wide SNP data as covariates in regression models. Additionally, linkage disequilibrium (LD)—the non-random correlation of alleles at nearby SNPs—can complicate the identification of causal variants, as associated signals may tag multiple linked polymorphisms rather than the functional one, necessitating fine-mapping efforts to disentangle these effects. These methodological considerations have refined GWAS to enhance accuracy and replicability across diverse populations.

Emerging Computational Approaches

Whole-genome sequencing (WGS) has enabled the detection of rare variants contributing to complex traits by overcoming limitations of earlier genotyping arrays that primarily captured common variants. Burden tests, which aggregate rare variants within genes or regions to assess their collective impact, have become standard for this purpose; for instance, the sequence kernel association test (SKAT) evaluates associations while accounting for heterogeneity in variant effects. In the 2020s, large-scale WGS efforts in cohorts like the UK Biobank, encompassing over 500,000 participants, have identified rare variants with large effects on traits such as lipid levels and body mass index, revealing that these variants explain a portion of missing heritability not captured by common variants. These advances leverage deep sequencing to achieve minor allele frequency thresholds below 1%, enhancing power for polygenic risk assessment in diverse populations. Functional genomics approaches integrate expression quantitative trait loci (eQTLs) data to prioritize causal variants among GWAS signals, linking genetic variants to gene expression changes in relevant tissues. The GTEx database, derived from multi-tissue RNA sequencing in over 900 donors, provides comprehensive eQTL maps that facilitate this integration by identifying tissue-specific regulatory effects for complex traits like schizophrenia and height. Fine-mapping with Bayesian methods, such as the sum of single effects (SuSiE) model, refines credible sets of variants by incorporating eQTL priors to shrink non-causal signals and estimate posterior inclusion probabilities. This framework has improved resolution in fine-mapping eQTLs for traits like type 2 diabetes, reducing credible set sizes by up to 50% compared to frequentist methods. Machine learning techniques, particularly deep learning on multi-omics data, have advanced predictions of trait variance by modeling nonlinear interactions across genomics, transcriptomics, and epigenomics layers. These models capture epistatic effects and environmental modifiers that linear methods overlook, achieving up to 20% higher explained variance for traits like in integrated datasets. Generative models, such as variational autoencoders adapted for causal inference, predict sets of causal genes underlying complex traits by simulating transcriptional networks informed by perturbation data, identifying multi-target pathways for diseases like with 15-30% improved accuracy over traditional . These approaches, often trained on large , prioritize biologically plausible gene sets for functional validation. Recent tools like CAUSALdb2, an updated database launched in 2024, catalog putative causal variants for over 10,000 complex traits by integrating fine-mapping, colocalization, and functional annotations from diverse ancestries, enabling queries for trait-specific regulatory elements. For evolutionary insights, polygenic scores applied to ancient DNA reconstruct trait distributions across populations, revealing selection pressures on height and immune response; however, predictive accuracy declines substantially in ancient samples due to allele frequency shifts from natural selection. These methods bridge modern and historical genomics to infer adaptive architectures of complex traits.

Genetic Architecture

Distribution of Genetic Effects

The genetic architecture of complex traits is characterized by the contributions of numerous variants with varying effect sizes and allele frequencies, often following non-uniform distributions. In the infinitesimal model, proposed by R.A. Fisher, traits are influenced by an effectively infinite number of loci, each with infinitesimally small and roughly equal additive effects, leading to a Gaussian distribution of phenotypic variation under . This model aligns with empirical observations from , where the majority of identified variants explain only a tiny fraction of trait variance, typically less than 0.1% per . Building on this, the omnigenic model extends the polygenic framework by emphasizing the interconnected nature of gene regulatory networks, positing that nearly all genes expressed in trait-relevant cells can indirectly influence phenotypes through pervasive effects on core pathways, resulting in many small effects distributed across the genome. Effect sizes for these variants often follow a power-law distribution, where most SNPs have negligible impacts, but a small number of loci exert larger effects, contributing disproportionately to heritability. For instance, in human height—a canonical complex trait—GWAS have identified thousands of common variants, with effect sizes skewed toward the small end of this distribution. Regarding allele frequencies, common variants (minor allele frequency >1-5%) predominate in the genetic basis of evolutionarily stable complex traits, such as , where they tag causal sites and collectively account for substantial . In contrast, rare variants (minor allele frequency <1%) are more prominent in traits under recent selective pressures or with heterogeneous etiologies, often driving or risk through larger individual effects. A key mechanism linking these is synthetic association, where rare causal variants are indirectly captured by with nearby common SNPs, creating apparent signals from common markers in GWAS. Empirical evidence from large-scale GWAS meta-analyses supports these patterns through estimates of chip heritability, which quantifies the proportion of phenotypic variance explained by common SNPs genotyped on arrays. For most human complex traits, including , , and psychiatric disorders, chip heritability ranges from 20% to 50%, reflecting the cumulative impact of myriad small-effect common variants while highlighting the role of rarer or structural variants in the remaining heritability. In height specifically, common variants explain approximately 40% of the narrow-sense heritability, demonstrating near-saturation of detectable effects in well-powered studies.

Gene-Environment Interactions

Gene-environment interactions (GxE) occur when the effect of a genetic variant on a varies depending on the environmental context, contributing to the complexity of polygenic traits. These interactions can manifest as the genetic influence being amplified or diminished by specific environmental factors, such as diet or lifestyle. For instance, the fat mass and obesity-associated ( variant rs9939609 has a stronger association with risk in individuals consuming high-calorie diets compared to those on lower-calorie regimens, where the genetic effect is minimal. Similarly, , or gene-gene (GxG) interactions, involves non-additive effects between loci, where the impact of one genetic variant depends on the presence of another, further shaping phenotypic outcomes in complex traits like or susceptibility. Detection of GxE often relies on variance components models that incorporate interaction terms to partition phenotypic variance into genetic, environmental, and interactive components. These models, commonly applied in twin and family studies, estimate the proportion of variance attributable to GxE by fitting linear functions of environmental moderators to genetic effects. A notable example is the interaction between the APOE ε4 allele and smoking, which exacerbates the risk of ; carriers who smoke show accelerated cognitive decline compared to non-smokers with the same . Recent advancements, such as scalable variance components methods, have identified significant GxE for traits like modulated by , revealing interaction heritabilities up to 5-10% in large cohorts. Quantitative models of GxE extend classical by expressing total phenotypic variance as V_P = V_G + V_E + 2 \Cov(G, E), where V_G is genetic variance, V_E is environmental variance, and the covariance term captures the interaction. This framework highlights how correlated genetic and environmental influences can inflate or mask estimates. Reaction norms, a key concept in , describe the phenotypic response of a across a continuous environmental , often visualized as a linear or nonlinear function that quantifies . For example, in , reaction norms illustrate how different QTLs alter growth under varying nutrient levels, aiding in the selection of stable varieties. These interactions underpin , allowing organisms to adjust traits like body size or to fluctuating environments without genetic change, which is evolutionarily advantageous in heterogeneous habitats. In , GxE promotes canalization—the buffering of phenotypes against environmental perturbations—enhancing developmental robustness while enabling adaptive divergence. Unmodeled GxE may partly explain missing in complex traits, as interactions can contribute substantial variance not captured by additive models alone.

Challenges and Future Directions

Missing Heritability

The phenomenon of missing heritability describes the discrepancy between the high estimates derived from family and twin studies—often capturing 60–80% of phenotypic variance for traits like —and the substantially lower proportion of variance explained by genetic variants identified through genome-wide studies (GWAS). For instance, twin studies estimate the of at approximately 80%, whereas common single-nucleotide polymorphisms (SNPs) collectively account for only 40–50% of this variance, a gap sometimes referred to as the "dark matter" of due to its elusive nature. This shortfall highlights that standard GWAS, which primarily detect common variants, fail to fully elucidate the genetic basis of complex traits. Several factors contribute to this missing heritability. Rare variants, which occur at low frequencies and often have larger effect sizes, are underrepresented in typical GWAS arrays and contribute significantly to the unexplained portion; for example, whole-genome sequencing analyses have shown that incorporating rare variants can increase explained variance by 25–40% for . Structural variants, such as copy number variations, epistatic interactions among loci, and gene-environment (GxE) interactions are also not adequately captured by current methods, as GWAS assumes additive effects from common SNPs. Additionally, —non-random partnering based on similar traits—can inflate twin-based heritability estimates by enhancing genetic correlations across generations, thereby exaggerating the apparent gap. The concept was prominently outlined in a , which quantified the issue across complex diseases and traits, noting that GWAS at the time explained less than 10–20% of for many phenotypes, prompting calls for expanded sequencing and studies. Progress has since narrowed the gap; by 2022–2024, larger GWAS and imputation techniques had reduced unexplained to around 20% for and similar traits, though a 2025 whole-genome sequencing study of over 300,000 individuals further advanced this by capturing approximately 88% of pedigree-based narrow-sense on average across phenotypes (70.9% specifically for ), leaving about 12% unexplained and highlighting the role of rare variants in colocalizing with common variant signals. These advances underscore the need for whole-genome approaches to close the divide further, particularly for traits like psychiatric disorders where substantial portions remain unexplained. This persistent missing heritability has critical implications, particularly limiting the accuracy of polygenic risk scores for prediction and clinical applications, as they rely heavily on identified . Addressing it requires larger, more diverse cohorts to detect rare and population-specific effects, alongside refined models for non-additive and environmental interactions.

Advances in Polygenic Risk Scores and

Polygenic risk scores (PRS), also known as polygenic scores, provide a single numerical estimate of an individual's to a complex trait or by summing the effects of multiple genetic variants, typically single nucleotide polymorphisms (), weighted by their effect sizes derived from genome-wide association studies (GWAS). These scores aggregate the cumulative impact of common variants, each contributing small effects, to predict phenotypic liability. To address and incorporate (LD) patterns, methods like LDpred apply Bayesian shrinkage, estimating posterior SNP effect sizes by assuming a fraction of causal variants and shrinking others toward zero, thereby improving predictive accuracy over simple GWAS-based weights. However, PRS transferability remains limited across ancestral populations due to differences in LD structure and allele frequencies, leading to reduced performance in non- groups when models are trained on European GWAS data. Advancing beyond mere association, causal inference methods integrate PRS with genetic tools to distinguish correlation from causation in complex traits. Mendelian randomization (MR) leverages genetic variants as instrumental variables—strongly with an exposure but independent of confounders—to infer causal effects on outcomes, such as the impact of lipid levels on . Complementing MR, colocalization analyses assess whether GWAS signals for a trait overlap with (eQTLs) in specific tissues, identifying candidate causal genes by testing for shared genetic variants driving both association and gene expression changes. These approaches enhance the interpretability of PRS by linking predictive variants to underlying biological mechanisms. Recent developments have focused on generative models and multi-ancestry frameworks to refine PRS for and . In 2025, a generative approach using transcriptional data with causal priors predicted sets of causal genes underlying complex traits like and , outperforming traditional methods by amplifying sparse signals to identify multitarget therapeutic candidates. Simultaneously, multi-ancestry PRS models, trained on diverse GWAS summaries, have improved risk prediction across populations, reducing biases in non-European ancestries and promoting equitable clinical applications, such as for coronary heart disease where ancestry-specific tuning boosted performance in underrepresented groups. In applications, PRS have demonstrated utility in disease risk prediction; for instance, a 313-variant PRS for achieves an area under the curve () of approximately 0.62 in validation cohorts, enabling stratification of lifetime risk and integration with clinical models to identify high-risk individuals. Additionally, PRS applied to reveal historical shifts in trait distributions, such as increases in polygenic scores for and in European populations from the to modern eras, reflecting evolutionary pressures and admixture events.

References

  1. [1]
    Human Complex Trait Genetics: Lifting the Lid of the Genomics ... - NIH
    Complex traits are determined by the interplay of multiple genetic and environmental factors. Understanding the interaction of nature and nurture in the ...
  2. [2]
    Genetics and Mathematical Modeling of Complex Traits
    Complex traits are features whose properties are controlled by many genes and whose inheritance does not follow the simple rules of Mendelian genetics.
  3. [3]
    Systems genetics approaches for understanding complex traits with ...
    Introduction. Complex traits, including common diseases, result from the combined effects of multiple genetic variations together with environmental factors.
  4. [4]
    Complex Traits & Diseases | University of Michigan Medical School
    Complex traits such as height, body mass index (BMI), diseases such as type 2 diabetes, and psychological disorders are influenced by a multitude of genetic ...
  5. [5]
    Polygenic Trait - National Human Genome Research Institute (NHGRI)
    A polygenic trait is a characteristic, such as height or skin color, that is influenced by two or more genes. Because multiple genes are involved, polygenic ...
  6. [6]
    Genetic prediction of complex traits with polygenic scores - NIH
    Complex traits are traits that do not perceivably follow simple Mendelian inheritance laws. Example complex traits include binary ones such as type 2 diabetes ...
  7. [7]
    Polygenic inheritance and environmental effects - Khan Academy
    Polygenic inheritance means traits are controlled by multiple genes, and environmental factors also influence traits, such as diet and exercise.
  8. [8]
  9. [9]
    Liability threshold model-based disease risk prediction ... - Nature
    Nov 3, 2025 · The liability threshold model operates under the assumption that an underlying liability distribution exists for polygenic binary traits, and ...
  10. [10]
    A saturated map of common genetic variants associated ... - Nature
    Oct 12, 2022 · This study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants.
  11. [11]
    Complex genetic architecture of Drosophila aggressive behavior
    Studies in Drosophila reveal the complex genetic architecture of aggressive behavior, with a large mutational target size, pleiotropic mutational effects, and ...
  12. [12]
    Global genetic differentiation of complex traits shaped by ... - Nature
    May 14, 2018 · Our results support the hypothesis that natural selection has shaped the genetic differentiation of complex traits, such as height and schizophrenia, among ...
  13. [13]
    Impact of genomics on animal agriculture and opportunities for ...
    Genomics will speed genetic improvement, increase production, and improve traits like growth, lean content, and reproduction, and may lead to disease ...
  14. [14]
    Polygenic risk scores and embryonic screening: considerations for ...
    Ethical considerations · Reproductive autonomy · Harm principle · Eugenics, discrimination and designer babies · Justice.Missing: insurance complex
  15. [15]
    Future implications of polygenic risk scores for life insurance ... - NIH
    Mar 30, 2024 · The use of genetic risk information in life insurance underwriting is a major ethical, legal, and psychosocial concern.Missing: eugenics | Show results with:eugenics
  16. [16]
    The Correlation between Relatives on the Supposition of Mendelian ...
    Jul 6, 2012 · XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. Published online by Cambridge University Press: 06 July 2012.
  17. [17]
    From R.A. Fisher's 1918 Paper to GWAS a Century Later - PMC - NIH
    Apr 3, 2019 · IT has been a century since the landmark paper “The Correlation between Relatives on the Supposition of Mendelian Inheritance” by R.A. Fisher ...Missing: original source
  18. [18]
    Animal Breeding Plans : Lush, Jay L. - Internet Archive
    Jan 24, 2017 · Animal Breeding Plans ; Publication date: 1943 ; Topics: Osmania ; Collection: digitallibraryindia; JaiGyan ; Language: English ; Item Size: 1.3G.
  19. [19]
    Jay L. Lush - Department of Animal Science - Iowa State University
    He [Lush] has defined the problems of genetically changing farm animals in a logical, biological, quantitative and economic way.
  20. [20]
    The Green Revolution: Norman Borlaug and the Race to Fight ... - PBS
    Apr 22, 2025 · Borlaug's wildly successful efforts to increase crop yields came to be known as the “Green Revolution” and earned him the Nobel Peace Prize in ...Missing: post- | Show results with:post-
  21. [21]
    Q&A: Genetic analysis of quantitative traits - PMC - PubMed Central
    Apr 17, 2009 · Quantitative, or complex, traits are traits for which phenotypic variation is continuously distributed in natural populations, with population ...
  22. [22]
    Quantitative Trait - an overview | ScienceDirect Topics
    A trait defined by a numerical value, often determined by multiple genes at different genomic locations that have small effects and are influenced by ...<|control11|><|separator|>
  23. [23]
    [PDF] The Correlation between Rela.tives on the Supposition of Mendelian ...
    He shows the similarity of the effects of dominance and of environment in reducing the correlations between relatives, but states that they are identical, an ...
  24. [24]
    Polygenic inheritance, GWAS, polygenic risk scores, and the ... - PNAS
    Aug 4, 2020 · ... traits, was provided by R. A. Fisher (1) in his seminal 1918 paper. This was entitled “The correlation between relatives on the supposition ...
  25. [25]
    Chapter 5: Gene Effects – Quantitative Genetics for Plant Breeding
    P = G + E ,. Equation 1 Linear model for evaluating phenotype. where: P is the phenotypic value, G is the genotypic value, and. E represents the non-genetic ...
  26. [26]
    The infinitesimal model: Definition, derivation, and implications
    The infinitesimal model is a simple and robust model for the inheritance of quantitative traits, in which these are the sum of a genetic and a non-genetic ( ...
  27. [27]
    [PDF] Quantitative genetic methods depending on the nature of the ... - HAL
    Dec 7, 2020 · These traits are most often analysed using an appropriate GLMM (e.g.. Poisson, binomial, negative-binomial), although using a LMM is possible.
  28. [28]
    Drosophila bristles and the nature of quantitative genetic variation
    Drosophila bristle numbers have abundant and largely additive naturally segregating variation, with heritabilities of the order of 0.5 (Clayton et al. 1957).
  29. [29]
    Quantitative trait loci analyses for meristic traits in Oncorhynchus ...
    Within Oncorhynchus mykiss, there is considerable variation in meristic characters such as numbers of vertebrae, lateral line scales, fin rays, gill rakers, ...
  30. [30]
    Linkage Analysis of a Model Quantitative Trait in Humans - NIH
    Sep 28, 2007 · The finger ridge count (a measure of pattern size) is one of the most heritable complex traits studied in humans and has been considered a model ...Missing: meristic | Show results with:meristic
  31. [31]
  32. [32]
    Mode of inheritance of nonsyndromic cleft lip with or ... - PubMed - NIH
    Nonsyndromic cleft lip with or without cleft palate (CL +/- P) is traditionally recognized as a multifactorial threshold trait (MFT).
  33. [33]
    Genetics of Cleft Lip and Cleft Palate - PMC - NIH
    Orofacial clefts are common birth defects and can occur as isolated, nonsyndromic events or as part of Mendelian syndromes.
  34. [34]
    Schizophrenia as a Complex Trait: Evidence From a Meta-analysis ...
    This model specifies that individuals in the population have a frequency distribution of liability to schizophrenia that is described by a normal distribution.
  35. [35]
    [PDF] The inheritance of liability to certain diseases, estimated from the ...
    The purpose of this paper is to suggest that the method developed in quanti- tative genetics for dealing with 'threshold characters ' is applicable to data on ...
  36. [36]
    The Liability Threshold Model for Predicting the Risk of ... - NIH
    The discrete genetic liability threshold model consisting of 231 SNPs (GLT 231 ) correctly classified 87.7% of 2378 T2D patients as high or low risk of DCVD.
  37. [37]
    Link Functions in Multi‐Locus Genetic Models - Wiley Online Library
    Apr 16, 2012 · For example, the “liability threshold” model widely used in genetics ... probit and logit links become nearly linear, approximating the additive ...
  38. [38]
    Polygenic Inheritance - an overview | ScienceDirect Topics
    Polygenic inheritance refers to the genetic transmission of traits that are influenced by multiple genes, where the severity of the condition is expected to ...Missing: seminal | Show results with:seminal
  39. [39]
    A polygenic score method boosted by non-additive models - Nature
    May 29, 2024 · We present GenoBoost, a flexible polygenic modeling framework capable of incorporating both additive and non-additive genetic dominance effects ...
  40. [40]
    Non-additive (dominance) effects of genetic variants associated with ...
    Mar 29, 2020 · Dominant and recessive inheritance of monogenic traits are well known examples of non-additive allele-allele interaction effects. However, tests ...
  41. [41]
    Rare and Common Variants: Twenty arguments - PMC
    The debate is whether genetic variance comes from many rare variants of large effect, or many common variants of small effect, or a combination.
  42. [42]
    Distribution of allele frequencies and effect sizes and their ... - PNAS
    In this report, we investigate distributions of various population genetic parameters and their interrelationships using estimates of allele frequencies and ...Sign Up For Pnas Alerts · Results · Discussion
  43. [43]
    Estimating Trait Heritability | Learn Science at Scitable - Nature
    Heritability is a concept that summarizes how much of the variation in a trait is due to variation in genetic factors.
  44. [44]
    Reconsidering the Heritability of Intelligence in Adulthood - NIH
    Heritability estimates of general intelligence in adulthood generally range from 75 to 85%, with all heritability due to additive genetic influences.
  45. [45]
    Genetic and environmental influences on adult human height across ...
    Dec 14, 2016 · Human height variation is determined by genetic and environmental factors, but it remains unclear whether their influences differ across birth-year cohorts.
  46. [46]
    Who discovered the twin method? | Behavior Genetics
    The twin method is usually credited to Francis Galton's 1875 article on twins. ... Siemens, both in 1924, 50 years after Galton's paper. Article PDF. Download ...Missing: original | Show results with:original
  47. [47]
    Genetics of Schizophrenia: Overview of Methods, Findings and ...
    One seminal study (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014) combined available schizophrenia GWAS samples into a single ...
  48. [48]
    Overview of Behavioral Genetics Research for Family Researchers
    This article provides an overview of the methods, assumptions, and key findings of behavioral genetics methodology for family researchers with a limited ...
  49. [49]
    Introduction to quantitative genetics : Falconer, D. S - Internet Archive
    Apr 22, 2013 · 1960. Topics: Genetics, Genetics. Publisher: New York,: Ronald Press Co ... PDF download · download 1 file · SINGLE PAGE ORIGINAL JP2 TAR ...
  50. [50]
    Genetic Contribution to the Variance of Blood Pressure and Heart Rate
    Mar 6, 2015 · The results indicated that heritability estimates based on twin studies of both SBP and DBP are around 50%. Our study confirmed that the blood ...
  51. [51]
    Genetic and environmental influences on personality trait stability ...
    Stability and change in personality traits from late adolescence to early adulthood: A longitudinal twin study. Journal of Personality. 2008;76:229–266. doi ...
  52. [52]
    Beyond Heritability: Twin Studies in Behavioral Research - PMC - NIH
    Twin studies help identify causal environmental effects by controlling for genetic and shared environmental influences, acting as quasi-experimental tests.
  53. [53]
    Quantitative Trait Loci (QTL) Mapping - PubMed
    Quantitative trait loci (QTL) are genetic regions influencing phenotypic variation, identified through QTL mapping, a statistical genetic analysis.
  54. [54]
    Quantitative Trait Locus Mapping - an overview | ScienceDirect Topics
    QTL mapping is defined as a genetic analysis technique that uses interval mapping based on genetic markers to reliably identify quantitative trait loci (QTLs) ...
  55. [55]
    [PDF] Mapping Mendelian Factors Underlying Quantitative Traits Using ...
    This paper describes methods for mapping QTLs using RFLP linkage maps, including identifying crosses, interval mapping, and selective genotyping.
  56. [56]
    Simple Interval Mapping | Quantitative Trait Locus (QTL) Analysis 2
    For example, if the presence of a QTL is 1000 times more probable than its absence, then the LOD score = 3, because log10 of 1000 = 3. scores are plotted along ...
  57. [57]
    [PDF] Precision Mapping of Quantitative Trait Loci
    This test can be constructed by using a combination of interval mapping with multiple regression analysis (ZENG 1993). Various properties of the multiple ...
  58. [58]
    Mapping quantitative trait loci for yield-related traits and predicting ...
    Nov 6, 2019 · It is of great significance for maize breeder to map QTLs, to predict candidate genes, to clone functionlal genes and to clarify gene's genetic ...
  59. [59]
    Mapping of QTLs for morphophysiological and yield traits under ...
    Our study aimed to identify genomic regions associated with drought tolerance-related traits by phenotyping a mapping population of recombinant inbred lines ( ...
  60. [60]
  61. [61]
    Genetic mapping of quantitative trait loci in crops - ScienceDirect.com
    ... QTL to 10–20 cM intervals. Additionally, detection of QTL in biparental populations depends on the phenotypic diversity of the two parents, which may ...
  62. [62]
    Genome-wide association studies | Nature Reviews Methods Primers
    Aug 26, 2021 · Genome-wide association studies (GWAS) aim to identify associations of genotypes with phenotypes by testing for differences in the allele ...
  63. [63]
    10 Years of GWAS Discovery: Biology, Function, and Translation
    Here we review the remarkable range of discoveries that genome-wide association studies (GWASs) have facilitated in population and complex-trait genetics.
  64. [64]
    Genome-wide association study of 14,000 cases of seven ... - Nature
    The Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447 ...
  65. [65]
    The NHGRI-EBI GWAS Catalog: standards for reusability ... - PubMed
    Currently, it contains close to 7 000 publications for >15 000 traits, from which more than 625 000 lead associations have been curated.
  66. [66]
    Principal components analysis corrects for stratification in genome ...
    Jul 23, 2006 · We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale.
  67. [67]
    Whole-genome sequencing analysis identifies rare, large-effect ...
    Feb 24, 2025 · Our ability to assess rare variants in the noncoding genome has until recently been limited, because of the lack of whole-genome sequencing (WGS) ...
  68. [68]
    Recent advances and challenges of rare variant association ...
    These large amounts of sequencing data provide both opportunities and challenges for rare-variant association analysis.Missing: 2020s | Show results with:2020s
  69. [69]
    Integration of expression QTLs with fine mapping via SuSiE
    Jan 25, 2024 · Our new fine-mapping framework, SuSiE2, connects two sum of single-effects (SuSiE) models, one for eQTL analysis and one for genetic fine- ...
  70. [70]
    Integration of Expression QTLs with fine mapping via SuSiE - PMC
    In this study, we propose a new method of incorporating eQTL information to improve fine mapping results based on the SuSiE framework. Our new method, named ...
  71. [71]
    Strategies to include prior knowledge in omics analysis with deep ...
    Mar 14, 2025 · Here, we describe three major strategies proposed to use prior knowledge in deep learning models to make predictions based on molecular profiles.
  72. [72]
    Generative prediction of causal gene sets responsible for ... - PNAS
    We predict sets of genes that explain the emergence of complex traits, which suggest possible multitarget disease treatments.
  73. [73]
    Generative prediction of causal gene sets responsible for complex ...
    Apr 22, 2025 · We develop an approach that leverages transcriptional data endowed with causal information and a generative machine learning model to strengthen statistical ...Missing: deep omics
  74. [74]
    CAUSALdb2: an updated database for causal variants of complex ...
    Nov 18, 2024 · The first version of CAUSALdb provided a unique resource for biologists and geneticists to search and visualize causal variants in a trait, and ...Abstract · Introduction · Materials and methods · Results
  75. [75]
    Natural selection acting on complex traits hampers the predictive ...
    We show how natural selection could impact the predictive accuracy of ancient polygenic scores for two widely studied traits: height and body mass index.Article · Results · Discussion
  76. [76]
    Polygenic prediction of human complex traits using ancient DNA
    Natural selection acting on complex traits hampers the predictive accuracy of polygenic scores in ancient samples. Am J Hum Genet, 112 (2025), pp. 1547-1561.
  77. [77]
    [PDF] Fisher's infinitesimal model: a story for the ages - eScholarship
    Dec 1, 2017 · Fisher (1918) addressed the Gaussian distribution of genetic effects by assuming polygenic inheritance. (Mather, 1943) and calculated genetic ...
  78. [78]
    Progress and Promise of Genome-Wide Association Studies ... - NIH
    The distribution of effect sizes for common variants affecting human complex traits is highly skewed toward small effect sizes, and the true distribution is ...
  79. [79]
    Simple scaling laws control the genetic architectures of human ...
    This echoes recent results showing that distributions of effect sizes for complex traits do indeed span multiple orders of magnitude (in contrast to the ...
  80. [80]
    Mapping Rare and Common Causal Alleles for Complex Human ...
    Sep 30, 2011 · This Primer discusses the population genetics features of rare and common alleles, strategies for connecting these alleles to disease, and strategies to ...
  81. [81]
    Rare Variants Create Synthetic Genome-Wide Associations
    In conclusion, uncommon or rare genetic variants can easily create synthetic associations that are credited to common variants, and this possibility requires ...
  82. [82]
    Article Improved Heritability Estimation from Genome-wide SNPs
    With the use of this approach, it has been estimated that over half the heritability of human height can be attributed to the ∼300,000 SNPs on a genome-wide ...Material And Methods · Results · Polygenic Assumption
  83. [83]
    'Landmark' study resolves a major mystery of how genes govern ...
    Nov 3, 2020 · Studies of identical and fraternal twins suggest up to 80% of variation in height is genetic. But the genes responsible have largely eluded researchers.
  84. [84]
    Finding the missing heritability of complex diseases - Nature
    Oct 8, 2009 · Here we examine potential sources of missing heritability and propose research strategies, including and extending beyond current genome-wide association ...
  85. [85]
  86. [86]
  87. [87]
    How to estimate heritability: a guide for genetic epidemiologists
    Nov 25, 2022 · Common issues across methods developed to estimate heritability include bias due to population stratification, epistasis and assortative mating.
  88. [88]
    A guide to performing Polygenic Risk Score analyses - PMC
    We define polygenic risk scores, or polygenic scores, as a single value estimate of an individual's common genetic liability to a phenotype, calculated as a ...
  89. [89]
    Recent advances in polygenic scores: translation, equitability ...
    Feb 19, 2024 · Polygenic scores (PGS) can be used for risk stratification by quantifying individuals' genetic predisposition to disease, and many potentially clinically ...Disease Risk Prediction... · Risk Stratification · Use In Clinical Trials And...
  90. [90]
    Mendelian randomization: genetic anchors for causal inference in ...
    Mendelian randomization (MR) is a method that utilizes genetic variants that are robustly associated with such modifiable exposures to generate more reliable ...
  91. [91]
    Colocalization of GWAS and eQTL signals at loci with ... - PMC - NIH
    Integration of genome-wide association study (GWAS) signals with expression quantitative trait loci (eQTL) studies enables identification of candidate genes.
  92. [92]
    Optimization of multi-ancestry polygenic risk score disease ... - Nature
    May 20, 2025 · Optimization of multi-ancestry polygenic risk score disease ... In addition, P.G.Y. has equity in Systomic Health LLC and Ethobiotics LLC.
  93. [93]
    Polygenic risk scores for prediction of breast cancer risk in Asian ...
    ... [AUC] = 0.586) was much lower than PRS-313 (AUC = 0.617), derived from European ancestry women, for predicting breast cancer risk in Asian women. In this ...
  94. [94]
    Evolutionary Trends of Polygenic Scores in European Populations ...
    Mar 6, 2024 · This study examines the temporal and geographical evolution of polygenic scores (PGSs) across cognitive measures (Educational Attainment [EA], Intelligence ...