Genetic correlation
Genetic correlation is a core parameter in quantitative genetics that quantifies the shared additive genetic basis between two traits, formally defined as the ratio of their additive genetic covariance to the square root of the product of their additive genetic variances, r_g = \frac{\cov_g}{\sqrt{V_{g1} V_{g2}}}.[1]This coefficient ranges from -1 (indicating opposing genetic effects) to +1 (indicating aligned effects), with a value of 0 signifying no shared additive genetics after accounting for linkage disequilibrium decay; it primarily arises from pleiotropic effects of alleles or persistent linkage among loci influencing the traits.30110-1)[2]
In evolutionary contexts, genetic correlations shape multivariate responses to selection by dictating how changes in one trait indirectly alter others, often imposing constraints on adaptation when antagonistic to fitness gradients or facilitating coordinated evolution under aligned pressures.[3][4]
Such correlations are empirically estimated via methods like restricted maximum likelihood in pedigree data or linkage disequilibrium score regression in genomic datasets, enabling predictions of breeding outcomes in agriculture and insights into shared causal architectures for complex human phenotypes like diseases and cognitive traits.00427-4)[5]
While methodological advances have improved precision, estimates remain sensitive to sample size, population structure, and assumptions about additive versus non-additive effects, underscoring the need for large-scale, cross-validated data to discern true pleiotropy from transient linkage.[6]30110-1)
Definition and Fundamentals
Mathematical Definition
The genetic correlation r_g between two traits is formally defined as the correlation between their breeding values (additive genetic deviations), expressed aswhere \cov_g denotes the additive genetic covariance between the traits, and V_{g1} and V_{g2} are the corresponding additive genetic variances.[7][8] This formulation parallels the Pearson product-moment correlation coefficient but applies specifically to the additive genetic components of phenotypic variation in quantitative genetics.[7] The additive genetic covariance \cov_g captures shared effects from alleles influencing both traits (e.g., via pleiotropy), while the variances V_{g1} and V_{g2} represent the heritable portions of each trait's variability, typically estimated as h^2 \sigma_P^2 where h^2 is narrow-sense heritability and \sigma_P^2 is phenotypic variance.[7] Values of r_g range from -1 (perfect negative genetic association) to +1 (perfect positive), with 0 indicating no shared additive genetic basis; antagonistic correlations (negative r_g) constrain independent evolution of traits under selection.[7][8] In matrix notation for multivariate analysis, the genetic correlation forms part of the additive genetic covariance matrix \mathbf{G}, where off-diagonal elements are covariances and diagonals are variances; the correlation matrix is then \mathbf{D}^{-1} \mathbf{G} \mathbf{D}^{-1}, with \mathbf{D} as the diagonal matrix of genetic standard deviations.[9] Standard errors for r_g estimates account for sampling variability in variance components, approximated as \sigma(r_g) = \frac{1 - r_g^2}{\sqrt{2}} \cdot \sqrt{ \frac{ \sigma(h_x^2) \cdot \sigma(h_y^2) }{ h_x^2 \cdot h_y^2 } }, though precise computation requires restricted maximum likelihood or similar methods.[10]
Biological Interpretation
Genetic correlation quantifies the proportion of additive genetic variance shared between two traits, reflecting the extent to which the same genetic variants contribute to phenotypic variation in both.[11] A value of r_g = 1 indicates that genetic effects on the traits are perfectly aligned in direction and proportional in magnitude, while r_g = 0 implies independence of genetic influences, and negative values signify opposing effects.[12] Biologically, this parameter captures the net directional consistency of allelic effects across traits, providing evidence of overlapping genetic architectures where variants beneficial (or detrimental) for one trait tend to have similar impacts on the other.[13] In evolutionary terms, genetic correlations influence multivariate trait evolution by constraining or channeling responses to selection pressures, as the genetic covariance matrix determines correlated changes even when selection acts univariately.[8] For example, positive genetic correlations between morphological traits like limb length and body mass in vertebrates often arise from shared regulatory genes affecting overall size, leading to coordinated evolutionary shifts.[3] Such correlations highlight underlying biological realities, such as conserved developmental pathways or physiological trade-offs, rather than mere statistical artifacts.[14] Empirical estimates reveal that genetic correlations are ubiquitous in complex traits; for instance, human studies show r_g \approx 0.8 between height and body mass index, underscoring common genetic determinants of growth and metabolism.[15] These shared effects facilitate interpretation of polygenic scores and genome-wide associations, where cross-trait enrichments signal pleiotropic hotspots.[16] However, the magnitude and sign of r_g must be interpreted cautiously, as they integrate population-specific linkage patterns and allele frequencies, potentially varying across contexts without altering fundamental biological linkages.[2]Historical Background
The foundational concepts underlying genetic correlation emerged from efforts to reconcile Mendelian inheritance with the continuous variation observed in quantitative traits. In 1918, Ronald A. Fisher published "The Correlation Between Relatives on the Supposition of Mendelian Inheritance," demonstrating that correlations among relatives could arise from the additive effects of numerous Mendelian factors, thereby establishing the genetic variance-covariance structure central to quantitative genetics.[17] This work introduced the infinitesimal model, positing that polygenic traits behave as if influenced by infinitely many loci of small effect, with covariances between traits attributable to shared genetic effects across loci.[18] Practical applications in selective breeding advanced the explicit consideration of genetic correlations between distinct traits. In the 1930s and 1940s, animal breeders like Jay L. Lush developed methods for multi-trait selection, recognizing that correlated genetic responses could constrain or enhance progress. L.N. Hazel formalized this in 1943 with "The Genetic Basis for Constructing Selection Indexes," introducing genetic correlations as a key parameter in deriving optimal indices that weigh traits by their heritabilities, genetic covariances, and economic values, thereby enabling efficient multi-trait improvement despite pleiotropy or linkage.[19] [20] The concept gained systematic exposition in Douglas S. Falconer's 1960 textbook Introduction to Quantitative Genetics, which derived the genetic correlation coefficient as the ratio of additive genetic covariance to the square root of the product of trait-specific genetic variances, providing formulas for estimation from relatives' data and emphasizing its role in predicting correlated responses to selection.[21] Falconer's framework, updated in the 1996 edition with Trudy Mackay, integrated empirical methods from twin and family studies, solidifying genetic correlation as a core tool in dissecting the architecture of complex traits despite challenges in distinguishing additive from non-additive components.[22]Mechanisms and Causes
Pleiotropy
Pleiotropy occurs when a single genetic locus influences variation in multiple phenotypic traits, thereby inducing genetic covariance between those traits as the same alleles contribute to their respective genetic values. This mechanism underlies a substantial portion of observed genetic correlations in complex traits, where shared genetic effects across loci lead to non-independent inheritance patterns measurable as correlations in breeding values or SNP-based estimates. For instance, pleiotropic effects at a locus generate covariance terms in the genetic variance-covariance matrix, distinct from environmental or non-genetic sources of correlation.[2][23] Empirical studies across model organisms and human populations confirm pleiotropy's role in structuring genetic correlations. In plants, flowering-pathway genes exhibit pleiotropic regulation of developmental traits, contributing to correlated responses in flowering time and morphology. In humans, analyses of genome-wide association studies (GWAS) reveal pleiotropy as a key driver of genetic correlations between psychiatric disorders, such as schizophrenia and bipolar disorder, with shared loci affecting multiple symptom domains. Hormonal pleiotropy, where a single hormone modulates diverse phenotypes like growth and reproduction, has been experimentally linked to genetic covariance in animal models, as demonstrated in studies of endocrine signaling pathways. Over half of the human genome harbors pleiotropic signals when assessed across trait domains, underscoring its prevalence in polygenic architectures.[24][25][26][27] Pleiotropy can be categorized into horizontal forms, where a locus exerts independent effects on traits, and vertical forms, involving sequential impacts through biological pathways, both biasing estimates of genetic parameters like heritability if unaccounted for. Unlike genetic linkage disequilibrium, which arises from physical proximity of loci affecting separate traits, pleiotropy reflects true multifunctionality of genetic variants, such as enzymes or transcription factors targeting multiple downstream processes. Disentangling these requires methods like genetic crosses or conditional analyses to isolate pleiotropic from linkage-driven covariance.[28][2][29]Genetic Linkage and Disequilibrium
Genetic linkage occurs when genes or genetic loci are physically close on the same chromosome, reducing the frequency of recombination between them during meiosis and causing them to be inherited together more often than expected under independent assortment.[30] This physical proximity leads to incomplete linkage, where the recombination rate is inversely proportional to the distance between loci, measured in centimorgans (cM). In quantitative genetics, such linkage can contribute to genetic correlations between traits if the linked loci influence different phenotypes, as the joint inheritance of their allelic variants generates covariance in additive genetic effects.[8] Linkage disequilibrium (LD) quantifies the non-random association of alleles at different loci, often arising from linkage when mutation, selection, genetic drift, or population structure disrupts allele frequencies from equilibrium expectations.[31] LD is commonly measured by the coefficient D = p_{AB} - p_A p_B, where p_{AB} is the frequency of haplotypes carrying alleles A and B, and p_A, p_B are marginal allele frequencies; normalized forms like D' or r^2 account for maximum possible disequilibrium. In multivariate contexts, LD between loci affecting distinct traits induces genetic covariance by coupling their variant effects, such that breeding values for the traits are correlated even without shared causal genes.[30] This effect is distinct from pleiotropy, as it stems from co-inheritance rather than direct multi-trait causation by single variants.[32] Unlike pleiotropic effects, which produce stable genetic correlations persisting across generations, LD-generated correlations are transient and decay over time due to recombination eroding associations, typically at rates proportional to genetic distance and effective population size.[33] In randomly mating populations, complete linkage equilibrium is approached asymptotically, but selection or assortative mating can maintain LD, sustaining trait covariances. Empirical studies in model organisms and plants, such as Drosophila life-history traits, demonstrate that LD accounts for a substantial but variable fraction of observed genetic correlations, often 20-50% depending on chromosomal architecture and demographic history.[30] For instance, simulations and genomic dissections reveal higher LD contributions in recently admixed or bottlenecked populations, where transient covariances mimic pleiotropy in twin or pedigree-based estimates.[32] Distinguishing LD from pleiotropy is critical for interpreting genetic correlations, as methods like linkage disequilibrium score regression (LDSC) leverage population-level LD patterns to partition covariance sources in genome-wide data.30110-1) Failure to account for LD can inflate correlation estimates in finite samples, particularly for traits with polygenic architectures involving many weakly linked loci. In evolutionary contexts, LD-induced correlations may constrain or facilitate short-term responses to selection, but long-term multivariate evolution favors pleiotropic stability unless linkage is tight.[33] High-quality genomic resources, such as those from the 1000 Genomes Project, enable fine-mapping to quantify LD decay rates, confirming its role as a secondary but mechanistically distinct driver of genetic correlations.[31]Estimation Methods
Classical Approaches in Quantitative Genetics
Classical approaches to estimating genetic correlations in quantitative genetics primarily involve analyzing phenotypic covariances among relatives or observing correlated responses to selection on related traits. These methods, developed in the mid-20th century, assume an additive polygenic model with many loci of small effect (the infinitesimal model) and minimal epistasis or linkage disequilibrium.[34] They partition observed phenotypic covariances into genetic and environmental components based on expected sharing of additive genetic effects among kin.[35] In family-based designs, such as parent-offspring pairs or half-sib groups, the additive genetic covariance between traits X and Y (covAXY) is estimated from twice the parent-offspring covariance or four times the half-sib covariance, under assumptions of random mating and no dominance variance: covAXY = 2 × cov(POXY) or covAXY = 4 × cov(HSXY).[35] Additive genetic variances VA1 and VA2 for each trait are similarly derived (e.g., VA = 2 × cov(PO) for mid-parent offspring in balanced designs), enabling calculation of the genetic correlation rg = covAXY / √(VA1 VA2).[36] Early applications in livestock breeding used paternal half-sib correlations to minimize maternal effects, though biases from assortative mating or selection of parents required corrections via maximum likelihood methods introduced by Henderson in 1959.[35] An alternative classical method employs artificial selection experiments, measuring the correlated response (CRY) in trait Y when selecting on trait X. The formula is CRY = iX hX hY rg σPY, where iX is the selection intensity, h are narrow-sense heritabilities, and σPY is the phenotypic standard deviation of Y; thus, rg = CRY / (iX hX hY σPY).[8] Heritabilities are typically estimated separately from realized responses or relative designs. This approach, detailed in Falconer and Mackay (1996), was widely applied in plant and animal breeding but demands large sample sizes and multiple generations for precision.[37] These techniques laid the groundwork for quantitative genetics but face limitations, including low statistical power for modest correlations (e.g., requiring thousands of relatives for standard errors below 0.1), sensitivity to violations of additivity, and confounding by shared environments in non-experimental pedigrees.[35] By the 1970s, restricted maximum likelihood (REML) with animal models extended these methods to handle unbalanced data and polygenic relationships more robustly.[35]Genomic and Computational Methods
Genomic methods estimate genetic correlations by leveraging genome-wide single-nucleotide polymorphism (SNP) data or genome-wide association study (GWAS) summary statistics to model additive genetic covariances among traits in unrelated individuals, circumventing limitations of pedigree-based approaches such as shared environmental confounds and small sample sizes.[38] These techniques partition phenotypic variance into genetic components using linear mixed models or regression frameworks that account for linkage disequilibrium (LD), enabling robust inference across diverse populations.[39] Key advantages include scalability to millions of markers and compatibility with large consortia data, though they assume additive effects and may underestimate correlations if non-additive variance predominates.[40] Linkage disequilibrium score regression (LDSC), introduced by Bulik-Sullivan et al. in 2015, computes genetic correlations from GWAS summary statistics alone by regressing the product of χ² association statistics from two traits against LD scores, which measure the expected contribution of LD to test statistic inflation in genomic regions.[38] The regression slope approximates the genetic covariance, scaled by SNP heritability estimates for each trait to yield the genetic correlation coefficient r_g, typically ranging from -1 to 1.[41] LDSC distinguishes polygenic signals from biases like population stratification via an intercept term and has been applied to estimate correlations across hundreds of traits, revealing, for instance, negative r_g between height and schizophrenia (approximately -0.11).[42] Extensions like cross-trait LDSC handle sample overlap and ancestry differences, with standard errors derived from jackknife resampling.[43] Genomic restricted maximum likelihood (GREML), implemented in the GCTA software suite, constructs a genomic relationship matrix (GRM) from standardized SNP genotypes to fit variance components in a multivariate linear mixed model.[44] In bivariate analyses, GREML simultaneously estimates additive genetic variances V_{g1} and V_{g2} for two traits along with their covariance \operatorname{cov}_g, computing r_g = \frac{\operatorname{cov}_g}{\sqrt{V_{g1} V_{g2}}}.[45] This method explicitly models LD through the GRM and supports multi-trait extensions (MTGREML) for higher-dimensional correlations, outperforming LDSC in accuracy when individual-level data are available but requiring computational resources scaling with sample size (e.g., feasible for n > [10,000](/page/10,000)).[46] Simulations indicate GREML's bias is low under infinitesimal models but increases with sparse polygenicity.[40] Genomic structural equation modeling (Genomic SEM) integrates LDSC-derived genetic covariance matrices with GWAS summary statistics to fit confirmatory factor models, estimating latent genetic factors and their correlations with observed traits.[47] Developed by Grotzinger et al. in 2019, it decomposes multivariate genetic architectures into common and unique components, as in modeling a general factor for psychopathology with r_g up to 0.70 across disorders.[47] This approach enhances power for downstream analyses like identifying pleiotropic loci via MTAG, though it relies on accurate heritability inputs and assumes multivariate normality.[48] Benchmarks across methods show LDSC and GREML yielding convergent r_g estimates (correlation > 0.90) for traits like BMI and educational attainment, with Genomic SEM excelling in dimensionality reduction.[49]Applications
Selective Breeding in Agriculture and Animal Science
Selective breeding programs in agriculture and animal science routinely account for genetic correlations to predict and manage correlated responses across traits, as selection on one trait can inadvertently alter others due to shared genetic variants.[50] In livestock, such as dairy cattle, intense selection for milk yield since the mid-20th century has generated antagonistic genetic correlations with fertility and health traits; for instance, genetic correlations between milk production and days open (a measure of reproductive efficiency) range from -0.20 to -0.50, leading to declines in fertility unless explicitly counter-selected.[51] Similarly, in beef cattle breeds like Angus, genetic correlations between growth traits (e.g., weaning weight) and reproduction (e.g., heifer pregnancy) are often moderately positive (0.10 to 0.30), enabling multi-trait selection indices that balance carcass value with calving ease, with estimated breeding values incorporating these parameters since the 1970s.[52] In swine breeding, genetic correlations between litter size and growth rate are typically low to moderate (0.05 to 0.25), but negative correlations with backfat thickness (-0.30 to -0.50) necessitate index selection to avoid excessive lean growth compromising meat quality; programs like those from the National Swine Improvement Federation have used these estimates since the 1980s to achieve annual genetic gains of 0.5-1% in balanced traits.[53] Poultry breeding exemplifies management of indirect genetic effects, where social dominance traits show genetic correlations with body weight (0.15-0.40) and survival (-0.20 to -0.10), prompting genomic selection models that disentangle direct and indirect components to reduce aggression-related losses, as demonstrated in broiler lines with heritabilities around 0.20 for dominance.[54] Plant breeding faces analogous challenges, particularly antagonistic genetic correlations constraining simultaneous improvement; in wheat, alleles at loci like TaGW2 influence seed size positively but seed dormancy negatively (correlations around -0.40), explaining up to 47% of variation in pre-harvest sprouting risk under selection for yield since domestication intensified post-1950s Green Revolution.[55] Maize breeding programs mitigate yield-drought tolerance antagonisms (genetic correlations -0.10 to -0.30) via multi-environment trials and marker-assisted selection, achieving 1-2% annual yield gains while stabilizing stress resilience, as genetic correlations decay over generations due to recombination breaking linkage disequilibria.[56] Overall, modern approaches integrate genomic estimated breeding values (GEBVs) to forecast correlated responses more accurately than classical methods, with purebred-crossbred genetic correlations in species like chickens averaging 0.70, enabling hybrid vigor without fully sacrificing pure-line progress.[57]Enhancing Genome-Wide Association Studies
Multi-trait analysis of genome-wide association studies (GWAS), such as the Multi-Trait Analysis of GWAS (MTAG) method introduced in 2018, exploits genetic correlations between traits to jointly analyze summary statistics from multiple GWAS, thereby increasing statistical power and enabling the detection of novel associations that single-trait analyses might miss.[58] MTAG models the observed GWAS associations as a function of true effect sizes plus noise, incorporating estimates of genetic correlations derived from linkage disequilibrium (LD) score regression to "borrow strength" across traits, which effectively amplifies the sample size for correlated phenotypes by up to 35-50% in applications like educational attainment and schizophrenia.[58] For instance, applying MTAG to summary statistics from the Social Science Genetic Association Consortium for educational attainment alongside related traits like cognitive performance identified hundreds of additional independent loci compared to univariate GWAS.[58] Cross-trait LD score regression further enhances GWAS by estimating genome-wide genetic correlations (rg) using only summary statistics, without requiring individual-level data or non-overlapping samples, which helps prioritize traits for joint analysis and reveals pleiotropic signals underlying multiple phenotypes.[43] This approach has been instrumental in mapping shared genetic architectures, such as positive rg between body mass index and type 2 diabetes (rg ≈ 0.3-0.5), informing targeted multi-trait models that refine effect size estimates and reduce false positives in GWAS for complex diseases.[43] By integrating rg into fine-mapping pipelines, researchers can weight variants based on their consistency across correlated traits, improving causal variant prioritization; for example, local genetic correlation methods like SUPERGNOVA, developed in 2021, decompose genome-wide signals into regional estimates to uncover heterogeneity in trait architectures, aiding in the dissection of pleiotropic hotspots.[15] In polygenic risk score construction and cross-ancestry GWAS, genetic correlations facilitate portability by reweighting scores according to local rg patterns, mitigating biases from population-specific LD and environmental confounders; a 2022 study demonstrated that incorporating local rg enhanced prediction accuracy for traits like height across ancestries by 10-20%.[59] These enhancements collectively address GWAS limitations, such as low power for rare variants or polygenic traits with small per-SNP effects, by leveraging the causal overlap implied by rg to yield more robust, biologically interpretable associations.[60] Empirical validations across thousands of trait pairs confirm that methods relying on rg outperform univariate approaches when |rg| > 0.2, though they assume bivariate normality of effects and can be sensitive to sample overlap or weak correlations.[10]Evolutionary and Trait Evolution Analysis
Genetic correlations mediate the joint evolution of multiple traits under multivariate selection, as encapsulated in the additive genetic variance-covariance matrix (G-matrix), whose off-diagonal elements represent genetic covariances normalized as correlations. The predicted evolutionary response in mean trait values is given by the multivariate breeder's equation, \Delta \bar{\mathbf{z}} = \mathbf{G} \boldsymbol{\beta}, where \boldsymbol{\beta} is the vector of directional selection gradients on the traits; this formulation, introduced by Lande in 1979, demonstrates that selection on one trait induces indirect responses in others proportional to their genetic correlation.[61] Positive genetic correlations align trait responses with selection, potentially accelerating adaptation, whereas negative correlations can deflect trajectories away from optima, imposing evolutionary constraints.[62] In life-history evolution, negative genetic correlations between fitness components, such as early-life reproduction and longevity, frequently constrain independent optimization, as observed in wild bird populations where selection for increased fecundity elicits correlated reductions in survival due to shared genetic bases.[63] Similarly, in morphological traits, genetic correlations between body size and limb proportions in mammals limit decoupling, maintaining scaling relationships despite selection pressures for divergence, as evidenced by comparative analyses across species.[64] Artificial selection experiments confirm these dynamics: in Drosophila, selection for increased bristle number on one thorax segment induces correlated changes in others via pleiotropic effects, altering multivariate phenotypes beyond direct targets.[65] Genetic correlations are not immutable; the G-matrix itself evolves under sustained selection, mutation, and drift, with empirical comparisons across populations and phylogenies revealing shifts in correlation structure over generations or deeper timescales.[66] For instance, analyses of 1798 genetic correlations across 51 animal and plant species indicate that while many persist, directional selection can erode antagonistic ones, releasing variation for adaptation, as modeled in theoretical frameworks where pleiotropic mutation and recombination reshape covariances.[67] Recent genomic studies integrating GWAS and quantitative genetics further show that macroevolutionary patterns in G-matrices reflect historical selection, enabling prediction of trait co-evolution in complex environments.[68] Thus, while genetic correlations often canalize evolutionary paths, their lability ensures they function as transient rather than absolute barriers to change.Empirical Evidence in Human Traits
Correlations with Cognitive Abilities and Personality
Genetic correlations among diverse measures of cognitive ability, such as general intelligence (g), verbal comprehension, and processing speed, are typically moderate to high, ranging from 0.56 to over 0.60, indicating substantial shared genetic etiology underlying variation in these traits.[69][70] This pattern emerges from both twin studies and genomic methods like linkage disequilibrium score regression (LDSC), where the g factor captures overlapping additive genetic effects across cognitive domains.[70] Genome-wide association studies (GWAS) and LDSC analyses reveal specific genetic correlations between cognitive abilities and Big Five personality traits. Cognitive function shows a positive genetic correlation with openness to experience (rg ≈ 0.35) and conscientiousness (rg ≈ 0.17), alongside a negative correlation with neuroticism (rg ≈ -0.21).[69] Twin studies corroborate these directions, estimating rg between intelligence and openness or agreeableness at 0.3–0.4, and between intelligence and neuroticism at approximately -0.18, with weaker or non-significant links to extraversion.[71] These estimates derive from bivariate genetic covariance partitioned by liability thresholds and sample sizes exceeding hundreds of thousands in recent GWAS consortia.[69] Multivariate GWAS across cognitive and personality domains identify hundreds of pleiotropic loci—431 in one analysis—explaining cross-trait genetic overlap beyond univariate signals, with 35% of associations driven by effects spanning both categories.[69] Such pleiotropy enriches discovery of trait-specific variants when conditioning on multivariate signals, highlighting causal genetic pathways in brain tissues and synaptic functions.[69] While phenotypic correlations between cognition and personality are modest (e.g., 0.1–0.3), the genetic component often exceeds them, suggesting evolutionary pressures favoring aligned selection on these heritable dimensions.[72]Correlations with Health Outcomes and Psychopathology
Genome-wide studies using linkage disequilibrium score regression (LDSC) have revealed substantial genetic overlap between psychiatric disorders, with pairwise correlations often ranging from 0.2 to 0.7; for example, schizophrenia and bipolar disorder exhibit a genetic correlation of approximately 0.45, while major depressive disorder shows moderate positive correlations with anxiety disorders (rg ≈ 0.4-0.6).[73][74] These findings indicate shared polygenic risk architectures across psychopathology, potentially reflecting common neurodevelopmental or inflammatory pathways, though effect directions vary and disorder-specific variants remain limited.[75] Psychiatric traits also display mixed genetic correlations with physical health outcomes. Schizophrenia is positively genetically correlated with Crohn's disease (rg ≈ 0.10-0.18), an autoimmune condition, suggesting pleiotropic effects involving immune dysregulation, while showing negative correlations with psoriasis and type 2 diabetes.[76][77] Similarly, anorexia nervosa, obsessive-compulsive disorder, and schizophrenia are negatively correlated with body mass index (BMI) and body fat percentage (rg ≈ -0.2 to -0.4), contrasting with phenotypic observations in some cohorts.[78] Major depressive disorder exhibits a modest positive genetic correlation with obesity (rg ≈ 0.26), though subtype analyses reveal heterogeneity, with appetite-increasing depression variants aligning more closely with higher BMI risk.[79][80]| Psychiatric Trait | Health Outcome | Genetic Correlation (rg) | Method/Source |
|---|---|---|---|
| Schizophrenia | Crohn's Disease | +0.10 to +0.18 | LDSC[76] |
| Anorexia Nervosa | BMI/Body Fat % | -0.2 to -0.4 | LDSC[78] |
| Major Depressive Disorder | Obesity | +0.26 | Family/genomic[79] |
| ADHD | Physical Illness Traits (e.g., metabolic) | +0.3 to +0.5 (average) | Multivariate GWAS[81] |