Lactase persistence is a genetic adaptation in humans that enables the continued production of the enzymelactase into adulthood, allowing the digestion of lactose—the primary sugar in milk—long after weaning, in contrast to the typical mammalian pattern where lactase activity declines post-infancy.[1] This trait, which affects approximately 35% of the global adult population, is most prevalent in populations with a history of dairy farming and pastoralism.[2]The genetic basis of lactase persistence primarily involves single nucleotide polymorphisms (SNPs) in an enhancer region of the MCM6 gene, which regulates the expression of the nearby LCT gene encoding lactase.[2] In populations of European descent, the most common variant is -13910C>T (rs4988235), an autosomal dominant allele that emerged around 7,500 years ago and spread rapidly due to positive selection.[3] Independent mutations have arisen elsewhere, such as -14010C>G in East African pastoralists and -13915G in some African and Middle Eastern groups, illustrating convergent evolution driven by local dairy consumption.[1] These variants enhance LCT transcription in intestinal cells, preventing the onset of lactose intolerance that affects the majority of humans worldwide.[2]Lactase persistence exemplifies gene-culture coevolution, as its rise coincided with the Neolithicdomestication of lactating animals like cattle and goats around 10,000 years ago, providing a nutritional advantage in calcium-scarce or famine-prone environments.[4]Prevalence varies dramatically by region: it reaches 89–96% in Northern Europeans (e.g., Scandinavians and British Isles populations) and certain Africanherding groups like the Sudanese Beni Amer (64%), but is rare (<5%) in East Asians and Native Americans without dairy traditions.[2] Ancient DNA evidence confirms its absence in early Neolithic Europeans, underscoring its status as one of the strongest and most recent signals of natural selection in human evolution.[3]
Overview and Physiology
Definition and Phenotype
Lactase persistence (LP) is a genetic trait characterized by the continued activity of the lactase enzyme into adulthood, enabling the hydrolysis of lactose—the primary sugar in milk—into its constituent monosaccharides, glucose and galactose.[5] This sustained enzymatic function contrasts with the ancestral mammalian condition, where lactase expression is typically downregulated after weaning, resulting in the inability to efficiently digest lactose in later life.[6]Phenotypically, individuals with LP can consume dairy products containing unfermented milk without experiencing common symptoms of lactose maldigestion, such as abdominal bloating, flatulence, diarrhea, or cramping, as the lactose is properly broken down in the small intestine and absorbed.[7] In contrast, lactase non-persistence (LNP), the default physiological state in most humans, leads to a sharp decline in lactase production post-infancy—often to 5-10% of infantile levels—causing undigested lactose to ferment in the gut and trigger these gastrointestinal symptoms upon dairy intake.[6] This distinction manifests as lactose tolerance in LP individuals versus lactose intolerance in those with LNP.Globally, LP affects approximately 35% of adults, while LNP predominates in about 65% of the human population, reflecting the trait's uneven distribution shaped by historical and cultural factors.[7] The emergence of LP is a relatively recent evolutionary adaptation, arising around 10,000 years ago in conjunction with the domestication of dairy animals and the onset of pastoralism in regions like Southwest Asia and Europe.[5]
Lactose Digestion Mechanism
Lactase, formally known as β-galactosidase or lactase-phlorizin hydrolase, is a glycoside hydrolase enzyme embedded in the brush border membrane of enterocytes in the small intestine. It catalyzes the hydrolysis of lactose, the primary disaccharide in mammalian milk, by cleaving the β-1,4-glycosidic bond between its glucose and galactose moieties, yielding absorbable monosaccharides.[8][9]The digestion of lactose begins in the duodenum and jejunum, where dietary lactose is broken down into equimolar amounts of glucose and galactose through lactase activity. These monosaccharides are then efficiently absorbed, whereas in lactase non-persistence (LNP), insufficient enzyme leads to undigested lactose reaching the colon. There, colonic microbiota ferment the osmotically active lactose, producing short-chain fatty acids, hydrogen, methane, and carbon dioxide gases, which draw water into the lumen and trigger gastrointestinal symptoms such as bloating, flatulence, and osmotic diarrhea.[8][10]Absorption of the liberated glucose and galactose occurs via secondary active transport across the apical membrane of enterocytes, primarily mediated by the sodium-glucose linked transporter 1 (SGLT1), which couples monosaccharide uptake to a sodium electrochemical gradient established by the Na+/K+-ATPase. These sugars then diffuse across the basolateral membrane through the facilitative transporter GLUT2 into the portal bloodstream, where they contribute metabolic energy at approximately 4 kcal per gram, equivalent to other dietary carbohydrates.[11][12][13]Lactase expression is tightly regulated by transcription factors that drive high levels during infancy to support milk-based nutrition, but in LNP individuals, enzyme activity declines sharply after weaning due to progressive epigenetic silencing of the lactase gene, resulting in reduced mRNA and protein production without altering the DNA sequence itself.[14][15]
Genetics
Molecular Basis and Gene Regulation
The LCT gene, which encodes the enzyme lactase-phlorizin hydrolase (LPH) responsible for lactose digestion, is located on the long arm of chromosome 2 at position 2q21.3.[16] This gene spans approximately 50 kilobases (kb) and consists of 17 exons, with its transcription producing a messenger RNA of about 6.3 kb that translates into the mature LPH protein primarily expressed in the brush border of intestinal enterocytes.[17] LPH functions as a β-galactosidase, hydrolyzing lactose into glucose and galactose, but its expression is tightly regulated during development, declining post-weaning in most mammals, including humans without lactase persistence.[18]The primary regulatory control of LCT expression occurs through an enhancer element located approximately 14 kb upstream in intron 13 of the neighboring MCM6 gene, which encodes a DNA replication factor but harbors this tissue-specific regulatory sequence.[19] This enhancer directs LCT transcription in the intestine while preventing expression in other tissues, and it contains multiple single nucleotide polymorphisms (SNPs) that modulate its activity; for instance, the -13910C>T SNP serves as a well-studied example of a variant altering enhancer function.[20] In individuals with lactase non-persistence (LNP), the enhancer's default activity allows for the developmental silencing of LCT, whereas persistence-associated alleles enhance transcription factor binding, such as to the HNF1α protein, thereby maintaining expression into adulthood.[21]Epigenetic mechanisms further fine-tune LCT regulation, with DNA methylation at CpG sites in the promoter and enhancer regions, along with repressive histone modifications like H3K27me3, contributing to post-weaning silencing in LNP phenotypes.[14] These modifications accumulate with age in non-persistent individuals, effectively repressing transcription, while lactase persistence alleles disrupt this process by reducing methylation susceptibility and promoting active chromatin marks, such as H3K4me3, to sustain enhancer-promoter interactions.[22]
Key Genetic Variants and Alleles
Lactase persistence (LP) is primarily conferred by single nucleotide polymorphisms (SNPs) in an enhancer region approximately 14 kb upstream of the LCT gene, located within an intron of the neighboring MCM6 gene, which modulate LCT transcription in intestinal cells.[1] These variants arose independently in different populations and exhibit dominant inheritance, allowing sustained lactaseenzyme production into adulthood.The most well-characterized LP-associated variant in populations of European descent is the -13910C>T SNP (rs4988235), where the derived T allele disrupts a transcriptional repressorbinding site, leading to increased LCT expression. Functional assays demonstrate that this T allele enhances transcriptional activity of the LCT promoter by approximately 2- to 3-fold compared to the ancestral C allele, with the effect mediated through improved binding of transcription factors such as Oct-1.[23] The T allele acts in a dominant manner, with heterozygotes maintaining sufficient lactase levels for lactose digestion.In African populations, multiple independent LP alleles have been identified in the same enhancer region, including -14010G>C (rs145946881), -13915T>G (rs41380347), and -13907C>G (rs41525747), each originating separately and conferring similar enhancer effects by altering transcription factor binding to sustain LCT expression. These variants show variable penetrance but collectively explain a portion of LP variation in pastoralist groups, with functional studies confirming their role in upregulating lactase activity comparably to the European allele.[24]A distinct variant associated with LP in some Middle Eastern and South Asian populations is -13907C>G (rs41525747), which exhibits partial dominance and enhances LCT transcription through modifications in the enhancer sequence, though with potentially lower penetrance than the European allele.[1]Overall, LP inheritance follows an autosomal dominant pattern at the LCT locus, where heterozygotes typically retain 50-80% of lactase activity relative to homozygotes, sufficient for effective lactose hydrolysis; additional polygenic factors contribute minimally to the phenotype.[25] A 2023 review highlights that over 10 such LP alleles have now been identified globally across diverse ancestries, with varying degrees of penetrance and functional strength.[1]
Global Prevalence
Regional Distributions
Lactase persistence (LP) shows marked geographic variation, with prevalence strongly associated with historical dairy pastoralism in different regions of the world.In Europe, LP frequencies are highest in northern and western populations, ranging from 70% to 100%, such as approximately 96% among Scandinavians and British Isles groups.[26] Frequencies decline southward, reaching as low as 15-22% in southern European populations like those in Sardinia.[27] These European patterns are primarily driven by the -13910*T genetic variant.[28]In Africa, LP distribution is patchy and largely confined to pastoralist groups, with frequencies of 20-40%, including approximately 43% among the Fulani of Mali.[28] In contrast, non-pastoralist populations in East and Southern Africa exhibit very low prevalence, typically under 5%.[24]In the Middle East, LP is generally uncommon but varies by lifestyle; a 2024 study in Jordan reported 16% prevalence in urban populations compared to 30% in rural farming communities, with even higher rates observed among Bedouin pastoralists.[29]Across Asia, LP is rare in East Asian populations at less than 5%.[28] Prevalence rises to 20-40% in northwestern regions like parts of India and Pakistan, linked to local pastoral traditions, while remaining low in Southeast Asia.[30]In the Americas and Oceania, indigenous populations show low LP rates under 20%, reflecting limited historical dairy use, though admixed groups with European ancestry exhibit higher frequencies due to gene flow.[31]Globally, about one-third of adults are lactase persistent, with distributions varying markedly according to regional histories of milk consumption.[28]
Demographic and Environmental Influences
Population movements have significantly influenced the distribution of lactase persistence (LP) alleles across continents. In Europe, the primary LP-associated allele spread through the migrations of Neolithic farmers from the Near East around 8,000 years ago, introducing dairy pastoralism and selecting for LP in subsequent generations.[26] Additionally, Indo-European migrations from the Eurasian steppes during the Bronze Age further disseminated these alleles westward into Europe, contributing to higher LP frequencies in northern and central regions.[32] In Africa, distinct LP alleles, such as the -14010C>G variant, expanded southward with the Bantu migrations starting approximately 3,000–5,000 years ago, integrating into populations practicing mixed farming and herding in sub-Saharan regions.[33]Admixture events following colonial-era population mixing have reshaped LP prevalence in the Americas. In Latin American countries, LP frequencies range from 30% to 50%, largely attributable to European genetic input during the colonial period, which introduced high-LP alleles into predominantly non-persistent indigenous populations.[34] In contrast, indigenous Native American populations exhibit very low LP rates (typically 10-20%), reflecting their historical absence of dairypastoralism and minimal pre-colonial exposure to LP-conferring alleles.[35]Lifestyle differences between urban and rural settings can modulate observed LP patterns. A 2024 study in Jordan found LP incidence at 16% among urban residents compared to 30% in rural Jordan Valley farmers, with genetic testing indicating variation linked to historical pastoralist ancestry in these communities.[29]Environmental factors indirectly influence LP distribution without altering the underlying genetics. Dairy consumption patterns show a strong positive correlation with LP allele frequencies globally, as pastoralist societies with historical milk use exhibit higher persistence rates, though this reflects cultural selection pressures rather than causation.[36] Similarly, LP is indirectly linked to vitamin D levels, as persistent individuals in high-latitude populations consume more milk—a fortified source of the vitamin—leading to elevated serum concentrations independent of sunlight exposure.[37]Recent ancestry modeling supports the role of pastoralist migrations in LP dissemination. A 2025 analysis using genetic ancestry components and polygenic risk scores predicted that LP alleles propagated primarily through ancient herder movements across Eurasia and Africa, aligning observed modern distributions with migration routes rather than local de novo evolution.[38]
Evolutionary Origins
Selection Pressures and Timeline
Lactase persistence (LP) first emerged in human populations approximately 7,000 to 10,000 years ago, shortly following the domestication of dairy animals such as cattle and sheep in the Near East around 9,000 BCE.[36][39] This temporal alignment indicates that the genetic adaptations enabling adult lactose digestion arose in response to the increasing availability of milk as a dietary resource after the Neolithic transition to pastoralism.[2]The evolution of LP has been driven by exceptionally strong positive selection, among the most intense documented in human history, with estimated selection coefficients ranging from 0.04 to 0.097 per generation, facilitating a rapid rise in allele frequencies over a few thousand years.[1] This selective pressure is evidenced by the swift spread of LP-associated alleles, such as the European -13910*T variant, which transitioned from rarity to prevalence in pastoralist groups within millennia.[40] A primary driver appears to be the nutritional value of post-weaning milk consumption, providing a critical calorie source during famines and promoting enhanced childhood growth through increased energy intake from glucose and galactose.[41][42]Archaeological and genetic evidence from ancient DNA supports this timeline, with LP alleles first detectable in European samples dating to around 5,000 BCE, coinciding with the expansion of dairying practices.[26] In Africa, convergent evolution of distinct LP variants occurred independently around 3,000 BCE, linked to local pastoral traditions.[43] Recent analysis of ancient genomes further ties LP to nutritional advantages, revealing that carriers of the LP allele exhibited increased stature by 0.20 standard deviations in prehistoric populations, underscoring the role of milk-derived nutrition in driving selection for height and survival.[38]
Gene-Culture Coevolution
Lactase persistence (LP) exemplifies gene-culture coevolution, where the cultural practice of dairy farming exerted selective pressure favoring the spread of LP alleles, while these alleles in turn facilitated more intensive milk consumption and the development of pastoral economies. In populations adopting animal domestication and milk use around 10,000 years ago, the nutritional availability of fresh milk created an environment where individuals with LP variants gained a survival advantage, particularly during periods of food scarcity or weaning, thereby increasing the frequency of these alleles over generations. This reciprocal dynamic is a hallmark of human niche construction, as the cultural transmission of dairying practices amplified genetic selection for LP.[36][2]A key feedback loop in this process involves milk serving as a reliable weaning food that reduced infant and child mortality rates among LP carriers, enabling herder communities to thrive and expand. Successful pastoralists not only passed on LP alleles genetically but also disseminated dairying knowledge culturally through social learning and migration, accelerating the allele's propagation beyond what natural selection alone could achieve. Simulations from the 2000s demonstrate this effect: cultural transmission of milk consumption increased the spread of the primary European LP allele (-13910*T) by 5-10 times compared to scenarios relying solely on genetic drift and selection, with models estimating origins around 7,500 years ago in Central Europe.[2]Illustrative cases highlight this coevolution in diverse contexts. Among Eurasian steppe herders like the Yamnaya culture (circa 3300–2500 BCE), the adoption of dairying—evidenced by proteomic analysis of dental calculus showing widespread milk consumption—coincided with population expansions, laying the groundwork for subsequent LP allele selection despite its initial rarity in these groups. Independently, African pastoralists such as the Fulani developed high LP frequencies linked to their nomadic cattleherding, driven primarily by the Eurasian-derived -13910*T allele, with Africa-specific variants like -14010G present at low frequencies, underscoring parallel gene-culture interactions in non-overlapping regions. Updated models from 2023 incorporating multiple alleles across continents confirm that such coevolutionary processes occurred repeatedly, reinforcing LP's role as a adaptive response to pastoralism in varied environments.[44][45][1]
Regional Evolutionary Histories
Europe and Central Asia
In Europe, the primary genetic variant associated with lactase persistence (LP), the -13910T allele in the MCM6 gene, is estimated to have arisen around 7,500 years ago in Central Europe, likely among early Neolithic populations practicing animal husbandry.[46] This allele's emergence coincided with the spread of dairy farming introduced by Linearbandkeramik (LBK) culture farmers, who migrated from the Near East and Anatolia into Central Europe around 5500 BCE, facilitating the initial dissemination of LP through pastoral economies reliant on cattle herding.[46]Ancient DNA evidence confirms the allele's absence in pre-Neolithic hunter-gatherers across Europe, underscoring its novelty in the context of early agricultural expansions.[47]By the Bronze Age, the -13910T allele appeared at low frequencies in populations such as the Corded Ware culture (circa 2900–2350 BCE), which spanned much of Northern and Central Europe and incorporated Indo-European pastoralist elements from the Eurasian steppes. This presence, though rare (less than 10% in sampled individuals), marks an early phase of allele diffusion amid broader migrations and cultural shifts toward intensified dairying.[48] In medieval Central Europe, ancient DNA from a 12th-century site in Dalheim, Germany, and other regions reveals a sharp increase, with LP frequencies exceeding 70% by around 1200 CE, reflecting ongoing strong positive selection in northern latitudes where dairy formed a dietary staple.[49]Archaeogenetic analysis of Iron AgeBritain (circa 800 BCE–43 CE) further illustrates rapid regional adaptation, with the -13910T allele frequency rising markedly during this period—reaching levels not seen on the continental mainland until over a millennium later—likely driven by local intensification of dairy practices among Celtic populations. This acceleration predates Roman influence and highlights Britain's insular dynamics in LP evolution.In Central Asia, the -13910T allele spread via Indo-European migrations from the Pontic-Caspian steppe, beginning around 4000–3000 BP, as Yamnaya-related herders moved eastward, introducing pastoralism and facilitating allele transmission among horse- and cattle-rearing groups. Among modern Central Asian populations with Indo-European linguistic ties, such as Kazakhs and Mongolians, LP frequencies range from 12% to 30%, attributed in part to historical reliance on fermented horse milk products that mitigated lactose intolerance while enabling gradual genetic adaptation. This pattern contrasts with higher European prevalences but underscores the allele's Eurasian-wide dispersal through shared migratory and dairying networks.
Africa and Middle East
In Africa, lactase persistence (LP) has evolved independently through multiple genetic variants, with the -14010C>G allele (rs145946881) being prominent among East African pastoralists. This variant emerged approximately 3,000 to 7,000 years ago, coinciding with the intensification of cattleherding practices in the region. Among Nilo-Saharan and Afro-Asiatic speaking groups, such as the Maasai in Kenya and the Tutsi in Rwanda and Burundi, LP frequencies range from 20% to 80%, reflecting strong selective pressure in dairy-dependent communities.Ancient evidence supports the early association of LP with pastoralism in northern Africa, where dairy consumption is attested from around 6,000 years ago through proteomic analysis of dental calculus from northeastern African individuals, indicating milk processing during the Neolithic period.[50]Ancient DNA from eastern African pastoralists around 3000 BCE shows low or absent LP alleles, aligning with the spread of cattle domestication from the Nile Valley southward, which facilitated the migration of herding economies across sub-Saharan landscapes.[51][33]In the Middle East, the -13907C>G allele (rs41525747) represents a key LP variant, with origins estimated around 4,000 BCE, linked to the expansion of pastoralism following the domestication of goats, sheep, and camels in the Arabian Peninsula and surrounding areas.[52] This allele is particularly elevated among Bedouin populations, where LP prevalence reaches 30-50%, supporting their traditional reliance on camel and goat milk as vital nutritional sources in arid environments.[24] A 2024 study in Jordan documented a notable urban-rural gradient, with LP incidence at 16% in urban dwellers, 30% among Jordan Valley farmers, and 62% in Bedouins, suggesting a decline in the trait's frequency with modernization and reduced pastoral lifestyles.[29]These regional developments illustrate convergent evolution of LP, where distinct alleles underwent parallel positive selection despite genetic differences, driven by the nutritional demands of consuming unfermented milk from camels and goats in pastoralist societies across Africa and the Middle East.
South Asia and Other Regions
In South Asia, lactase persistence (LP) is characterized by the presence of specific genetic variants, including the -13907C>G allele (rs41525747), which emerged approximately 4,000 years ago, likely linked to the introduction of pastoralism and Steppe migrations.[53] This variant, along with others such as -13910C>T shared with European populations, shows elevated frequencies in northwestern regions associated with historical herding practices. Among Punjabi herders in northwest India and Pakistan, LP prevalence ranges from 20% to 40%, reflecting adaptation to dairy consumption in agro-pastoral communities, while it remains low (often below 10%) in southern and eastern populations.[54] The overall lower LP across much of South Asia is attributed to cultural practices emphasizing fermented dairy products like yogurt and paneer, which reduce lactose content and diminish selective pressure for persistence alleles.[55]In the Americas, LP was rare among pre-Columbian indigenous populations, with frequencies below 5% and no evidence of local alleles or widespread dairy use prior to European contact in 1492.[35] Post-colonial admixture with Europeans introduced the -13910C>T allele, leading to increased LP proportional to European ancestry; for instance, admixed populations in Uruguay and Brazil exhibit LP rates of 50-60%.[35] Among US Hispanics, particularly those of Mexican descent, LP reaches approximately 50% due to varying degrees of European genetic contribution, though gastrointestinal symptoms persist in non-persistent individuals consuming dairy.[56]LP prevalence in Oceania and Australia is low among indigenous groups, typically 0-10%, with near absence of known persistence alleles in Aboriginal Australians, consistent with the lack of historical dairy pastoralism.[57] European descendants in these regions show higher rates mirroring global patterns for that ancestry.
Hypotheses for Evolutionary Advantages
Nutritional and Developmental Benefits
Lactase persistence (LP) enables adults to digest lactose, the primary carbohydrate in milk, into glucose and galactose, providing a significant source of energy. A standard cup (approximately 240 ml) of cow's milk contains 100-150 kcal, with lactose contributing about 12 grams of carbohydrates that yield roughly 48 kcal upon hydrolysis, making milk an efficient calorie source in environments where other foods may be scarce.[58] Individuals with LP derive up to 70% more calories from equivalent volumes of milk compared to those without, as undigested lactose in non-persistent individuals passes unabsorbed, reducing net energy gain.[58] This caloric advantage from fresh milk consumption has been proposed as a key driver for the evolution of LP, particularly during periods of nutritional stress in early childhood when growth demands are high.[59]Developmental benefits of LP include enhanced childhood growth metrics, such as increased height and weight, linked to sustained milk intake. Studies in modern populations show that children with the LP genotype and higher milk consumption exhibit greater body height, with associations persisting into adolescence.[60] In dairy-reliant groups, LP correlates with accelerated linear growth and improved weight-for-age, supporting overall physical development during critical periods.[61] Ancient DNA analyses further confirm these effects, revealing that carriers of the LP allele in prehistoric populations had statures approximately 0.20 standard deviations taller than non-carriers, indicating a substantial genetic contribution to height independent of other factors.[62]LP also promotes bone health by facilitating greater intake of milk-derived calcium and vitamin D, essential for mineralization and skeletal integrity. Milk is a primary dietary source of bioavailable calcium, with LP allowing persistent consumption that enhances absorption efficiency compared to lactose-intolerant individuals who may limit dairy.[63] Overall, these nutritional pathways underscore LP's role in optimizing developmental outcomes through dairy utilization.[61]
Environmental and Pathogen Resistance Hypotheses
One hypothesis posits that lactase persistence (LP) evolved to facilitate vitamin D-independent calcium absorption in regions with limited sunlight, such as northern latitudes, where low UVB exposure impairs endogenous vitamin D synthesis essential for calcium uptake. This "calcium assimilation hypothesis," originally proposed in the 1970s, suggests that digesting lactose enhances calcium bioavailability from milk, mitigating risks of rickets and osteomalacia in early dairy-consuming populations reliant on low-calcium cereal-based diets.[2] Computational models simulating LP spread in Europe indicate stronger selective pressures at higher latitudes (selection coefficients of 0.8–1.8%), correlating with archaeological evidence of dairying from around 8500 BP.[64] However, empirical studies in Iberia reveal that while LP alleles show positive selection, latitude alone does not fully explain their distribution, suggesting calcium absorption was influential but not the sole driver.[65]In arid environments, LP may have provided adaptive advantages through milk's role in hydration and nutrient delivery for pastoralist groups facing water scarcity and climate variability. African pastoralists, such as the Fulani (~50% LP frequency) in West Africa and the Tutsi (up to 90%) in East Africa, exhibit high LP frequencies, coinciding with the adoption of herding practices in semi-arid regions where milk serves as a reliable source of fluids and calories during dry seasons or droughts. Genetic and isotopic evidence from ancient remains supports this, showing LP alleles emerging alongside pastoralism around 3000–5000 years ago in response to environmental shifts toward drier conditions, enabling sustained mobility and survival in resource-limited landscapes.[3] For instance, migrations of herders through Tanzania to southern Africa carried LP variants, linking genetic adaptation to arid adaptations in Khoe populations.A proposed link between LP and pathogen resistance involves heterozygote advantages against malaria in endemic areas, where LP alleles might confer partial protection via milk's immunomodulatory effects or nutrient competition with Plasmodium parasites. Preliminary studies from the early 2000s in Mali's Fulani population, known for both high dairy intake and innate malaria resistance, found lower asymptomatic Plasmodium falciparum parasitemia (18% vs. 24%) among LP individuals compared to non-persistent genotypes, though differences were not statistically significant (P=0.29).[66] This suggests potential benefits from milk consumption, such as para-aminobenzoic acid (PABA) deficiency impairing parasite growth, but the data remain inconclusive and require larger-scale validation.[67]Recent reviews critique these environmental hypotheses, emphasizing that nutritional energy gains and growth promotion better explain LP selection than latitude-specific calcium needs or arid hydration alone. A 2023 analysis argues that lactose digestion yields substantial caloric benefits (up to 20% more from milk), driving rapid allele fixation in dairying populations beyond what vitamin D limitations predict. Similarly, pathogen resistance models, including malaria links, show weaker selective signals compared to famine or growth-related pressures.[68] Counterexamples, such as low LP prevalence among Mongolian herders (despite heavy reliance on fermented dairy like airag, which reduces lactose content via bacterial action), highlight that cultural processing mitigates needs for genetic persistence in some arid, milk-dependent groups.[4]
Health Implications
Modern Nutritional Effects
Adults with lactase persistence (LP) can tolerate higher dairy consumption without experiencing lactose intolerance symptoms, leading to increased intake of calcium-rich foods compared to those with lactase non-persistence (LNP).[69] This elevated dairy intake is associated with improved bone mineral density, particularly at the femoral neck, as demonstrated in Mendelian randomization studies linking genetically predicted milk consumption to higher bone density in middle-aged and older populations.[70] Furthermore, higher consumption of certain dairy products like yogurt and cheese has been linked to a reduced risk of hip fractures in meta-analyses of prospective cohorts, with relative risk reductions of up to 20-30% for moderate intakes.[71]However, excessive dairy intake enabled by LP may carry risks, including potential associations with increased prostate cancer incidence, where high milk consumption (over 400 g/day) correlates with a 25% higher risk, possibly mediated by elevated insulin-like growth factor 1 (IGF-1) levels from milk.[72] Similarly, milk consumption has been implicated in aggravating acne through IGF-1-induced sebaceous gland activity and inflammation, with meta-analyses showing a modest odds ratio of 1.2-1.5 for acne in high-dairy consumers.[73] These risks are often balanced by the overall nutritional benefits of dairy, such as contributions to protein and micronutrient needs, when consumed in moderation within a varied diet.[74]In response to varying LP prevalence globally, the market for lactose-free dairy products has surged, growing from approximately $12.9 billion in 2024 to $13.9 billion in 2025, driven by demand in regions with high LNP rates like Asia and Africa.[75] Public health policies, such as U.S. dietary guidelines recommending dairy for calcium, have faced scrutiny for potential racial biases, as LP is less common in African American and Asian populations, prompting calls for more inclusive alternatives to ensure equitable nutritionaccess.[76]Studies in multi-ethnic U.S. adults indicate that the LP genotype correlates with higher lactose and dairy intake, as well as improved status of key micronutrients like calcium and vitamin D, supporting better overall dietary nutrient profiles in LP individuals.[69]
Associations with Microbiota and Disease
Lactase persistence (LP) has been associated with distinct gut microbiota compositions compared to lactase non-persistence (LNP), particularly in relation to lactose intake levels. In healthy U.S. adults, individuals with the LP genotype (rs4988235 AA/AG) consume higher average daily lactose (approximately 12 g/day) than those with LNP (GG, approximately 9 g/day), influencing microbial abundances. LNP individuals with elevated lactose intake (>12 g/day) exhibit increased relative abundances of Firmicutes families such as Lactobacillaceae and Lachnospiraceae, which are involved in lactatefermentation, whereas LP individuals show lower levels of these taxa, potentially due to efficient host lactase digestion reducing undigested lactose available for microbial utilization.[77] Similarly, a 2025 study in a multi-ethnic U.S. cohort found that persistent lactase activity in LP genotypes competitively excludes lactic acid bacteria (e.g., Lactobacillus and Lactococcus genera), leading to reduced abundances of these Firmicutes compared to LNP individuals with high lactose intake (>10 g/day), who displayed enriched lactate-utilizing taxa like Veillonella and Megamonas.[78]The LP genotype also modulates microbiota diversity and function. The rs4988235 variant influences overall microbial diversity in healthy adults, with LP associated with greater beta-diversity in some cohorts, reflecting shifts in functional pathways such as beta-galactosidase activity, which is higher in LNP regardless of intake. Higher lactose consumption in LP individuals correlates with altered Firmicutes/Bacteroidetes ratios, promoting a more diverse Firmicutes profile without the pronounced lactate fermentation seen in LNP. These genotype-specific interactions highlight how LP facilitates lactose metabolism primarily in the small intestine, minimizing microbial adaptations in the colon, as evidenced by recent metagenomic analyses.[79] Studies from 2024 and 2025 have begun addressing prior gaps in understanding post-2020 microbiota-LP dynamics, emphasizing the role of undigested lactose as a prebiotic in LNP, which enhances short-chain fatty acid production like propionate.[78]Regarding disease associations, LP is linked to reduced risk of irritable bowel syndrome (IBS) symptoms, as LNP genotypes correlate with higher prevalence of lactose intolerance-related gastrointestinal distress, including bloating and diarrhea, which overlap with IBS phenotypes. A meta-analysis confirmed that lactose maldigestion, more common in LNP, is not independently associated with IBS, but intolerance symptoms are, suggesting LP confers protection by preventing osmotic diarrhea and microbial over-fermentation.[80] A 2024 study further indicated that higher milk intake in LNP individuals is associated with lower type 2 diabetes risk, potentially mediated by gut microbiota effects including enrichment of Bifidobacterium species.[81] No strong, consistent links exist between the LP genotype and cancer risks; early correlations with ovarian or prostate cancers appear driven by dietary milk consumption rather than the genotype itself, as confirmed in genetic studies.[82][83]
Comparative Aspects
Lactase in Other Mammals
In most mammals, lactaseenzyme activity is high at birth to facilitate the digestion of lactose in maternal milk, but it undergoes a significant post-weaning decline, typically dropping to less than 10% of neonatal levels by adulthood.[84] This downregulation, often exceeding 90-99% reduction in expression, aligns with the shift to a post-nursing diet lacking lactose, conserving energy by limiting unnecessary enzyme production in the small intestine.[14] The pattern is conserved across diverse mammalian species, from rodents to carnivores, reflecting an adaptive response to dietary changes after infancy.[85]Humans stand out as the only species exhibiting widespread lactase persistence (LP) into adulthood, enabling continued lactose digestion beyond weaning—a trait absent in other mammals without exception until recent evolutionary contexts.[1] This human-specific adaptation emerged in conjunction with animal domestication around 10,000 years ago, particularly in pastoralist societies where fresh milk became a dietary staple, driving strong selective pressure for LP alleles.[1] In non-human primates, such as rhesus, bonnet, and owl monkeys, adult lactase levels remain low, mirroring the typical mammalian decline and confirming no natural LP in primate lineages outside humans.[86]Rare exceptions to the mammalian norm occur in domesticated species with historical milk exposure. In dogs, LP is breed- and region-dependent, with a key adaptive mutation (LCT-G) present in 91.7% of European breeds, 61.8% of Southeast Asian indigenousdogs, and only 6.1% of gray wolves, indicating selection for milk digestion in dairy-associated populations starting less than 6,500 years ago.[87] Natural variants conferring LP have not been documented in wild mice or other undomesticated mammals, though lab models using human-derived mutations demonstrate retained juvenile lactase expression.[88] Evolutionarily, human LP represents a form of neoteny, where the pre-weaning lactase production pattern is extended into adulthood, unlike the strict ontogenetic shutdown in other species.[2]
Exceptions and Variations
While mammals typically express lactase-phlorizin hydrolase (LPH) for lactose digestion during infancy, non-mammalian vertebrates and invertebrates generally lack this enzyme due to the absence of milk production in their lineages. In birds, such as chickens, homologues of the mammalian LCT gene encoding LPH have been identified through reverse transcriptase-PCR, producing transcripts similar in size and structure, including a membrane-anchoring domain. However, these avian proteins show sequence similarity to mammalian lactase but lack confirmed functionality in hydrolyzing lactose, as birds do not consume milk. Reptiles similarly exhibit no evidence of LPH expression tailored to lactose digestion, consistent with their evolutionary divergence from milk-producing lineages. In contrast, some fish possess analogous β-galactosidase enzymes in their intestines that cleave β-1,4-galactosidic bonds in plant-derived polysaccharides, aiding carbohydrate breakdown; for instance, in Tilapia nilotica, intestinal β-galactosidase activity is high in the juice of the upper and middle intestine, with optimal function at pH 5.0 and 40°C, though it primarily targets substrates like galactan rather than lactose.[89][89][90]Wildlife variations in lactase expression highlight adaptations to diets low in lactose, reducing selective pressures for persistence beyond weaning. In carnivorous mammals like polar bears (Ursus maritimus), adult lactase activity remains undetermined but is likely minimal, as their milk contains only about 0.49% lactose—far lower than in many herbivores—minimizing the need for sustained enzyme production in offspring or adults. Polar bear diets, dominated by high-fat seal blubber rather than milk, further alleviate any potential demand for lactose digestion in maturity, contrasting with herbivorous or omnivorous mammals under dairy-related pressures. This low-lactose profile in ursid milk exemplifies how ecological niches in carnivores can preserve the ancestral pattern of lactase decline without evolutionary retention for adult use.[91][92]Laboratory models have elucidated regulatory mechanisms of lactase persistence through genetic engineering. Transgenic mice incorporating 3.3 kb of human LPH 5′ flanking sequence from lactase-persistent individuals express the transgene in the small intestine during the suckling period, mirroring developmental patterns, but activity declines post-weaning, akin to murine lactase regulation. When the human lactase enhancer element is included, it drives persistent expression into adulthood in these mice, demonstrating that human regulatory variants can override the typical post-weaning silencing observed in rodents. These knock-in models reveal key differences in enhancer-driven epigenetics and transcription factors, such as Oct-1 binding, that prevent methylation-based repression in persistent humans but not in mice.[88][88][21]In human populations with high lactase persistence, such as northern Europeans, rare cases of lactase non-persistence (LNP) arise from compound heterozygous mutations in the LCT coding region, leading to congenital lactase deficiency (CLD). The Finnish-major allele (c.4170T>A, p.Y1390X), a nonsense mutation, is prevalent (carrier frequency ~1:35 in some regions) and combines with rarer variants like frameshifts (e.g., c.4998_5001delTGAG) or missenses (e.g., c.804G>C, p.Q268H) to abolish enzyme function from birth. These compound heterozygotes, documented in Finnish families, exhibit severe watery diarrhea upon milk ingestion despite the population's ~90% adult persistence rate, underscoring how biallelic loss-of-function disrupts the dominant LP trait. Such outliers highlight the distinction between regulatory LNP (common globally) and coding mutations causing outright deficiency.[93][93][93]
Research Considerations
Confounding Factors
Research on lactase persistence (LP) is often complicated by self-reporting bias, where individuals' perceptions of lactose intolerance symptoms do not always align with their actual genetic phenotype. Symptoms such as bloating, diarrhea, and abdominal pain can vary widely in severity and may be influenced by individual tolerance thresholds, leading to under- or over-reporting of intolerance. In populations with high cultural emphasis on dairy consumption, non-persistent individuals may adapt by continuing milk intake despite mild symptoms, masking the true prevalence of lactase non-persistence and weakening correlations between self-reports and genetic markers like the -13910C/T variant. For instance, studies have shown no strong association between self-reported milk avoidance and LP genotypes in groups accustomed to dairy, as cultural exposure encourages persistence in consumption regardless of underlying physiology.[94][95]Admixture and population structure further confound genetic analyses of LP by introducing heterogeneous ancestry that can mimic or obscure selection signals. In admixed populations, such as those in West Africa or among the Fulani nomads, European or North African genetic contributions carrying the LP allele (e.g., -13910T) can inflate apparent persistence frequencies without reflecting local adaptation. This requires statistical corrections for ancestry, such as principal component analysis or admixturemapping, to disentangle true LP variants from background genetic drift. Failure to account for these effects has led to overestimations of LP spread in regions with recent migration histories, particularly in Africa where multiple LP alleles coexist amid complex population histories.[96][24][45]Age and gender introduce potential confounders in LP studies, though evidence for significant effects on lactase activity or symptom reporting is lacking. Reviews indicate no inherent susceptibility differences based on age or sex, with symptom responses primarily influenced by lactose dose, body size, and genotype rather than these demographic factors. These considerations still necessitate age- and sex-stratified analyses to control for possible variations in study cohorts and ensure robust phenotype-genotype correlations.[97][98]Recent studies from 2023 to 2025 have increasingly highlighted the gut microbiota as a key confounder in LP research, mediating the relationship between genotype and phenotypic outcomes. In lactase non-persistent individuals consuming high-lactose diets, microbial communities adapt by enriching taxa like Lactobacillaceae and lactate-utilizers (e.g., Veillonella), which ferment undigested lactose and produce short-chain fatty acids such as propionate, potentially alleviating symptoms and decoupling genetic predictions from observed tolerance. This microbial compensation can explain discordant genotype-phenotype matches, as metagenomic analyses reveal elevated β-galactosidase genes in non-persistent guts regardless of host lactase levels. Such findings underscore the need to incorporate microbiome profiling in future LP studies to clarify environmental-genetic interactions.[99][100][101]
Diagnostic Methods
The hydrogen breath test serves as the gold standard for diagnosing lactose malabsorption, a key indicator of lactase non-persistence. In this noninvasive procedure, the patient ingests a standardized lactose load, typically 25 grams dissolved in water, after an overnight fast. Breath samples are then collected at baseline and at intervals (usually every 15-30 minutes) over 2-3 hours to measure exhaled hydrogen (H₂) levels using a breath analyzer. Undigested lactose reaches the colon, where it is fermented by gut bacteria, producing H₂ that is absorbed into the bloodstream and exhaled; an increase of more than 20 parts per million (ppm) above baseline indicates malabsorption.[102][103] This test is preferred for its simplicity and patient comfort, though false positives can occur in cases of small intestinal bacterial overgrowth.[104]Genetic testing provides a direct assessment of lactase persistence status by targeting single nucleotide polymorphisms (SNPs) in the MCM6 gene enhancer region, particularly the rs4988235 variant (also known as -13910C>T). The presence of the T allele is associated with lactase persistence in individuals of European descent, while the CC genotype indicates non-persistence. Testing typically involves polymerase chain reaction (PCR) amplification followed by genotyping via methods like TaqMan assays or next-generation sequencing, often accessible through direct-to-consumer (DTC) kits such as those from 23andMe. In European populations, this SNP predicts lactase persistence with approximately 90% accuracy when correlated with phenotypic tests, though sensitivity drops to around 87% and accuracy is lower in non-European groups due to population-specific variants.[84][105][54]The blood glucose tolerance test, or oral lactose tolerance test, evaluates lactase activity through systemic glucose response but is less commonly used due to its invasive nature requiring multiple venipunctures. After fasting, the patient consumes 50 grams of lactose, and blood glucose levels are measured at baseline and at 30, 60, and 120 minutes post-ingestion. A rise of less than 20 mg/dL above baseline suggests malabsorption, as insufficient lactase hydrolysis prevents glucose-galactose absorption. This method correlates well with breath tests but is avoided in favor of noninvasive alternatives, particularly in pediatric or anxious patients.[106][103]Recent advancements, including 2024 studies, have begun integrating gut microbiota analysis to resolve ambiguous diagnoses in lactase non-persistence cases, where traditional tests yield inconclusive results. Metagenomic sequencing of fecal samples can identify microbial shifts, such as reduced Bifidobacterium levels, that influence lactose fermentation and symptom severity, complementing genetic and breath-based assessments. European guidelines updated in recent reviews emphasize optimizing breath test protocols while exploring microbiota profiling for personalized diagnostics, particularly in diverse populations.[107][108]