Fact-checked by Grok 2 weeks ago

Gene cluster

A gene cluster is a group of two or more genes that are physically linked in close proximity on a , often encoding proteins with related functions and exhibiting coordinated or conserved organization across species. These clusters arise through evolutionary processes such as and rearrangement, providing selective advantages like efficient co-expression and functional integration. Gene clusters occur in both prokaryotes and eukaryotes, with diverse roles in metabolism, development, and immunity. In prokaryotes, they frequently organize as operons or neighboring transcriptional units encoding enzymes for biosynthetic pathways, particularly for secondary metabolites in bacteria and fungi that contribute to ecological adaptations like antibiotic production. A biosynthetic gene cluster (BGC) is specifically defined as a physically clustered group of two or more genes that together encode a complete enzymatic pathway for producing specialized compounds. In eukaryotes, prominent examples include the clusters, which consist of homeobox-containing transcription factors arranged in a collinear manner to direct anterior-posterior body patterning during embryonic in bilaterian animals. Another key instance is the globin gene clusters on human chromosomes 16 and 11, which encode α- and β-like globin chains essential for assembly and oxygen transport, with expression developmentally regulated across fetal and adult stages. The evolutionary conservation and dynamics of gene clusters highlight their biological significance, enabling mechanisms like in microbes and sequence exchange via gene conversion in multicellular organisms. Disruptions in these clusters, such as rearrangements or dysregulation, are implicated in diseases including developmental disorders, thalassemias from mutations, and immune deficiencies from (MHC) variations. Advances in have revealed thousands of such clusters, aiding applications and insights into proteome diversity.

Overview

Definition

A gene cluster is a group of two or more genes that are physically clustered on the same and typically encode proteins with related functions or participate in the same biochemical pathway. These genes cooperate to perform coordinated biological roles, such as in metabolic processes or developmental pathways. The concept of gene clusters gained prominence in the 1960s through the discovery of the in , where and Monod described a set of adjacent genes regulated together to metabolize . This bacterial example highlighted how clustered genes enable efficient, coordinated expression. In eukaryotes, similar clusters were later identified, such as the human alpha-globin gene cluster on , characterized in the late 1970s and early 1980s as containing multiple alpha-globin genes essential for production. Identification of gene clusters relies on three primary criteria: physical proximity within the , typically spanning tens to hundreds of kilobases; functional relatedness among the encoded products, such as involvement in a shared pathway; and frequent , often via common promoter regions or enhancers that synchronize their expression. These features distinguish gene clusters from randomly distributed genes and facilitate their detection through genomic sequencing and comparative analyses.

Key Characteristics

Gene clusters are characterized by their physical linkage, where multiple genes are located in close proximity on the , typically spanning from a few kilobases to several megabases of DNA. This arrangement often minimizes intervening non-coding regions, particularly in prokaryotic organisms, facilitating efficient transcription and reducing the risk of disruptions from transposable elements or recombination events. In contrast to scattered genes, this ensures that the genes remain inherited together, preserving their functional integrity across generations. A defining functional property of gene clusters is the similarity in the roles of their constituent s, which are frequently involved in sequential or interdependent steps of a biochemical pathway. For instance, biosynthetic gene clusters enzymes that catalyze the production of secondary metabolites, such as antibiotics or pigments, enabling organisms to respond to environmental challenges. This coordinated functionality contrasts with dispersed genes, where pathway components may be spread across the genome, potentially complicating stoichiometric balance and efficiency. Coordinated regulation represents another hallmark, achieved through shared cis-regulatory elements like promoters, enhancers, or insulators that synchronize in response to specific developmental stages, environmental cues, or cellular conditions. Such mechanisms lead to similar expression patterns across tissues or stressors, enhancing the overall of the pathway without requiring individual regulatory controls for each . Gene clusters exhibit considerable variability in size, ranging from small assemblies of 2-3 genes to expansive regions encompassing dozens or more, depending on the organism and pathway complexity. A prominent example is the (MHC) in vertebrates, which spans approximately 4 megabases and includes over 200 genes involved in immune recognition and response. This scalability allows gene clusters to adapt to diverse biological demands while maintaining their core organizational principles.

Evolutionary Formation

Gene Duplication Mechanisms

is a fundamental process in that contributes to the formation of gene clusters by generating paralogous copies of s, which can be retained and organized in tandem or dispersed arrangements. These duplications provide raw material for functional innovation while often leading to non-functionalization or subfunctionalization of copies. In the context of gene clusters, duplications are particularly significant when they result in the juxtaposition of paralogs, facilitating coordinated expression or shared regulatory elements. Tandem duplication involves the local copying of adjacent genes, typically through mechanisms such as during or replication errors like fork stalling and template switching (FoSTeS). occurs when misaligned homologous exchange segments, producing one with duplicated and another with a deletion, often resulting in head-to-tail arrangements of paralogs that form the basis of many gene clusters. Replication errors, including those mediated by microhomology-mediated break-induced replication (MMBIR), can also generate tandem duplicates during , particularly in regions with repetitive sequences. These processes are prevalent in both prokaryotes and eukaryotes, contributing to clusters involved in metabolic pathways or stress responses. Segmental duplication refers to the copying of larger chromosomal segments, often spanning multiple , which can lead to the formation or expansion of gene clusters through non-allelic (NAHR). NAHR arises from misalignment and recombination between low-copy repeats or segmental duplicates during , generating duplications that may integrate into existing clusters or create new ones with paralogous gene families. This mechanism is common in mammalian genomes, where it contributes to structural variation and the evolution of immunity-related gene clusters, such as those encoding olfactory receptors. Unlike tandem events, segmental duplications can relocate genes over longer distances, influencing cluster diversity. Whole-genome duplication (WGD) events, primarily observed in eukaryotes, involve that doubles the entire complement, leading to widespread paralog retention in clusters across lineages like and vertebrates. In , multiple ancient WGDs have shaped families in clusters related to , while in vertebrates, the teleost fish-specific WGD approximately 350 million years ago resulted in retained paralogs forming clusters for developmental s. These events provide a genome-wide source of duplicates that can coalesce into clusters through subsequent chromosomal rearrangements. Post-WGD, paralogs in clusters often undergo biased retention due to dosage sensitivity or regulatory interdependence. Overall, only about 10-20% of gene duplicates from these are retained long-term, with higher retention rates observed for those integrated into clusters, as this arrangement promotes functional through neofunctionalization or subfunctionalization. This selective retention underscores the role of clusters in buffering against deleterious effects while enabling evolutionary adaptability.

Divergence and Clustering Processes

Following gene duplication, the resulting paralogous genes undergo sequence divergence through the accumulation of point mutations, insertions, and deletions in their coding and non-coding regions. This divergence can lead to subfunctionalization, where the ancestral gene's functions are partitioned between the duplicates, or neofunctionalization, where one or both copies acquire novel functions, thereby preserving both genes from degenerative loss. For instance, coding-sequence divergence increases with the age of duplicates, often driven by relaxed purifying selection on redundant copies, allowing exploration of new adaptive roles. Over evolutionary time, selection pressures favor the maintenance of physical proximity among these diverging s to enable coordinated , particularly for genes involved in shared pathways or developmental processes. This clustering is reinforced by the reduction or loss of intergenic spacers, which minimizes regulatory and facilitates co-transcription, as seen in prokaryotic operons where short spacers promote polycistronic mRNA production for efficient pathway expression. In eukaryotes, similar selective forces preserve gene clusters by enhancing chromatin-level , reducing the fitness costs of dispersed loci and promoting transcriptional coherence across related genes. Post-duplication, cis-regulatory elements evolve to support clustering, with duplicated genes often acquiring or sharing enhancers and silencers that drive synchronized expression patterns. This includes the development of shared regulatory modules that partition or expand ancestral control, ensuring precise spatiotemporal activation within clusters. A notable example occurs in Hox clusters, where conserved enhancers post-duplication coordinate collinear expression along the anterior-posterior axis. Genomic analyses since 2010 have revealed that transposable elements (TEs) play a crucial role in facilitating clustering by mediating chromosomal rearrangements, such as inversions and translocations, that bring dispersed paralogs into proximity. These mobile elements provide substrates for unequal recombination, accelerating the assembly of stable clusters in both prokaryotes and eukaryotes, particularly in biosynthetic pathways. For example, miniature inverted-repeat TEs have been implicated in the formation of plant gene clusters through targeted insertions and recombinations.

Theoretical Models

Fisher Model

The Fisher Model conceptualizes genes as points within a multidimensional phenotypic space, where an organism's is represented as a vector in this space, and decreases with the from an optimal phenotypic point. Originally developed by in the 1930s to describe , the model posits that mutations shift the phenotypic position, with beneficial mutations being those that move closer to the optimum. In the context of gene clusters, this framework helps explain how gene duplications can facilitate incremental by allowing parallel shifts in phenotypic space, particularly when multiple genes contribute to the same pathway or trait. Retention of duplicate genes is favored when they diverge to specialize in subcomponents of a complex pathway, reducing the distance to the phenotypic optimum more effectively than single-gene mutations alone; this is particularly relevant for clustered genes where coordinated effects may enhance without pleiotropic costs. The mathematical basis of the model lies in the fitness landscape, where the dimensionality (number of traits) influences the probability of beneficial mutations, and duplicate genes enable finer-grained exploration of this landscape. This probability scales with pathway complexity, as more genes increase the potential for coordinated adaptation. The model assumes a neutral or isotropic fitness landscape where all trait dimensions are equivalent, facilitating geometric predictions but overlooking ruggedness from epistatic interactions. Post-2015 studies have critiqued it for neglecting regulatory constraints, such as transcriptional interference in clusters or horizontal transfer dynamics in prokaryotes, which can disrupt coadaptation despite geometric advantages.

Coregulation and Molarity Models

The model proposes that clusters evolve to facilitate shared cis-regulatory elements, such as enhancers and promoters, which coordinate the transcriptional activation or repression of multiple simultaneously. This arrangement minimizes the risk of —where are inappropriately activated in non-native contexts—by allowing a single regulatory signal to influence an entire cluster through physical proximity in the . looping studies, particularly those utilizing techniques developed in the late 2000s and applied extensively in the , have provided empirical support for this model by demonstrating frequent long-range interactions between clustered and their shared regulatory regions. For example, in eukaryotic systems, data reveal loops that juxtapose paralogous with common enhancers, enabling synchronized expression during developmental processes like . The molarity model complements by focusing on the biochemical advantages of clustering, where the increases the local concentration of products, thereby enhancing the efficiency of protein-protein interactions and stoichiometric assembly in functional pathways. In metabolic clusters, for instance, enzymes encoded by adjacent genes can form transient complexes more readily due to reduced distances, leading to faster rates and reduced intermediate leakage. This effect is particularly pronounced in prokaryotic operons, such as those involved in , where clustering ensures near-equimolar production of interacting proteins, as observed in the arginine deiminase pathway. Seminal analyses have highlighted how such local molarity provides a selective advantage, especially in resource-limited environments. Integrative frameworks from 2020s computational models combine and molarity by positing that evolutionary selection favors clusters optimizing both transcriptional coordination and local biochemical efficiency. Such approaches simulate how gene order and proximity balance regulatory sharing with interaction kinetics, as tested in constructs and pathway reconstructions. Recent single-cell analyses further validate these models by revealing cluster-specific co-expression patterns, with genes in genomic clusters exhibiting higher transcriptional synchrony across cell populations compared to dispersed homologs, underscoring the persistence of these mechanisms in heterogeneous tissues.

Types and Distinctions

Prokaryotic Gene Clusters

In prokaryotes, including bacteria and archaea, gene clusters are exemplified by operons, which consist of contiguous genes transcribed coordinately from a single promoter into a polycistronic mRNA molecule that encodes multiple proteins. This organization enables efficient, synchronized expression of genes involved in shared metabolic pathways, minimizing regulatory complexity in compact genomes. A well-studied example is the lac operon in Escherichia coli, where the lacZ, lacY, and lacA genes are co-transcribed to produce enzymes essential for lactose catabolism: β-galactosidase for hydrolysis, permease for transport, and transacetylase for detoxification of non-metabolizable analogs. This inducible system responds to lactose availability, illustrating how operon-based clusters facilitate rapid adaptation to environmental substrates. Prokaryotic gene clusters extend to biosynthetic gene clusters (BGCs) that direct the of secondary metabolites, such as antibiotics and pigments, often spanning 10–100 and comprising dozens of . Polyketide synthase (PKS) clusters, for instance, encode modular enzymes that iteratively assemble backbones, yielding structurally diverse compounds with ecological roles in competition and defense. These BGCs frequently integrate tailoring enzymes for post-synthetic modifications, enhancing metabolite complexity. The evolutionary dynamics of prokaryotic gene clusters are driven by (HGT), which allows to acquire intact clusters via conjugation, , or , promoting rapid innovation in . HGT fosters cluster assembly by juxtaposing compatible genes but also enables disassembly through mobile elements or recombination, contributing to genomic plasticity and biodiversity. This mechanism is particularly prevalent in BGCs, where phylogenetic incongruence signals frequent interspecies exchange. Metagenomic surveys indicate that nearly all bacterial contain biosynthetic gene clusters, with an average of approximately 3–6 per genome, underscoring their prevalence in environmental and host-associated microbiomes and their role as hotspots for undiscovered natural products. Co-regulation within these clusters, often via shared promoters and cis-regulatory elements, ensures stoichiometric production of pathway components, as detailed in broader genomic analyses.

Eukaryotic Gene Clusters and Tandem Arrays

In eukaryotes, gene clusters refer to groups of functionally related located in close genomic proximity, often facilitating coordinated regulation through shared enhancers, insulators, or domains, though they are typically transcribed as individual monocistronic units rather than polycistronic operons as in prokaryotes. This contrasts with bacterial systems by decoupling from due to the , yet it enables efficient coexpression for developmental or metabolic purposes. Such clusters evolve through mechanisms like local gene duplications, inversions, or translocations, and their prevalence varies across eukaryotic lineages, with notable examples in , fungi, and plants. Prominent examples include the clusters, which are highly conserved across bilaterian animals and encode homeodomain transcription factors that specify anterior-posterior body patterning during embryogenesis. In vertebrates, whole-genome duplications have expanded the ancestral single cluster into four paralogous clusters (HoxA, HoxB, HoxC, and HoxD) on separate chromosomes, each containing 8–11 genes arranged in a collinear manner that mirrors their spatial and temporal expression along the body axis. Similarly, globin gene clusters in mammals, such as the human alpha-globin cluster on and beta-globin cluster on , organize embryonic, fetal, and adult genes in the order of their developmental activation, allowing stage-specific expression regulated by locus control regions. gene clusters, essential for assembly and packaging, form large arrays in many eukaryotes; the human major cluster on spans over 60 genes encoding core histones H2A, H2B, H3, and H4, with replication-dependent expression tightly coupled to the of the . In fungi and , metabolic clusters often involve non-homologous genes for biosynthetic pathways, particularly secondary metabolites. For instance, the sterigmatocystin cluster in the Aspergillus nidulans comprises 25 coregulated genes spanning ~150 kb, enabling precursor production for defense or toxicity. In , the benzoxazinoid cluster in and ( family) assembles independently across species via transposition and duplication, producing defensive compounds against herbivores and pathogens. These clusters enhance pathway efficiency by localizing enzymes, minimizing intermediate diffusion, and allowing rapid evolutionary adaptation through gene recruitment. Tandem arrays, a subset of eukaryotic gene clusters, consist of duplicated genes arranged head-to-tail in direct repeats, often arising from unequal crossing-over or replication slippage, and representing ~10–14% of genes in vertebrate genomes. They are prevalent in multigene families requiring high copy number or diversity, such as the ribosomal RNA (rRNA) genes forming hundreds of tandem repeats at nucleolar organizer regions for ribosome biogenesis, or olfactory receptor genes in mammals, where large arrays (>1,000 copies) on multiple chromosomes enable sensory adaptation. In vertebrates, tandemly arrayed genes (TAGs) account for ~25% of all duplications, with most arrays containing only two members and showing parallel transcriptional orientation, facilitating concerted evolution through gene conversion to maintain sequence homogeneity. Examples include the human SPANX family (6 genes in two arrays) for sperm surface proteins and immunoglobulin variable region genes in jawed vertebrates, which undergo somatic hypermutation and recombination for immune diversity. In non-vertebrates like dinoflagellates, tandem arrays of housekeeping genes exhibit high sequence conservation despite rapid evolution elsewhere in the genome, suggesting selective pressure for coordinated expression. Overall, these structures provide evolutionary flexibility, dosage amplification, and regulatory simplicity, though they are prone to instability from recombination.

Biological Significance

Functional Roles

Gene clusters confer adaptive advantages by optimizing pathway efficiency through the maintenance of stoichiometric in the production of multi-subunit protein complexes. In hemoglobin synthesis, the α-globin gene cluster on , comprising multiple α-like genes (HBA1, HBA2, and pseudogenes), coordinates with the β-globin cluster on to ensure equimolar expression of α and β subunits, forming the functional α₂β₂ tetramer essential for oxygen transport. This clustered organization prevents the toxic aggregation of excess unpaired chains, which occurs when is disrupted. Similarly, histone gene clusters in eukaryotes, such as the major cluster on human , enable synchronized transcription of H2A, H2B, , and H4 genes to produce proteins in precise ratios for assembly, supporting structure and DNA packaging. Without such coordination, imbalances would impair cellular processes like replication and transcription. In , gene clusters facilitate precise spatiotemporal regulation of expression, ensuring genes are activated in the correct sequence and location during embryogenesis. clusters exemplify this, where large regulatory landscapes encompassing enhancers and silencers drive collinear expression patterns that dictate anterior-posterior body axis formation in vertebrates. This tight control minimizes off-target effects and supports robust patterning. Additionally, the redundancy inherent in many clusters buffers against deleterious mutations; for instance, multiple functional α-globin genes in the cluster allow partial deletions to be tolerated without complete loss of function, reducing the severity of phenotypic outcomes. Such buffering enhances evolutionary resilience by maintaining essential outputs despite genetic variation. Disruptions in gene clusters have profound disease implications, often leading to imbalances that underlie genetic disorders. In , deletions or mutations within the α- or β- clusters cause unequal globin chain synthesis, resulting in ineffective and ; for example, deletion of both α-globin genes from one cluster produces a carrier state, while affecting both clusters causes severe α-thalassemia. In the , the MHC gene cluster on exhibits extreme polymorphism to diversify , but this variability imposes evolutionary trade-offs: heightened pathogen resistance comes at the cost of increased risk, as mismatched MHC alleles can trigger self-reactive T-cell responses. These trade-offs reflect the balance between broad immune adaptability and the potential for pathological over-reactivity. Systems biology models highlight quantitative benefits of clustering, demonstrating reduced stochastic noise in expression levels compared to dispersed genes, which enhances overall in coordinated . In a study of metabolic pathways, chromosomal clustering synchronized production, lowering the in expression ratios and reducing toxic intermediate accumulation by up to several-fold, thereby improving cellular fitness by approximately 6% under selective conditions. This ensures more reliable stoichiometric balance, providing a selective for pathway and developmental across organisms.

Examples Across Organisms

In animals, clusters exemplify conserved genomic organization for developmental patterning. These clusters, typically consisting of 8–13 paralogous genes arranged in tandem, specify regional identity along the anterior-posterior body axis during embryogenesis, with collinear expression mirroring their chromosomal order. In vertebrates, two rounds of whole-genome duplication (2R-WGD) at the base of the lineage expanded the ancestral single cluster into four paralogous sets—HoxA, HoxB, HoxC, and HoxD in humans—retained in tetrapods for enhanced regulatory complexity in axial patterning. This duplication pattern underscores how gene clustering facilitates coordinated expression essential for morphological diversity across bilaterian animals. Mammalian globin gene clusters illustrate eukaryotic tandem arrays adapted for physiological function. The alpha-globin cluster on and beta-globin cluster on in humans each contain multiple paralogs encoding subunits critical for oxygen transport in erythrocytes. These clusters feature locus control regions (LCRs)—such as the beta-LCR upstream of the HBB gene and HS-40 for alpha—that orchestrate high-level, tissue-specific expression by interacting with enhancers and promoters during . The developmental switching between embryonic, fetal, and adult globins within these clusters ensures efficient oxygen delivery across life stages. A prominent prokaryotic example is the in , a tightly linked gene cluster for . This comprises five structural genes—trpE, trpD, trpC, trpB, and trpA—encoding enzymes that catalyze the conversion of chorismate to , an essential nutrient under limiting conditions. Regulated by repression and mechanisms responsive to levels, the exemplifies how bacterial clusters enable efficient, coordinated production of components. In , cytochrome P450 (CYP) gene clusters drive the synthesis of secondary metabolites for defense. These often arise from tandem duplications, forming arrays of up to 15 CYP genes that facilitate multi-step oxidations in biosynthetic pathways. For instance, in (), a compact cluster including CYP79A1 (cytochrome P450 79A1) and encodes enzymes that convert to p-hydroxyphenylacetaldoxime and then to dhurrin, a cyanogenic released upon damage to produce toxic . Similarly, in (Zea mays), four tandem CYP71C paralogs catalyze sequential hydroxylations in the of 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-one (DIMBOA), a benzoxazinoid phytoalexin that deters and pathogens by inhibiting their feeding and growth. Such clusters highlight ' reliance on genomic proximity for evolving chemical defenses against biotic stresses.

References

  1. [1]
    The gentle art of gene arrangement: the meaning of gene clusters
    Feb 22, 2002 · Genes that are imprinted may also be tightly clustered, one of the best examples being the Igf2 group of loci; in this case, clustering might ...
  2. [2]
    Minimum Information about a Biosynthetic Gene cluster - Nature
    Aug 18, 2015 · A wide variety of enzymatic pathways that produce specialized metabolites in bacteria, fungi and plants are known to be encoded in biosynthetic gene clusters.
  3. [3]
    Seeking Sense in the Hox Gene Cluster - PMC
    Nov 15, 2022 · The Hox gene cluster, responsible for patterning of the head–tail axis, is an ancestral feature of all bilaterally symmetrical animals (the Bilateria)
  4. [4]
    Globin Gene - an overview | ScienceDirect Topics
    Globin genes refer to the genes that encode globin chains, which are essential components of hemoglobin and are located on two distinct chromosomes.
  5. [5]
    Gene Clusters, Molecular Evolution and Disease: A Speculation - PMC
    In summary, clustering of co-expressed genes occurs with each species and across species [12, 21-23].
  6. [6]
    Gene Clusters Reveal Fundamental Principles of Genome Folding ...
    Oct 1, 2025 · Gene clusters generate proteome diversity required for cell fate and function. Given their genomic organization, wherein tandemly arranged ...
  7. [7]
    Gene Cluster - an overview | ScienceDirect Topics
    A gene cluster is defined as a group of functionally related genes that are located close together in the genome, often sharing a single regulatory sequence ...4.12. 4 Identifying Shared... · 4.12. 4.3 Clustering Methods · 4.12. 4.3. 2 Partitioning...
  8. [8]
    Gene Clusters for Secondary Metabolic Pathways - NIH
    Gene clusters containing nonhomologous functionally related genes are common in bacterial genomes. Most are organized as operons, in which the different genes ...
  9. [9]
  10. [10]
    Comparison of gene clustering criteria reveals intrinsic uncertainty in ...
    Oct 30, 2023 · We studied the implications of adopting homology, orthology, or synteny conservation as formal criteria for gene clustering by performing comparative analyses ...
  11. [11]
    Conditions for the Evolution of Gene Clusters in Bacterial Genomes
    Feb 12, 2010 · Genes encoding proteins in a common pathway are often found near each other along bacterial chromosomes. Several explanations have been proposed ...
  12. [12]
    Genome-wide identification of physically clustered genes suggests ...
    Co-regulation of physically linked genes may be mediated by common regulatory elements or by a shared chromatin environment due to localized changes in ...
  13. [13]
    Genomic structure and function in the MHC - PubMed
    The major histocompatibility complex (MHC) is a fascinating region of the human genome. More is known about this 4 Mb of DNA (0.1% of the genome) on the ...
  14. [14]
    Genome duplication and networks for evolutionary adaptation
    We show that duplicating genomes and their encoded gene regulatory networks increase signal output variation, leading to niche expansion and increased ...
  15. [15]
    Genome destabilization by homologous recombination in the germline
    Non-allelic homologous recombination (NAHR) can markedly alter genome architecture during gametogenesis by generating chromosomal rearrangements.
  16. [16]
    Mechanisms of structural chromosomal rearrangement formation
    Jun 14, 2022 · Most recurrent rearrangements are caused by a mechanism named Non-Allelic Homologous Recombination (NAHR) that occurs between Low Copy Repeats ( ...
  17. [17]
    The rainbow trout genome provides novel insights into evolution ...
    Apr 22, 2014 · The rainbow trout genome offers a unique opportunity to investigate the early evolutionary fate of a duplicated vertebrate genome.
  18. [18]
    Gene Loss and Evolutionary Rates Following Whole-Genome ...
    Teleost fishes provide the first unambiguous support for ancient whole-genome duplication in an animal lineage. Studies in yeast or plants have shown that the ...
  19. [19]
    Systematic Variation in the Pattern of Gene Paralog Retention ...
    Teleost fish underwent whole-genome duplication around 450 Ma followed by diploidization and loss of 80–85% of the duplicated genes.
  20. [20]
    Whole-genome duplicate gene retention analysis | Science
    Jun 26, 2020 · Gene duplication provides a source of new genes (9), and duplicate retention may lead to the development of specialized functional modules ...
  21. [21]
    Evidence of neofunctionalization after the duplication of the highly ...
    Jan 17, 2017 · The duplicates have suffered clear neofunctionalization, with the action of strong purifying selection on the ancestral copy and of positive ...
  22. [22]
    Regulatory mechanisms ensuring coordinated expression of ... - NIH
    Coordinated spatiotemporal expression of large sets of genes is required for the development and homeostasis of organisms.
  23. [23]
    Genome-wide identification of physically clustered genes suggests ...
    Feb 8, 2017 · Such chromosomal clustering may confer a selective advantage as it enables coordinated gene regulation at the chromatin level. We studied ...
  24. [24]
    Evolution of Cis-Regulatory Elements and Regulatory Networks in ...
    We found that duplicated gene pairs vary greatly in their cis-regulatory element architecture, resulting in changes in regulatory network connectivity.Missing: post- | Show results with:post-
  25. [25]
    Cis-regulatory landscapes in the evolution and development of the ...
    May 15, 2023 · In case the whole enhancer-gene sequence was duplicated, the original and new regulatory elements may evolve to drive differential expression of ...
  26. [26]
    Transcriptional Regulation and Implications for Controlling Hox ...
    With respect to the Hox clusters, these approaches will be relevant for understanding how shared enhancers can coordinately regulate multiple genes and why some ...
  27. [27]
    Multigenome analysis implicates miniature inverted-repeat ... - PNAS
    Jun 25, 2018 · Thus, TE-mediated recombination may contribute to plant biosynthetic gene cluster formation. TEs may also facilitate establishment of regulons.
  28. [28]
    Transposable elements: multifunctional players in the plant genome
    TEs play a key role in genome restructuring, gene loss, and maintenance of genomic balance after duplication events.
  29. [29]
    Characterization of the transposable element landscape shaping the ...
    Sep 29, 2025 · Transposable elements (TEs) are mobile genetic units that display substantial mechanistic diversity and are near-ubiquitous across the tree of ...
  30. [30]
    Co-regulation of paralog genes in the three-dimensional chromatin ...
    These looping interactions can be measured by genome-wide chromatin conformation capture (Hi-C) experiments, which revealed self-interacting regions called ...
  31. [31]
    Landscape of cohesin-mediated chromatin loops in the ... - Nature
    Jul 29, 2020 · Loops frequently contain more than one gene, which could facilitate the co-regulation and co-expression of gene pairs. We tested ...
  32. [32]
    Chromatin gene-gene loops support the cross-regulation of genes ...
    Mar 7, 2024 · Conceptually, shared cis regulatory landscapes are much more likely for paralogous genes that remain in relatively close genomic proximity, e.g. ...
  33. [33]
    Evolution of arginine deiminase (ADI) pathway genes - ScienceDirect
    Among the first, the molarity model suggests that gene clustering results in a beneficially high local concentration of proteins. An extension of this model ...
  34. [34]
    The Operon as a Conundrum of Gene Dynamics and Biochemical ...
    Apr 21, 2023 · The “Fisher model” proposes that gene clusters result from co-adaptation. The physical proximity of co-adapted genes in the genome reduces ...
  35. [35]
    Regulatory mechanisms ensuring coordinated expression of ...
    Aug 5, 2025 · By combining gene-editing tools with computational modeling, recent studies tested the advantages of adjacent genes located in pairs and ...
  36. [36]
    Genetics, Inducible Operon - StatPearls - NCBI Bookshelf
    Oct 17, 2022 · An inducible operon is one whose expression increases quantitatively in response to an enhancer, an inducer, or a positive regulator.
  37. [37]
    Phages carry interbacterial weapons encoded by biosynthetic gene ...
    Aug 23, 2021 · For example, BGCs encode polyketide synthases (PKS BGCs) and non-ribosomal peptide synthetases (NRPSs), which are extremely large (10–100 kb), ...
  38. [38]
    Diversity of Bacterial Secondary Metabolite Biosynthetic Gene ...
    Dec 29, 2022 · Clusters predicted to encode fatty acid synthases and polyketide synthases (PKS) were the most dominant BGC types, followed by clusters ...
  39. [39]
    Enrichment of horizontally transferred gene clusters in bacterial ...
    Aug 29, 2025 · Horizontal gene transfer (HGT), a fundamental mechanism of microbial evolution [1], enables bacterial cells to acquire novel phenotypic ...
  40. [40]
    Horizontal Gene Transfer: From Evolutionary Flexibility to Disease ...
    May 19, 2020 · Horizontal gene transfer (HGT) or lateral gene transfer (LGT) as a general mechanism leads to biodiversity and biological innovations in nature.
  41. [41]
    Metabolic Gene Clusters in Eukaryotes | Annual Reviews
    Nov 23, 2018 · Here we review metabolic gene clusters from fungi and plants, highlight commonalities and differences, and consider how these clusters form and ...
  42. [42]
    Gene Clustering in Eukaryotes - Wiley Online Library
    Oct 18, 2013 · Key Concepts: Albeit not as common as the gene clustering within operons observed in bacteria, linear gene clustering does occur in eukaryotes.
  43. [43]
    The eukaryotic genome: a system regulated at different hierarchical ...
    Oct 15, 2003 · A limited number of gene clusters have been studied in detail - for example, the α-globin genes, β-globin genes, histone genes and Hox genes.
  44. [44]
    Structure and in Vitro Transcription of Human Globin Genes - Science
    In both clusters, each of the genes is transcribed from the same DNA strand, and the genes are arranged in the order of their expressions during development.
  45. [45]
    A standardized nomenclature for mammalian histone genes
    Oct 1, 2022 · In the human genome the largest cluster is on chromosome 6 and contains more than 60 genes, and the second cluster on chromosome 1 contains 10– ...
  46. [46]
    Tandemly Arrayed Genes in Vertebrate Genomes - PMC
    Tandemly arrayed genes (TAGs) are duplicated genes that are linked as neighbors on a chromosome, many of which have important physiological and biochemical ...
  47. [47]
    Dinoflagellate tandem array gene transcripts are highly conserved ...
    Sep 10, 2012 · We also have used RNA-Seq to assess the degree of sequence conservation in tandem array genes and found their transcripts to be highly conserved ...
  48. [48]
    Lessons from the post-genomic era: Globin diversity beyond oxygen ...
    The predominant adult forms of Hb display a heterotetrameric α2β2 structure, composed of two α-subunits (Hbα, HBA gene) and two β-subunits (Hbβ, HBB gene), ...
  49. [49]
    The Molecular Basis of α-Thalassemia - PMC - NIH
    There are currently approximately 50 deletions from the α-globin cluster that either completely or partially delete both α-globin genes, and consequently no α- ...
  50. [50]
    Coordinated expression of replication-dependent histone genes ...
    How all five RD histone genes within these clusters are coordinately regulated such that neither too few nor too many histones are produced, a process referred ...
  51. [51]
    A Histone Cycle | IntechOpen
    Feb 20, 2013 · Expression of all canonical histones must be stoichiometric and several studies show that an imbalance between the different histone subtypes ...
  52. [52]
    The regulatory landscapes of developmental genes | Development
    Feb 3, 2020 · Regulatory landscapes have been defined in vertebrates as large DNA segments containing diverse enhancer sequences that produce coherent gene transcription.ABSTRACT · Introduction · Regulatory landscapes · single enhancer dominates...
  53. [53]
    Evolutionary trade-offs constraining the MHC gene expansion
    The immune system is as much shaped by the pressure of pathogens as it is by evolutionary trade-offs that constrain its structure and function.
  54. [54]
    Evolutionary trade-offs constraining the MHC gene expansion
    The immune system is as much shaped by the pressure of pathogens as it is by evolutionary trade-offs that constrain its structure and function.
  55. [55]
    Synchronization of stochastic expressions drives the clustering of ...
    Oct 9, 2019 · In this study, we propose that the benefit of chromosomal clustering of functionally related genes is to reduce the stochastic fluctuations of ...Results · The Yeast Gal Cluster As A... · Materials And Methods
  56. [56]
    Hox Genes and Regional Patterning of the Vertebrate Body Plan
    Here, we review recent novel insight into the modalities of Hox protein function in imparting specific identity to anatomical regions of the vertebral column.Hox Genes And Regional... · A Function For Hox And... · Hox Genes Of The Central...
  57. [57]
    Evolution of Hoxgene clusters in deuterostomes
    Jul 2, 2013 · Following the 2R-WGD at the base of the vertebrates, tetrapods retained four clusters, whereas teleost fishes expanded to seven or eight ...
  58. [58]
    HOX-Gene Cluster Organization and Genome Duplications ... - MDPI
    The organization of Hox clusters in vertebrates is a result of several genome duplications: two rounds of duplication in the ancestors of all vertebrates and a ...
  59. [59]
    Mechanisms of Globin Gene Regulation in Mammals - PMC - NIH
    Sep 23, 2025 · Studies of globin gene clusters have established many paradigms of gene regulation. This review focuses on the alpha- and beta-globin gene ...
  60. [60]
    [PDF] The Normal Structure and Regulation of Human Globin Gene Clusters
    Globin gene promoters fall into the category of promoters with well-defined TATA boxes at a restricted location and one major start site for transcription.
  61. [61]
    Locus control regions of mammalian beta-globin gene clusters
    This review brings together the large amount of DNA sequence data from the beta-globin LCR with the vast amount of functional data obtained through the use of ...Missing: alpha oxygen transport tissue-
  62. [62]
    Attenuation in the control of expression of bacterial operons - Nature
    Feb 26, 1981 · Attenuation in the control of expression of bacterial operons. Charles Yanofsky. Nature volume 289, pages 751–758 (1981)Cite this article.
  63. [63]
    The complete nucleotide sequence of the tryptophan operon of ...
    In this summary report we present the complete nucleotide sequence for the five structural genes of the trp operon of E. coli together with the internal and ...<|control11|><|separator|>
  64. [64]
    Cytochromes P450: a success story | Genome Biology | Full Text
    Dec 8, 2000 · In plants, chemical defense seems to be a major reason for P450 diversification. ... These gene clusters can contain up to 15 P450 genes, the ...
  65. [65]
    The biosynthetic gene cluster for the cyanogenic glucoside dhurrin ...
    Nov 14, 2016 · Here we report that the gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor additionally contains a gene, SbMATE2, encoding a transporter.
  66. [66]
  67. [67]
    Cytochrome P450 Gene Families: Role in Plant Secondary ... - PMC
    Jul 25, 2023 · The CYP gene families have diverse roles in the biosynthesis of metabolic compounds and promote the plant's development and defense (Table 1).