Fact-checked by Grok 2 weeks ago

Chromosome conformation capture

Chromosome conformation capture () is a technique designed to map the three-dimensional () organization of chromosomes within the by quantifying the frequency of physical interactions between specific genomic loci. Introduced in , the method relies on crosslinking of to preserve spatial proximities, followed by digestion, intramolecular ligation of nearby fragments, and () amplification to detect and measure interaction frequencies, thereby generating a contact matrix that reveals folding patterns. Subsequent advancements have expanded 3C into a family of high-throughput variants, including 4C (circular chromosome conformation capture), which profiles interactions from a single bait locus to all others genome-wide; 5C ( chromosome conformation capture), enabling multiplexed analysis of interactions among multiple predefined loci; and Hi-C, a genome-wide all-to-all approach that uses deep sequencing to produce comprehensive contact maps at kilobase resolution or better. These methods have evolved to incorporate nucleosome-level resolution (e.g., Micro-C using micrococcal ) and multiway capture, addressing limitations like enzymatic biases and noise in earlier protocols. The primary applications of 3C-based techniques lie in elucidating how architecture influences gene regulation, revealing structures such as topologically associating domains (TADs), enhancer-promoter loops, and A/B compartments that segregate active and inactive genomic regions. Disruptions in these 3D configurations, often detected via , are implicated in developmental disorders, cancers, and other diseases by altering long-range regulatory interactions. By integrating with complementary approaches like and multi-omics, 3C methods continue to provide insights into dynamic changes during processes such as cell differentiation and , underscoring the functional importance of spatial organization.

Fundamentals

Chromatin organization basics

Chromatin consists of DNA packaged with histone and non-histone proteins into a hierarchical structure that compacts the genome while allowing access for cellular processes. The fundamental unit is the nucleosome, formed by approximately 147 base pairs of DNA wrapped around an octamer of core histone proteins (H2A, H2B, H3, and H4), with linker DNA connecting adjacent nucleosomes and often bound by histone H1. These nucleosomes further fold into higher-order structures, including chromatin loops, topologically associating domains (TADs), and large-scale compartments (A and B). TADs are megabase-scale regions of frequent self-interaction, typically spanning 100 kilobases to 1 megabase, that insulate regulatory interactions and are stabilized by proteins like CTCF and cohesin. A compartments represent transcriptionally active, gene-rich euchromatin, while B compartments correspond to inactive, gene-poor heterochromatic regions. The organization of within the is essential for regulating key genomic functions, including transcription, , and . Spatial proximity enables long-range interactions, such as enhancer-promoter looping, which can span distances up to several megabases and facilitate precise activation by bringing distant regulatory elements into contact. This architecture also influences replication timing, with early-replicating domains often enriched in open , and supports efficient by clustering damage sites or juxtaposing repair factors. Disruptions in structure, such as altered looping, have been linked to dysregulation of these processes, underscoring its functional significance. Individual chromosomes occupy distinct territories in the , a non-random radial arrangement first visualized through (FISH) in the 1980s. Pioneering studies by Thomas Cremer and colleagues demonstrated that human chromosomes maintain separate territories, with gene-poor regions positioned toward the nuclear periphery and gene-rich regions more centrally, influencing transcriptional activity. This territorial organization, building on earlier Rabl and Boveri models from the late , laid the groundwork for molecular investigations into how folding modulates function. Techniques such as chromosome conformation capture have since enabled quantitative mapping of these spatial interactions at high .

Principles of 3C technology

Chromosome conformation capture (3C) technology relies on the biochemical preservation of spatial interactions within the fiber to infer three-dimensional organization. The method captures pairwise interactions between genomic loci by fixing proteins and DNA in their configurations, followed by enzymatic processing to generate quantifiable ligation products. This approach assumes that the frequency of detected interactions corresponds to the likelihood of physical proximity in the , enabling quantitative mapping of chromatin contacts. The molecular mechanism begins with formaldehyde crosslinking, which covalently links proteins to DNA and proteins to each other, stabilizing transient chromatin interactions. Typically, cells are treated with 1-2% for 10 minutes at to achieve efficient fixation without excessive damage to DNA structure; this concentration and duration balance crosslinking yield with downstream enzymatic accessibility. The crosslinked chromatin is then lysed, and a , such as , which recognizes 4-6 sequences, is used to digest the DNA into fragments. Digestion occurs under conditions that ensure complete cleavage, often overnight at 37°C, generating sticky ends at restriction sites. Subsequently, under dilute conditions to favor intramolecular , the ends of nearby fragments—brought into proximity by crosslinking—are joined by , forming chimeric DNA molecules that join sequences originally distant in the linear but close in three-dimensional space. Finally, crosslinks are reversed through digestion and heat treatment (e.g., 65°C for several hours), purifying the ligation products for analysis, typically by to quantify specific junctions. A central principle of 3C is that the frequency of products between two loci directly reflects their spatial proximity , as crosslinking probability increases with physical closeness, and efficiency is enhanced for juxtaposed fragments. This frequency decays with genomic distance s along the , often following a power-law relationship in models of , where the interaction probability P(s) scales as s^{-1}, indicating a basic decay consistent with or equilibrium globule configurations. P(s) \propto s^{-1}

Historical Development

Invention of 3C

Chromosome conformation capture (3C) was developed by Job Dekker, Karsten Rippe, Martijn Dekker, and Nancy Kleckner in 2002 while working in Nancy Kleckner's laboratory at Harvard University. The method was introduced in a seminal paper published in Science, where it was applied to investigate long-range chromatin interactions at the yeast mating-type switching locus on chromosome III. This locus was chosen to explore spatial organization during meiosis, revealing how distant genomic regions come into proximity to facilitate double-strand break formation and recombination. The technique's core innovation involved formaldehyde crosslinking to preserve transient chromatin contacts in intact nuclei, followed by restriction digestion, ligation, and PCR-based detection of proximity events. The development of 3C was motivated by the limitations of prior methods like (FISH), which provided qualitative snapshots of conformation in individual s but struggled to quantify average interaction frequencies across cell populations. FISH, while valuable for visualizing spatial arrangements, was low-throughput and prone to variability due to its reliance on fixed cell imaging, making it inadequate for statistical analysis of dynamic, long-range interactions. 3C addressed these challenges by enabling quantitative measurement of interaction frequencies between specific loci in large populations of cells, offering a population-averaged view of architecture that was both reproducible and scalable for targeted studies. This approach thus provided a biochemical means to infer three-dimensional folding from efficiencies, where higher contact frequencies indicated closer spatial proximity. A key application of 3C shortly after its invention demonstrated enhancer-promoter looping at the murine β-globin locus, as reported by Tolhuis et al. in late 2002. Using 3C, they showed frequent interactions between the locus control region (an enhancer) and the β-major globin promoter in erythroid cells, confirming a looping model for transcriptional activation that brought regulatory elements into close contact despite their linear separation. This finding validated 3C's utility in mammalian systems and highlighted its ability to capture functional loops driving . The "one-vs-one" design of 3C distinguished it as a targeted method, employing custom primers to query interactions between two predefined genomic regions of interest, rather than surveying the entire . This focused strategy allowed precise quantification of specific contacts, such as those between regulatory elements, while minimizing bias from unbiased sequencing approaches that would emerge later. By normalizing interaction frequencies against a control locus, 3C provided reliable data on relative proximities, establishing a foundation for understanding how folding influences nuclear processes like regulation and .

Progression to genome-wide methods

Following the invention of chromosome conformation capture (3C), which focused on targeted pairwise interactions, subsequent methods expanded to genome-wide scales by increasing throughput and reducing bias. In , the technique was developed to profile interactions between one specific genomic locus and all others across the , employing circularization of ligation products followed by inverse amplification for unbiased detection. Also in , the 5C method enabled mapping of interactions between large sets of predefined genomic regions (many-versus-many), utilizing ligation-mediated amplification of multiplexed primers for high-throughput via microarrays or sequencing. A pivotal advancement occurred in with the introduction of , which achieved comprehensive all-versus-all contact profiling across the entire through proximity-based of crosslinked fragments, followed by adaptor attachment and sequencing on Illumina platforms. This approach generated the first genome-wide contact maps in and lines, revealing plaid-like patterns that signified large-scale compartments characterized by self-associating active and inactive domains. The transition from PCR-centric detection in and 5C to biotinylated adaptor and deep sequencing in marked a key innovation, dramatically improving scalability and enabling unbiased capture of billions of interactions without prior locus selection. Building on these foundations, the saw evolve further through refined , culminating in the identification of topologically associating domains (TADs) as stable, megabase-scale units with enriched intra-domain contacts, first systematically characterized in mammalian genomes in 2012.

Standard Methods

3C, 4C, and 5C

The chromosome conformation capture (3C) method captures pairwise interactions between specific genomic loci through a series of biochemical steps that preserve and detect spatial proximity. Cells or nuclei are first fixed with 1-2% formaldehyde for 5-10 minutes at room temperature to covalently cross-link proteins to DNA, stabilizing chromatin interactions in their native three-dimensional configuration. The cross-linked chromatin is then permeabilized and digested overnight with a restriction enzyme, typically a 6-cutter such as HindIII or a 4-cutter like MboI for finer resolution, generating sticky-end fragments averaging 1-4 kb in length. To promote ligation of spatially proximal fragments, the ends are filled in with dNTPs and Klenow polymerase to create blunt ends, followed by intra-molecular ligation under highly dilute conditions (e.g., 10-20 ng/μL DNA) using T4 DNA ligase at 16°C overnight, favoring chimeric molecule formation over random intermolecular joins. Cross-links are reversed by proteinase K digestion at 65°C, and the DNA is purified via phenol-chloroform extraction and ethanol precipitation, yielding a 3C library enriched for ligation junctions. Specific interactions are quantified using TaqMan quantitative PCR with primers flanking the expected ligation sites for each locus pair, where signal intensity reflects interaction frequency normalized against a control such as the BAC clone containing both loci. Building on 3C, the (circular chromosome conformation capture) method enables unbiased genome-wide profiling of all interactions involving a single "viewpoint" locus, revealing its contact landscape. The protocol initiates with the core 3C steps— fixation, primary restriction (e.g., with ), and blunt-end under dilute conditions—to generate the initial interaction library. A secondary is then performed with a frequent-cutting like DpnII (4-cutter, ~256 fragments) to trim ligated products into smaller pieces, followed by a second step that circularizes these fragments at low DNA concentration (~0.3 ng/μL), enriching for closed loops containing the viewpoint. Inverse amplification is carried out from the viewpoint using one primer internal to the viewpoint fragment (facing outward toward potential partners) and a second primer in the adjacent known sequence, producing linear fragments of all interacting regions. Amplified products are labeled (e.g., with Cy5) and hybridized to genome-wide microarrays or, in later variants, deep-sequenced to map contacts at ~1 kb resolution, with normalization to random controls. The 5C (chromosome conformation capture carbon copy) technique scales up to multiplexed detection of interactions among hundreds to thousands of predefined fragments within targeted genomic regions, such as multi-megabase domains. It starts with 3C library preparation, using formaldehyde cross-linking of 10^7-10^8 cells, digestion preferably with a 4-cutter enzyme like NlaIII for ~300 bp fragments to enhance resolution, and blunt-end ligation in dilute solution to capture proximities. Pooled bait-prey primer sets (e.g., 300-500 pairs) are designed with universal tails: bait primers (forward, T7-tailed) anneal to one end of target fragments, and phosphorylated prey primers (reverse, T3-tailed) to the opposite ends of potential partners, at concentrations of 2-5 nM each. These are hybridized to 1-5 μg of 3C DNA at 48°C overnight, then ligated with Taq DNA ligase to create specific nested products only for true interactions. Universal-tailed PCR (30-35 cycles) pools and amplifies all ligation products in 96- or 384-well format, followed by pooling, purification, and quantification via custom microarrays (probing ~40-mer sequences) or 454 sequencing (~10^5 reads per library), enabling interaction matrices with sub-kb precision. These targeted methods—3C for pairwise queries, for one-to-all scans, and 5C for many-to-many networks—provide ~1 resolution dictated by restriction fragment sizes, making them ideal for hypothesis-driven investigations of specific regulatory elements or domains rather than unbiased genome-wide mapping. Experiments typically start with 10^7 cells and yield 10^6-10^7 unique chimeric molecules post-ligation, sufficient for thousands of reactions or array hybridizations while minimizing background noise from random ligations.

Hi-C protocol

The Hi-C protocol, introduced in , enables genome-wide, unbiased detection of interactions by adapting chromosome conformation capture principles to high-throughput sequencing, producing comprehensive maps of pairwise contacts across the entire . This method builds briefly on prior targeted approaches like 3C, , and 5C by scaling to all-vs-all interactions without predefined bait regions. It typically requires approximately 10^7 cells for standard applications in mammalian systems, though optimizations allow scaling to fewer cells for specific uses. The workflow emphasizes proximity-based under dilute conditions to minimize artifacts such as random collisions or unligated fragments. The protocol commences with formaldehyde crosslinking of intact cells to preserve three-dimensional chromatin structure, typically at 1-2% formaldehyde for 10 minutes at , followed by quenching with . Cells are lysed in a containing detergents like Igepal CA-630 or , and nuclei are isolated by . The is then digested overnight at 37°C with a ; the original protocol used (a 6-bp cutter producing fragments averaging several kb), while later optimizations employ 4-bp cutters such as DpnII (GATC) or MboI for denser fragmentation and higher resolution of ~1-5 kb. Digestion efficiency is verified by , aiming for near-complete cleavage to avoid uncut DNA artifacts. Post-digestion, the overhangs are filled in with biotin-14-dCTP (along with dATP, dGTP, and dTTP) using at 37°C for 45 minutes, incorporating specifically at ligation junctions for subsequent enrichment. The ends are blunted if necessary, and proximity is performed under highly dilute conditions (e.g., ~1-5 nM DNA concentration) with T4 at 16°C for 4-6 hours, promoting intramolecular joins between spatially close fragments while suppressing random intermolecular ligations that could introduce noise from unligated or uncut DNA. Crosslinks are reversed by digestion at 65°C overnight, followed by DNA purification via phenol-chloroform extraction and . The purified DNA is sheared to 300-500 bp fragments using (e.g., via Covaris), then end-repaired, A-tailed, and size-selected on gels to enrich for ligated products. Biotinylated junctions are captured on beads, washed stringently, and the bound DNA is adapter-ligated for library preparation before amplification (typically 10-15 cycles to avoid bias). Paired-end sequencing on platforms like Illumina is performed, with a depth of 100-400 million reads recommended for mammalian genomes to achieve sufficient coverage for population-level contact maps at megabase to kilobase resolutions. Common troubleshooting focuses on restriction fragment biases, where varying enzyme efficiencies due to sequence context (e.g., GC content or methylation) can skew contact frequencies; this is mitigated by using methylation-insensitive enzymes like DpnII and confirming uniform digestion via quality control assays such as PCR across multiple loci. Dilution during ligation remains essential to control artifacts from uncut or undigested DNA, with ligation efficiency assessed by gel analysis showing a shift to higher-molecular-weight bands.

Advanced Variants

Single-cell and low-input techniques

Single-cell and low-input techniques in chromosome conformation capture address the limitations of bulk methods by enabling analysis of chromatin interactions in cells or small numbers of starting cells, thereby uncovering cell-to-cell heterogeneity that is averaged out in population-level data. These adaptations are essential for studying dynamic processes in heterogeneous tissues, such as embryonic , where cell states vary significantly. By reducing input requirements to as few as one , these methods facilitate high-resolution profiling in rare or precious samples without the need for millions of cells typical in standard Hi-C protocols. A foundational method is single-cell Hi-C (scHi-C), introduced in 2013, which applies combinatorial indexing to generate sparse contact maps from individual mammalian cells. This technique builds on by processing single nuclei to capture long-range interactions while assigning unique barcodes to each cell for demultiplexing. Another key approach is Dip-C, a single-cell conformation capture method that integrates transposon-based amplification to reconstruct diploid 3D genome structures with resolution, applied to human cells to reveal stochastic folding patterns. Protocol highlights include microfluidics-based isolation of individual nuclei to ensure high purity and viability, followed by chromatin digestion and under dilute conditions to favor intra-molecular contacts. Split-pool barcoding then enables , with methods like sciHi-C allowing simultaneous of up to 10,000 cells through iterative rounds of combinatorial labeling, substantially increasing throughput compared to early scHi-C implementations that handled dozens of cells. enrichment and paired-end sequencing follow to recover ligation products, though overall recovery remains challenging due to the sparsity of single-cell data. These techniques achieve a typical resolution of approximately 100 , limited by the low signal yield of about 0.25–1% of possible contacts per , which captures domain organization but misses fine-scale loops in many cases. Analysis of scHi-C data has revealed substantial cell-to-cell variability in topologically associating domains (TADs), with individual cells showing dynamic inter-TAD contacts and compartmentalization despite conserved overall folding principles. In applications to embryos, single-nucleus Hi-C in has profiled chromatin folding during development, highlighting stochastic domain boundaries and their role in gene activation timing. Challenges persist, including high technical noise from random events and low ligation efficiency, often resulting in only ~1% capture rate of true interactions, necessitating advanced and imputation strategies for reliable interpretation.

Capture and immunoprecipitation-based methods

Capture and immunoprecipitation-based methods enrich for specific interactions by targeting sequence features or protein-binding sites, thereby improving the detection of rare or low-frequency contacts that are challenging to resolve in unbiased approaches like . These techniques integrate elements of chromosome conformation capture with hybridization capture or antibody-based pull-down, enhancing signal-to-noise ratios and enabling focused analysis of regulatory elements such as enhancers and promoters. By prioritizing interactions mediated by particular proteins or genomic loci, they facilitate high-resolution mapping of functional architectures. Sequence capture methods utilize targeted hybridization to enrich Hi-C libraries for predefined genomic regions, allowing efficient interrogation of specific interactions across the genome. Capture-C, introduced in 2016, employs oligonucleotide probes on microarrays or in solution to hybridize and capture ligation junctions involving bait regions of interest after initial 3C library preparation, enabling multiplexed analysis of multiple loci with high sensitivity from limited cell numbers. This approach has been applied to study promoter-enhancer interactions in human cells, revealing conformational changes associated with gene regulation. Similarly, HiChIP, developed in 2016, combines Hi-C with chromatin immunoprecipitation (ChIP) using antibodies against proteins like H3K27ac or cohesin, followed by biotinylation of ligation junctions and streptavidin pull-down to enrich for protein-mediated contacts. HiChIP achieves approximately a 10-fold increase in the yield of informative reads compared to standard Hi-C, making it particularly useful for mapping enhancer-promoter networks in primary cell types such as T helper cells. Both methods reduce sequencing depth requirements while preserving genome-wide context, though Capture-C is more locus-specific and HiChIP emphasizes protein-directed interactions. Immunoprecipitation-based variants directly target protein-DNA complexes to isolate chromatin loops anchored by transcription factors or architectural proteins. ChIA-PET, established in 2009, performs against factors like or prior to proximity ligation and paired-end sequencing, generating maps of long-range interactions bound by specific proteins across the . This method has elucidated the interactome of in cells, identifying thousands of enhancer-promoter contacts that drive hormone-responsive . Building on this, PLAC-seq, introduced in 2016, incorporates in situ proximity ligation before and biotin-streptavidin enrichment, optimizing for high-efficiency capture of loops mediated by proteins such as . PLAC-seq demonstrates superior sensitivity over earlier immunoprecipitation methods, detecting CTCF-anchored loops at kilobase resolution with reduced background noise, and has been instrumental in characterizing domain boundaries in mouse embryonic stem cells. These techniques excel in dissecting the functional roles of specific proteins in 3D genome organization, such as loop extrusion by cohesin-CTCF complexes. ChIP-loop represents an early extension of 3C principles to protein-targeted analysis, integrating to validate interactions between specific genomic elements bound by a protein of interest. First described in using MeCP2 in a model, ChIP-loop performs cross-linking, digestion, and ligation as in 3C, followed by to enrich for loops involving the target protein, such as silent domains regulating imprinted genes like DLX5. Genome-wide adaptations of ChIP-loop, often through sequencing, extend this to broader interactomes, providing a framework for integrating protein occupancy with structure to uncover disease-associated conformational defects. Overall, these methods have revolutionized the study of targeted dynamics, offering enriched datasets that link protein binding to regulatory outcomes like enhancer .

Emerging high-resolution methods

Recent advancements in chromosome conformation capture (3C) technologies have focused on achieving higher to map interactions at the scale, overcoming limitations of standard methods that typically resolve structures at kilobase levels. These emerging techniques, developed primarily after 2020, employ refined enzymatic digestions, optimized crosslinking, and innovative barcoding strategies to enhance signal-to-noise ratios and capture multi-way interactions. Micro-C represents a pivotal high-resolution variant, utilizing micrococcal (MNase) to digest at boundaries, enabling mapping of interactions at approximately 100-200 resolution. Introduced in 2015 and upgraded through Micro-C XL in subsequent years, this method incorporates long crosslinkers like disuccinimidyl glutarate (DSG) combined with , along with insoluble isolation to reduce noise and improve detection of fine-scale features such as sub-topologically associating domains (sub-TADs) and enhancer-promoter loops. Micro-C XL has revealed -level organization, including periodic patterns in contact frequencies that reflect positioning and modifications. In plant systems, Micro-C XL applied to has uncovered enhancer-promoter interactions and three-dimensional folding at unprecedented detail, highlighting evolutionary conservation of regulatory elements. Hi-C 3.0, refined in 2023, introduces protocol optimizations such as dual crosslinkers ( and DSG) and dual restriction enzymes (e.g., DpnII and DdeI) to generate more reliable interaction signals and higher valid contact rates exceeding 50%. This upgrade doubles the detection of loops compared to earlier versions and improves compartment identification, particularly in challenging tissues with rigid walls and metabolites. In ( spp.), 3.0 has been adapted for high-throughput analysis during , elucidating dynamic interactions that underpin developmental transitions and regulation. These enhancements facilitate the study of sub-TAD structures and provide clearer insights into folding in non-model organisms. SPRITE (split-pool recognition of interactions by tag extension), established in , employs an iterative split-pool barcoding approach with religation to capture multi-way contacts, including higher-order hubs involving , , and proteins, without sequence bias. Unlike pairwise-focused methods, SPRITE maps ensemble-averaged three-dimensional nuclear organization, revealing inter-chromosomal interactions and nuclear body associations at genome-wide scales. Its ability to detect simultaneous multi-contact events has advanced understanding of hubs in . Targeted variants like Region Capture Micro-C integrate guides or biotinylated probes to enrich specific genomic regions, boosting throughput and resolution for focused studies of regulatory elements while maintaining nucleosome-level detail. In 2025, the Exo-C method combined chromosome conformation capture with to simultaneously detect structural variants and point mutations, leveraging interaction data to phase variants and identify disease-associated rearrangements with enhanced accuracy. These innovations collectively enable finer dissection of architecture, with advantages in revealing sub-TAD dynamics and integrating multi-omics for .

Data Analysis

Contact map generation and normalization

Contact map generation begins with the processing of paired-end sequencing reads obtained from Hi-C experiments, which capture junctions between fragments. After aligning reads to a using tools such as HiCUP, which handles mapping and initial artifact removal, valid pairs are identified by parsing the alignments to retain only those spanning ligation sites while discarding uncut fragments or non-proximal ligations. Pairs are then filtered to remove self-circles (intra-fragment ligations), random ligations (non-restriction-site joins), and PCR duplicates, typically retaining 1-5% of raw reads as valid contacts depending on library quality. These filtered pairs are binned into a square matrix representing the , where rows and columns correspond to genomic loci divided into fixed-size bins (e.g., 10 or 40 ), and each entry counts the number of read pairs connecting bin i to bin j. Raw contact maps are systematically biased by experimental and sequencing factors, including restriction enzyme efficiency (creating uneven fragment lengths), sequence mappability (repetitive regions align poorly), and (affecting fragmentation and amplification). These biases lead to uneven row and column sums in , confounding biological interpretation by inflating or depleting apparent interaction frequencies. Normalization aims to correct these by estimating and removing locus-specific visibility factors, assuming the expected contact probability decays smoothly with genomic distance in the absence of structure. One seminal approach is , introduced in , which iteratively adjusts the to equalize row and column sums, effectively removing restriction-site biases through a data-driven balancing process. This method models biases as multiplicative factors B_i and B_j for loci i and j, yielding a normalized contact frequency of C'_{ij} = \frac{C_{ij}}{B_i B_j}, where B_k represents the estimated coverage vector for locus k derived from iterative l1-norm minimization until convergence (typically 50-100 iterations). The ensures symmetry in the final , enabling downstream eigenvector of compartments. Knight-Ruiz (KR) matrix balancing offers a faster for large matrices, solving the doubly stochastic via Sinkhorn-like iterations with geometric means, converging in under 20 steps for typical Hi-C data and preserving sparsity. For explicit bias modeling, HiCNorm employs to jointly correct for , fragment length, and mappability as offsets in a , providing a parametric estimate of unbiased frequencies suitable for low-coverage datasets. Tools like cooltools integrate these methods (ICE, KR) into workflows for scalable processing, while HiCUP facilitates upstream filtering to enhance accuracy.

Detection of loops and domains

Detection of loops and domains in chromosome conformation capture (3C) data involves computational algorithms that analyze normalized contact frequency maps to identify topologically associating domains (TADs) and loops, which represent key structural features of the . TADs are self-interacting genomic regions typically spanning 100 to 1 , characterized by high intra-domain contact frequencies and relative insulation from neighboring regions, thereby organizing the into modular units that influence regulation. loops, often enhancer-promoter interactions, are mediated by architectural proteins such as and , forming point-to-point contacts that bring distant regulatory elements into proximity. Several methods have been developed to delineate TAD boundaries from contact maps. The insulation score, introduced by Crane et al., quantifies boundary strength by aggregating contacts in sliding windows along the diagonal of ; low scores indicate strong at TAD borders, allowing automated detection of edges. TADbit employs a detection based on penalized likelihood maximization under a model to segment chromosomes into optimal TADs, providing tools for modeling and visualization of these domains. For identifying loops, algorithms focus on detecting significant or peak interactions in contact maps. Fit-Hi-C assigns statistical confidence to intra-chromosomal contacts by modeling expected frequencies with non-parametric splines and negative binomial distributions, highlighting interactions that deviate from random polymer expectations. The Arrowhead algorithm, implemented in the Juicer toolkit, identifies loop apexes as triangular patterns of enriched contacts converging at specific genomic coordinates, facilitating the annotation of CTCF-anchored loops at high resolution. Brief motif analysis of loop anchors often reveals convergent CTCF binding sites, though detailed regulatory implications are explored elsewhere. Evaluation of detected features typically assesses the enrichment of interactions within TADs compared to inter-TAD contacts, confirming that intra-domain frequencies are significantly higher (often by 2- to 10-fold), which validates the modular organization and biological relevance of these structures.

Integration with other data

Chromosome conformation capture (3C) techniques, such as , are frequently integrated with epigenomic and transcriptomic datasets to provide a multidimensional view of genome organization and function. Overlaying contact maps with ChIP-seq data, for instance, allows researchers to correlate chromatin loops and topologically associating domains (TADs) with specific protein bindings, such as at loop anchors, revealing how architectural proteins enforce spatial proximity between regulatory elements. Similarly, combining with identifies overlaps between interactions and open regions, highlighting accessible loci involved in long-range , while integration with elucidates co-expression patterns linked to enhancer-promoter contacts, thereby mapping dynamic transcriptional landscapes. These approaches enable the of hotspots with functional epigenetic marks, facilitating a deeper understanding of how structure influences gene activity across cell types. Key resources and tools support this multi-omics integration, with the 4D Nucleome (4DN) project serving as a central hub for accessing harmonized datasets alongside complementary layers like ChIP-seq and from diverse tissues and conditions. The 4DN Data Portal provides standardized pipelines for data harmonization and visualization, enabling users to query interactions in the context of broader genomic features. For visualization, the software suite processes raw data into contact matrices and overlays them with external tracks such as ChIP-seq peaks or expression profiles, allowing interactive exploration of multi-layered genomic data. Recent advancements include automated pipelines like scCross, which unify single-cell with transcriptomic and epigenomic data for joint embedding and analysis, streamlining the identification of cell-type-specific regulatory networks. As of 2025, emerging methods, such as TRUHiC for enhancing sparse contact maps and benchmarks for differential analysis, further improve accuracy in detecting structural variations and integrating multi-omics data. Specific applications of this integration include correlating -derived loops with (eQTLs) to pinpoint genetic variants that modulate architecture and . For example, joint analyses across multiple tissues have shown that eQTLs often localize near loop boundaries, suggesting that variants disrupting these structures contribute to expression variability. In variant prioritization, recent chromatin-exome combinations leverage to assess how structural variants alter interaction profiles, such as creating novel TADs (neo-TADs) in genomic rearrangements, thereby guiding the functional interpretation of noncoding mutations. These correlations extend the detection of loops and domains by linking them to heritable regulatory effects. The primary benefits of such integrations lie in resolving causal relationships in gene regulation, where data distinguishes direct physical interactions from mere correlative associations observed in transcriptomic profiles alone. By overlaying spatial contacts with functional , researchers can infer mechanistic pathways, such as how enhancer via neo-TAD formation rewires regulatory circuits in rearranged genomes. This multi-omics framework enhances predictive power for regulatory outcomes, prioritizing variants with high-confidence structural impacts and advancing genome-wide association studies toward functional validation.

Applications

Gene regulation mechanisms

Chromosome conformation capture techniques, particularly , have elucidated how loops facilitate long-range interactions between enhancers and promoters, thereby driving . These loops physically juxtapose distal regulatory elements, enabling transcription factors and co-activators to coordinate activation of target genes. In human cells, data reveal approximately 10,000 such loops genome-wide, many of which connect active promoters to enhancers and correlate with increased transcriptional output.01497-4) Disruptions to these loops, such as those observed in the clusters, lead to altered patterns by preventing enhancer-promoter contacts. A central underlying loop formation is the CTCF-mediated loop extrusion model, where complexes actively extrude fibers to form until halted by CTCF-bound sites. In this process, loads onto DNA and reels in intervening bidirectionally, creating that stabilize enhancer-promoter interactions. CTCF acts as a directional barrier, with its binding motifs oriented to impede extrusion efficiently. analyses confirm that loop anchors are enriched for CTCF occupancy, supporting this model's role in organizing regulatory networks.30530-7) At a broader scale, identifies A and B compartments, where principles contribute to spatial segregation of active and repressed genomic regions. A compartments, characterized by open and high gene density, associate with transcriptional activation, while B compartments feature condensed, heterochromatic structures linked to . This compartmentalization influences regulatory element accessibility and correlates with epigenetic marks that reinforce active or repressed states. Hi-C further reveals super-enhancers as dense clusters of interacting enhancers, often forming through loop-mediated aggregation that amplifies target gene expression. These clusters exhibit strong self-interactions in contact maps, integrating multiple regulatory inputs for robust transcriptional control. DNA motif analysis of loop anchors consistently shows a bias toward convergent CTCF orientations, where motifs face each other to facilitate stable loop closure and precise regulatory wiring.01497-4)

Disease associations

Chromosome conformation capture (3C) technologies have revealed how disruptions in topologically associating domains (TADs) contribute to developmental diseases, particularly through misregulation of key genes. In limb malformations such as and , structural variants (SVs) at the EPHA4 locus alter TAD boundaries, leading to ectopic enhancer-promoter interactions that upregulate genes like EPHA4 or in patient-derived fibroblasts and mouse models. These findings demonstrate that analysis of patient cells can detect SV-induced ectopic chromatin contacts, providing a mechanistic link between 3D genome rewiring and pathogenic phenotypes. In cancer, 3C methods uncover neo-loops and neo-TADs formed by SVs, which hijack enhancers to drive activation. For instance, genomic rearrangements in various tumor types create novel loops that connect distant regulatory elements to proto-oncogenes, as identified through profiling across hundreds of cancer genomes. of Hi-C data is crucial in these analyses to account for biases from and copy number variations, which can otherwise distort contact frequency estimates and lead to misinterpretation of structural changes. Additionally, has been applied to map disease-specific loops mediated by factors like or in cancer cells, highlighting alterations in loop anchors that promote tumor progression. Recent advances integrate 3C with in the Exo-C approach to simultaneously detect SVs and point mutations affecting architecture in cancers. This method, applied to pediatric tumor samples, identifies noncoding point mutations that disrupt motifs or TAD boundaries, contributing to oncogenesis without altering protein-coding sequences. Such point mutations can subtly alter enhancer , as briefly noted in loop detection algorithms that prioritize integrity for accurate identification.

Evolutionary and developmental biology

Chromosome conformation capture techniques, particularly , have revealed dynamic changes in topologically associating domains (TADs) during processes. In stem cells, data show a marked reorganization of architecture as quiescent cells activate and progress through myogenic stages, with significant loss of TAD border insulation and enhancer-promoter looping early in , while over 60% of TAD borders remain stable across . This rewiring is driven by factors like PAX7, whose reduction correlates with diminished long-range interactions essential for myogenic . Similar dynamics occur during aging, where geriatric muscle stem cells exhibit increased long-range contacts and weakened TAD insulation, contributing to impaired regenerative potential. In embryogenesis, and its high-resolution variant Micro-C have illuminated the establishment of 3D structures prior to zygotic genome activation. During early development, Micro-C maps at 100-bp resolution across nuclear cycles 1–8 and later stages identify over 3,600 boundaries and 1,000 loops that form as early as the pre-cellular embryo, persisting through and facilitating the deployment of developmental genes bound by factors like and . Single-cell in post-gastrulation embryos further demonstrates cell-to-cell variability in chromosome conformations, with most nuclei showing plastic structures that cluster into and lineages, while specialized cells like primitive erythrocytes adopt compact, stable folding with elevated long-range contacts. In plants, low-input during male development in uncovers stage-specific shifts in loops and compartments, linking 3D reorganization to pollen viability and . Evolutionarily, has highlighted conserved loops across species, particularly in clusters that coordinate body patterning. In , unlike where a large insertion splits the Hox cluster into separate TADs, reveals a single encompassing TAD with inter-looping between (ANT-C) and Bithorax (BX-C) complexes, mediated by shared sites; disrupting these loops via CRISPR-Cas9 reduces expression and induces morphological defects. Cross-species comparisons, including aligned contact maps from and D. pseudoobscura, show 57% conservation of states for orthologous genes, with evolutionary rearrangements like neo-X chromosome formation altering long-range interactions but preserving core TAD structures. Such analyses also detect synteny in genomic rearrangements, where inter-chromosome inversions rarely disrupt pre-existing 3D folding, underscoring the robustness of organization amid evolutionary change. Recent advances link dynamics to transcription in developmental contexts, with showing compartment shifts that modulate across species.

References

  1. [1]
    Capturing Chromosome Conformation - Science
    Job Dekker et al. ,. Capturing Chromosome Conformation.Science295,1306-1311(2002).DOI:10.1126/science.1067799. Export citation. Select the format you want to ...
  2. [2]
    Systematic evaluation of chromosome conformation capture assays
    Sep 3, 2021 · Chromosome conformation capture (3C) assays are used to map chromatin interactions genome-wide. Chromatin interaction maps provide insights ...
  3. [3]
    Advances in the multimodal analysis of the 3D chromatin structure ...
    Apr 25, 2024 · This review divides these methodologies into sequencing-based and imaging-based methodologies, tracing their development over time.
  4. [4]
    Spheroid Chromatin Units (ν Bodies) - Science
    Spheroid Chromatin Units (ν Bodies). Ada L. Olins and Donald E. OlinsAuthors Info & Affiliations. Science. 25 Jan 1974 ... nucleosomes with short DNA linkers ...
  5. [5]
    Analysis of long-range chromatin interactions using Chromosome ...
    The basic steps of 3C are: 1) Formaldehyde crosslinking, 2) Digestion with a restriction enzyme, 3) intra-molecular ligation, and 4) ligation product detection ...
  6. [6]
    Non-backtracking walks reveal compartments in sparse chromatin ...
    Jul 9, 2020 · Resulting dependence turns out to be fairly close to the fractal globule contact probability, P(s)\sim s^{-\alpha } with \alpha \approx 1 at ...
  7. [7]
  8. [8]
  9. [9]
    Chromosome conformation capture technologies and their impact in ...
    Apr 30, 2016 · Chromosome conformation capture carbon copy or 5C allows obtaining information on the contacts established by multiple genomic fragments in a ...
  10. [10]
    None
    Nothing is retrieved...<|control11|><|separator|>
  11. [11]
    Hi-C: A Method to Study the Three-dimensional Architecture of ... - NIH
    May 6, 2010 · This method was used in the research reported in Lieberman-Aiden et al., Science 326, 289-293 (2009). I. Crosslinking, Digestion, Marking of ...
  12. [12]
    HI-C 2.0: AN OPTIMIZED HI-C PROCEDURE FOR HIGH ... - NIH
    Previous high resolution Hi-C libraries have used MboI or DpnII [13] to fragment DNA with restriction endonucleases to an average length of ~500 bp.
  13. [13]
    Single-cell Hi-C reveals cell-to-cell variability in chromosome structure
    Sep 25, 2013 · Here we introduce single-cell Hi-C, combined with genome-wide statistical analysis and structural modelling of single-copy X chromosomes,Missing: paper | Show results with:paper
  14. [14]
    Three-dimensional genome structures of single diploid human cells
    Aug 31, 2018 · J. Dekker, K. Rippe, M. Dekker, N. Kleckner, Capturing chromosome conformation. Science 295, 1306–1311 (2002).
  15. [15]
    Robust single-cell Hi-C clustering by convolution- and random-walk ...
    We describe scHiCluster, a single-cell clustering algorithm for Hi-C contact matrices that is based on imputations using linear convolution and random walk.
  16. [16]
    Order and stochasticity in the folding of individual Drosophila genomes
    Jan 4, 2021 · In single nuclei, the dependence of the contact probability on the genomic distance, Pc(s), has a shape comparable to that observed in the bulk ...Missing: ∝ | Show results with:∝
  17. [17]
    Mapping nucleosome-resolution chromatin organization and ...
    Jan 2, 2024 · To overcome the resolution limit (~1000 bp) in Arabidopsis 3D genome research, we used Micro-C-XL to map nucleosome-resolution chromatin ...
  18. [18]
    An upgraded method of high-throughput chromosome conformation ...
    Jul 3, 2023 · This upgraded method, termed Hi-C 3.0, was applied to cotton leaves for trial. In comparison with the conventional Hi-C 2.0, Hi-C 3.0 can obtain more than 50% ...<|control11|><|separator|>
  19. [19]
    Region Capture Micro-C reveals coalescence of enhancers and ...
    Chromosome conformation capture (3C) methods effectively capture large-scale three-dimensional (3D) genome structure but struggle to achieve the depth necessary ...Missing: review 2023-2025
  20. [20]
    Combining chromosome conformation capture and exome ...
    May 7, 2025 · We present a novel method based on chromatin conformation capture that can be used to simultaneously detect structural variants and point mutations in the ...Missing: Micro- | Show results with:Micro-
  21. [21]
    HiC-Pro: an optimized and flexible pipeline for Hi-C data processing
    Dec 1, 2015 · HiC-Pro is an optimized and flexible pipeline for processing Hi-C data from raw reads to normalized contact maps.
  22. [22]
    The Hitchhiker's Guide to Hi-C Analysis: Practical guidelines - PMC
    The aim of this paper is to provide guidelines for analyzing and interpreting data obtained with genome-wide 3C methods such as Hi-C and 3C-seq that rely on ...
  23. [23]
    HiCNorm: removing biases in Hi-C data via Poisson regression - PMC
    Sep 27, 2012 · We propose a parametric model, HiCNorm, to remove systematic biases in the raw Hi-C contact maps, resulting in a simple, fast, yet accurate normalization ...
  24. [24]
    HiCNorm: removing biases in Hi-C data via Poisson regression
    Sep 27, 2012 · We propose a parametric model, HiCNorm, to remove systematic biases in the raw Hi-C contact maps, resulting in a simple, fast, yet accurate normalization ...<|separator|>
  25. [25]
    Integration of Hi-C and ChIP-seq data reveals distinct types of ...
    We have analyzed publicly available K562 Hi-C data, which enable genome-wide unbiased capturing of chromatin interactions, using a Mixture Poisson ...
  26. [26]
    Integrated ATAC, ChIP, HiC and RNA Seq - CD Genomics
    ATAC-seq, ChIP-seq and Hi-C are techniques to study the mechanisms that govern the dynamic structural and spatial organization of chromatin.
  27. [27]
    Linking genome structures to functions by simultaneous single-cell ...
    Jun 8, 2023 · We obtained similar results using other single-cell Hi-C clustering methods (fig. S3H). This observation echoes recent findings in Drosophila ...
  28. [28]
    The 4D Nucleome Data Portal as a resource for searching and ...
    The 4D Nucleome (4DN) Network aims to elucidate the complex structure and organization of chromosomes in the nucleus and the impact of their disruption in ...Missing: omics | Show results with:omics
  29. [29]
    Current state and future aims of the 4D nucleome project
    Aug 3, 2023 · The four-dimensional nucleome (4DN) consortium studies the architecture of the genome and the nucleus in space and time.
  30. [30]
    Juicer provides a one-click system for analyzing loop-resolution Hi ...
    Juicer comprises three tools, which are designed to be run one-after-another. First, Juicer transforms raw sequence data into a list of Hi-C contacts (pairs of ...Missing: omics | Show results with:omics
  31. [31]
    scCross: a deep generative model for unifying single-cell multi ...
    Jul 29, 2024 · Leveraging our model, we generate data sets at onefold (1X) and fivefold (5X) of the original count of Ast cells' single-cell multi-omics data.Missing: Accio | Show results with:Accio
  32. [32]
    Joint analyses of multi-tissue Hi-C and eQTL data demonstrate ... - NIH
    Apr 30, 2019 · We performed various joint analyses of eQTL and high-throughput chromatin conformation capture (Hi-C) data from 11 human primary tissue types ...
  33. [33]
    Identifying genetic variants associated with chromatin looping and ...
    Sep 18, 2024 · We observed that ~64% (5933 out of 9426) iQTL SNPs overlap with eQTLs or hQTLs (Fisher's exact test p-value < 0.00001, OR = 1.43; see Methods; ...
  34. [34]
    Integrating multi-omic features exploiting Chromosome ... - Frontiers
    In particular, a novel sequencing technique called Chromosome Conformation Capture allows the analysis of the chromosome organization in the cell's natural ...Missing: Accio 2024
  35. [35]
    Integration of Hi-C with short and long-read genome sequencing ...
    Oct 29, 2022 · Integration of Hi-C with short and long-read genome sequencing reveals the structure of germline rearranged genomes.
  36. [36]
    Temporal dynamics and developmental memory of 3D chromatin ...
    Apr 29, 2014 · In this study, we describe the 3D organization of Hox gene clusters at high resolution during the implementation of temporal colinearity in ...<|separator|>
  37. [37]
    Chromatin extrusion explains key features of loop and domain ...
    Model based on loop extrusion makes it possible to recapitulate Hi-C maps accurately using only CTCF ChIP-Seq results. (A, i and ii) Extrusion complex loads ...
  38. [38]
    Dissecting super-enhancer hierarchy based on chromatin interactions
    Mar 5, 2018 · Super-enhancers (SEs) are putative enhancer clusters with unusually high levels of enhancer activity and enrichment of enhancer-associated ...
  39. [39]
    Disruptions of Topological Chromatin Domains Cause Pathogenic ...
    We demonstrate that disruption of TADs can rewire long-range regulatory architecture and result in pathogenic phenotypes. We show that distinct human limb ...
  40. [40]
    Disruption of chromatin folding domains by somatic genomic ...
    Feb 5, 2020 · Disruption of chromatin folding domains by somatic genomic rearrangements in human cancer. Kadir C. Akdemir,; Victoria T. Le, ...
  41. [41]
    ChromLoops: a comprehensive database for specific protein ...
    Disruption of chromatin loops is associated with many diseases, such as cancer and polydactyly. A few methods, including ChIA-PET, HiChIP and PLAC-Seq, have ...
  42. [42]
    Multiscale 3D genome reorganization during skeletal muscle stem ...
    Feb 17, 2023 · Chromatin loops are dynamically remodeled during SC lineage progression. Chromatin loops manifest as foci in high-resolution Hi-C contact maps.
  43. [43]
  44. [44]
    Single cell Hi-C identifies plastic chromosome conformations ...
    Jun 29, 2023 · Here we use single cell Hi-C to map chromosomal conformations in post-gastrulation mouse embryo cells and study their distributions and correlations.
  45. [45]
    Dynamic changes in 3D chromatin structure during male ...
    Feb 10, 2025 · Here, we use the optimized single-nucleus and low-input Hi-C methods to investigate changes in 3D chromatin structure in four types of male gametophyte nucleus ...
  46. [46]
    Evolutionary patterns and functional effects of 3D chromatin ... - Nature
    Jul 26, 2024 · Notably, butterflies exhibited chromatin loops between Hox gene cluster ANT-C and BX-C, unlike Drosophila. Our CRISPR-Cas9 experiments in ...
  47. [47]
    Development and evolution of Drosophila chromatin landscape in a ...
    Nov 1, 2024 · The cross-linked cells were used to prepare Hi-C libraries with Proximo Hi-C kits v4.0 (Phase Genomics) according to the manufacturer's protocol ...Results · Methods · Chip-Seq Experiments And...
  48. [48]
    Chromatin conformation, gene transcription, and nucleosome ...
    Jan 10, 2025 · On Hi-C with 4-μm ActD treatment, we again observed a decrease in loop frequency with transcriptional inhibition at the wild-type loci (Fig. 4C) ...