Fact-checked by Grok 2 weeks ago

Comparative genomic hybridization

Comparative genomic hybridization (CGH) is a molecular cytogenetic designed to detect and map chromosomal copy number variations, such as gains, losses, and amplifications, across the entire of a test sample by comparing it to a . Developed in 1992 by Kallioniemi and colleagues, the method involves labeling DNA from the test sample (e.g., tumor ) with one fluorochrome and reference DNA with another, then hybridizing them competitively to normal metaphase chromosomes; variations in the ratio of fluorescence intensities along the chromosomes reveal regions of DNA over- or under-representation. This approach revolutionized the analysis of solid tumors and other samples where traditional karyotyping is challenging due to the lack of dividing cells. Over time, CGH has advanced into array-based CGH (aCGH), which replaces metaphase chromosomes with high-density microarrays of DNA probes, enabling detection of submicroscopic copy number variants (CNVs) as small as 50–100 kb with greater precision and throughput. Key applications include identifying somatic alterations in cancers like breast and bladder tumors, where it has uncovered novel amplified regions such as those on chromosomes 11q13 and 17q11-12; prenatal diagnosis of fetal anomalies through analysis of amniotic fluid or chorionic villi; and evaluation of products of conception in recurrent pregnancy loss, achieving abnormality detection rates of 38% compared to 37% with conventional karyotyping, as reported in one study. aCGH also aids in diagnosing constitutional disorders, such as intellectual disabilities, by pinpointing pathogenic CNVs missed by standard cytogenetics. Despite these strengths, including no requirement for cell culture and applicability to archived paraffin-embedded samples, CGH and aCGH face limitations such as potential over-detection of benign polymorphisms, which can complicate clinical interpretation, and the need for follow-up validation with methods like or sequencing to confirm causality. Ongoing refinements, including integration with arrays to assess and maternal cell contamination, continue to enhance its diagnostic yield and utility in genomic medicine.

Introduction

Definition and Principles

Comparative genomic hybridization (CGH) is a molecular cytogenetic technique designed to identify gains and losses in DNA copy number across the entire genome by comparing a test sample to a . It achieves this through the hybridization of differentially labeled test and reference DNA to normal genomic targets, such as chromosomes in conventional CGH or DNA arrays in array-based variants, enabling the detection of chromosomal imbalances like , deletions, and amplifications without requiring prior knowledge of specific loci. The foundational principle of CGH is competitive hybridization, where equal amounts of test DNA—often from pathological samples such as tumors—and reference DNA from a normal genome compete for binding to the same target sequences. The test DNA is typically labeled with a red-emitting like Cy5, while the reference DNA is labeled with a green-emitting like Cy3; these are mixed and hybridized to the targets under conditions that allow specific binding proportional to their abundance. Copy number variations in the test sample disrupt the balance: gains lead to excess red signal binding, appearing as red dominance, while losses result in excess green signal, visualized as green dominance along chromosomal or positions to generate ratio profiles of genomic imbalances. Detection relies on quantifying the fluorescence intensities to compute a ratio, defined as the test signal divided by the reference signal, which reflects relative copy numbers. For interpretive clarity, this ratio is commonly log-transformed to the base 2: \log_2 \left( \frac{\text{test signal}}{\text{reference signal}} \right) Values greater than 0 indicate copy number gains, less than 0 indicate losses, and near 0 signify balanced regions. In conventional CGH, resolution is constrained by target density to approximately 5-10 , limiting detection to larger-scale aberrations. This method was pioneered in for mapping DNA amplifications in solid tumors.

Historical Development

Comparative genomic hybridization (CGH) was invented in 1992 by Outi Kallioniemi and colleagues at the University of California, San Francisco, as a molecular cytogenetic technique to map DNA copy number imbalances across the entire genome in solid tumors, circumventing the need for cell culturing or metaphase chromosome preparation. The method involved differentially labeling tumor and reference DNA with fluorescent dyes and hybridizing them to normal metaphase chromosomes, allowing visualization of gains and losses through ratio-based fluorescence signals. In the early 1990s, CGH saw rapid initial adoption for analyzing chromosomal alterations in uncultured solid tumor samples, with the seminal 1992 study demonstrating its ability to detect gene amplifications in primary breast carcinomas and other malignancies. Follow-up publications in 1994 further validated its application in identifying recurrent copy number changes in breast cancer, establishing CGH as a valuable tool for cancer genomics research during that decade. The transition to array-based CGH (aCGH) occurred in 1998, when Daniel Pinkel and colleagues at the same institution developed a higher-resolution version by hybridizing labeled DNAs to spotted DNA arrays instead of spreads, enabling detection of sub-megabase alterations. This innovation, building on bacterial artificial chromosome (BAC) clones as targets, marked a significant refinement and was widely adopted by the early 2000s for its improved sensitivity and throughput in mapping tumor genomes. Commercialization accelerated aCGH's dissemination in the 2000s, with platforms from Agilent Technologies—launching its oligonucleotide aCGH arrays in 2005—and NimbleGen Systems providing accessible, high-density tools for routine use in research and diagnostics. By the 2010s, integration of CGH principles with next-generation sequencing (NGS) emerged, allowing combined copy number and sequence variant analysis for enhanced genomic resolution in clinical settings. Key contributors, including Daniel Pinkel and J.W. Gray, drove these advancements from conventional to array and sequencing-enhanced formats.

Core Principles

Mechanism of Copy Number Detection

In comparative genomic hybridization (CGH), the detection of copy number variations relies on the competitive binding of differentially labeled DNA fragments from test and reference samples to normal genomic targets, such as metaphase chromosomes or arrayed probes. Test DNA, typically from a sample of interest like tumor tissue, is labeled with a red fluorophore (e.g., Cy5), while reference DNA from a normal diploid genome is labeled with a green fluorophore (e.g., Cy3). These DNA fragments, sheared to sizes ranging from 200 to 2000 base pairs to facilitate efficient hybridization, are mixed in equal amounts and allowed to compete for binding sites on the target sequences. In regions of copy number gain in the test sample, excess test DNA fragments outcompete reference fragments, resulting in increased red fluorescence intensity; conversely, copy number losses lead to predominant green signals due to underrepresentation of test DNA. This biophysical competition ensures that fluorescence ratios directly reflect relative DNA copy numbers across the genome. Copy number imbalances are quantitatively detected by measuring the of red-to-green intensities along chromosomal positions, with diploid regions exhibiting balanced signals and equal contributions from both labels. Gains and losses are identified when the logarithm base 2 of the intensity (log2(/)) deviates significantly from 0, typically by thresholds of >0.2 to 0.5 for gains and <-0.2 to -0.5 for losses, depending on the platform resolution and noise levels; for instance, a log2 of approximately 0.58 corresponds to a single-copy gain in a diploid context. These ratios are plotted as profiles along chromosomes, revealing aberrations such as whole-arm gains (e.g., +8q in various cancers), where sustained elevated ratios indicate amplified regions. The sensitivity of this approach allows detection of changes as small as 20% of the cell population in conventional CGH, though array-based variants improve resolution to kilobase scales. To ensure accurate ratio measurements, data undergo normalization procedures that correct for technical variations, including background subtraction to remove nonspecific fluorescence and global adjustments to account for differences in labeling efficiencies between dyes. These steps involve subtracting local background signals from raw intensities and scaling ratios so that the median across autosomes equals 1 (or log2 ratio of 0), mitigating biases from unequal incorporation of fluorophores. Such normalization is critical for reliable quantification, as uncorrected labeling disparities can shift ratios by up to 20-30%. CGH distinguishes true copy number changes from benign polymorphisms or noise through statistical segmentation models that identify contiguous regions of consistent ratio deviations. For example, circular binary segmentation (CBS) recursively partitions the genome into segments of uniform copy number, using permutation-based tests to assess significance and filter out small, nonrecurrent variations like copy number polymorphisms (CNPs), which typically span <50 kb and occur in healthy populations. This approach enhances specificity by focusing on biologically relevant aberrations, such as those exceeding 1 Mb, while minimizing false positives from experimental variability.

Signal Interpretation and Ratios

In comparative genomic hybridization (CGH), signal interpretation begins with the quantification of fluorescence intensities from the hybridized metaphase chromosomes or array elements, where test DNA is typically labeled with a red fluorophore (e.g., Cy5) and reference DNA with a green fluorophore (e.g., Cy3). Pixel-by-pixel intensity ratios (R/G) are calculated after subtracting local background noise, and these are averaged across each target spot or clone to yield a representative ratio per genomic locus. To enhance symmetry and facilitate statistical analysis, the ratios are commonly transformed to a log2 scale, where log2(R/G) = 0 indicates balanced copy number, positive values suggest gains, and negative values indicate losses. This log2 transformation centers the data around zero for diploid regions and amplifies deviations for easier detection of aberrations. Normalization is essential to correct for technical biases such as dye-specific effects, spatial heterogeneity, and sequence composition artifacts that can distort the ratio profiles. Lowess (locally weighted scatterplot smoothing) normalization addresses dye bias by regressing log2 ratios against total intensity for each spot, fitting a non-linear curve to adjust for intensity-dependent imbalances across the genome. Median normalization further refines the data by subtracting the median log2 ratio from autosomes or the entire genome to establish a baseline of 1.0 (or log2 = 0) for copy-neutral regions, mitigating global shifts due to unequal labeling efficiency or hybridization conditions. Additional corrections, such as for GC content, prevent wavy patterns in log2 profiles by adjusting ratios based on probe composition, ensuring accurate representation of true copy number variations. Aberration calling involves identifying regions of copy number imbalance from the normalized log2 ratio profiles, often using threshold-based or statistical segmentation methods to delineate gains, losses, and breakpoints. Simple threshold approaches classify gains as log2 ratios exceeding 0.2 (corresponding to approximately 1.15-fold increase) and losses below -0.2, with higher thresholds like ±1.0 for amplifications or homozygous deletions, though these values are calibrated based on platform-specific noise levels (typically standard deviations of 0.1-0.3). More advanced statistical methods, such as , model the data as a sequence of hidden states representing copy number levels (e.g., 0 to 8 copies), using to infer state transitions and detect breakpoints with higher sensitivity and specificity, particularly in noisy tumor samples. These HMM-based approaches achieve accuracies up to 98% in copy number inference when validated against independent methods like . Software tools facilitate segmentation, visualization, and interpretation of CGH data, enabling automated processing of large datasets. DNAcopy, implementing (CBS), divides the genome into regions of equal copy number by iteratively testing for change points that maximize between-segment variance, providing robust breakpoint detection for both conventional and array CGH profiles. BlueFuse offers a graphical interface for multi-sample analysis, integrating normalization, segmentation, and aberration calling with visualization as chromosomal ideograms or copy number plots that highlight gains/losses in color-coded tracks. These tools support replicate comparisons to confirm findings and export results for downstream genomic studies. Interpreting artifacts is crucial to distinguish true copy number aberrations from technical noise, such as optical bleeding, uneven hybridization, or batch effects, which can mimic gains or losses in ratio profiles. Replicate experiments, including dye-swap hybridizations and self-hybridizations of reference DNA, are used to assess reproducibility; consistent deviations across replicates indicate biological signals, while discordant ones flag artifacts, with false-positive rates estimated below 0.01 through such validations. This approach ensures reliable calling by quantifying signal-to-noise ratios and confirming borderline aberrations with orthogonal methods like quantitative PCR.

Conventional CGH Methods

Sample Preparation and Labeling

In conventional comparative genomic hybridization (CGH), sample preparation begins with the isolation of genomic DNA from the test sample, typically tumor tissue, and a matched reference sample from normal diploid cells. Approximately 1-2 μg of high-molecular-weight DNA is extracted from each using standard techniques such as phenol-chloroform extraction or commercial purification kits to ensure purity and integrity. Quality is assessed by spectrophotometry, targeting an A260/A280 ratio of 1.8-2.0 to confirm protein-free DNA, and by 0.8% agarose gel electrophoresis to visualize intact bands without smearing indicative of degradation. The extracted DNA is then fragmented to generate pieces of 200-2000 base pairs, optimal for efficient and uniform hybridization to metaphase chromosomes. This is accomplished by enzymatic digestion with restriction endonucleases and , which recognize frequent 4-base pair sequences (AG^CT and GT^AC, respectively), typically at 10-20 units per μg DNA in appropriate buffer for 1-2 hours at 37°C, followed by heat inactivation and purification via phenol-chloroform extraction and ethanol precipitation. Fragment size distribution is verified by electrophoresis on a 1% agarose gel, where a continuous smear in the target range confirms successful shearing without over- or under-digestion. Labeling of the fragmented DNA occurs via nick translation, which introduces fluorescent nucleotides while further refining fragment size. Test DNA is labeled with Cy3-conjugated dCTP (green emission) and reference DNA with Cy5-conjugated dCTP (red emission), using a reaction mixture containing 1 μg DNA, DNase I (0.1-0.4 mU/μL for controlled nicking), E. coli DNA polymerase I (5-10 U/μL for nick extension and replacement), and dNTPs (including 0.1 mM labeled ) in nick translation buffer, incubated at 15-16°C for 60-120 minutes. The reaction is stopped by EDTA and heat inactivation at 65°C. Efficiency is evaluated by slot blot assay, where labeled DNA is dotted onto a nylon membrane, hybridized with a total human DNA probe, and scanned for fluorescence; successful labeling achieves >20% incorporation and a base-to-dye ratio of 8:1 to 15:1, ensuring bright signals without excessive . Alternatively, spectrophotometric measurement of dye absorbance (Cy3 at 550 nm, Cy5 at 650 nm) quantifies incorporation, targeting >100 pmol dye per μg DNA. To minimize non-specific binding to repetitive sequences, which can obscure copy number signals, blocking is performed by adding human Cot-1 DNA (50-100 μg per μg labeled probe) and carrier nucleic acids like sheared salmon sperm DNA (10-20 μg) to the combined test and reference probes. The mixture is ethanol-precipitated, resuspended in hybridization buffer, denatured at 75-80°C for 5-10 minutes, and pre-annealed at 37°C for 30-60 minutes to allow Cot-1 DNA to hybridize with repeats prior to target application. Final involves re-running a portion of the labeled probes on a 1.5% to confirm a size smear of 300-2000 post-labeling, with no high-molecular-weight bands indicating incomplete reaction, and yield assessment to secure at least 0.5 μg usable probe per sample for downstream hybridization.

Hybridization and Visualization

In conventional comparative genomic hybridization (CGH), target preparation begins with normal male or female spreads fixed on slides to serve as the hybridization . These spreads are derived from peripheral lymphocytes cultured and harvested using standard cytogenetic methods, including hypotonic and fixation in methanol-acetic acid. Prior to hybridization, the slides undergo pretreatment to enhance accessibility: incubation with RNase A (100 µg/mL) at 37°C for 1 hour removes , followed by optional digestion (50 µg/mL in 0.01 M HCl) at for 10 minutes to reduce cytoplasmic remnants, and fixation in 1% in with MgCl₂. This pretreatment minimizes non-specific binding and ensures clear morphology. The hybridization procedure involves mixing 500 ng each of differentially labeled test and DNAs (e.g., test DNA labeled with Cy3 and with Cy5) with blocking agents such as 10–50 µg human Cot-1 DNA to suppress repetitive sequences. The mixture, prepared in a hybridization buffer containing 50% and 10% dextran sulfate in 2× , is denatured at 74–75°C for 5–7 minutes, pre-annealed at 37°C for 30–60 minutes, and applied to the pretreated slides under a coverslip. Slides are then incubated in a humidified chamber at 37°C for 48–72 hours to allow competitive binding of the probes to the chromosomes. Following incubation, stringent washing removes unbound probes: three 5-minute washes in 50% /2× at 45°C, followed by three 5-minute washes in 2× at 45°C, and a brief rinse in 4× /0.05% Tween-20 at room temperature to minimize non-specific signals. Visualization commences with counterstaining the chromosomes with DAPI (0.1–0.2 µg/mL) in an antifade mounting medium to outline the chromatin and facilitate chromosome identification. Images are captured using a fluorescence microscope, such as the Zeiss Axioskop, equipped with appropriate filter sets for Cy3 (green channel for test DNA) and Cy5 (red channel for reference DNA), at 100× magnification with a high-resolution cooled CCD camera. Digital images of at least 5–10 metaphases are acquired to ensure reproducibility. Initial analysis employs quantitative image analysis software, such as Quips (Vysis/Applied Imaging), which performs background subtraction, fluorescence ratio normalization, and generates average green-to-red ratio profiles along the length of each chromosome to identify copy number variations. These profiles, typically plotted with thresholds of 0.8 for losses and 1.2 for gains, provide a visual map of genomic imbalances.

Array CGH Methods

Array Platforms and Design

Array comparative genomic hybridization (aCGH) platforms represent an evolution from conventional metaphase-based CGH, adapting the basic hybridization principle to solid-support arrays for enhanced resolution and throughput. Early aCGH arrays primarily utilized bacterial artificial chromosome (BAC) and P1-derived artificial chromosome (PAC) clones as probes, each spanning approximately 100-200 kb of genomic DNA, enabling genome-wide copy number detection at resolutions around 1 Mb. These large-insert clones were selected for their stability and ability to represent unique genomic regions, with initial designs featuring about 2,000-5,000 elements spaced at 1 Mb intervals across the human genome. Subsequent advancements introduced oligonucleotide-based arrays, which employ shorter synthetic probes (typically 25-70 bases, such as Agilent's 60-mer ) to achieve resolutions below 1 kb, particularly in targeted regions. (SNP) arrays, often integrated with copy number probes, extend aCGH capabilities by simultaneously assessing and , using probes that interrogate specific SNPs for dual-purpose analysis. Design strategies for these arrays emphasize tiling paths—overlapping probe placements—to ensure comprehensive whole-genome coverage while minimizing gaps, with probe selection algorithms prioritizing unique sequences to exclude repetitive elements and common polymorphisms that could confound signal interpretation. Probe density has progressively increased from kilobase-scale early arrays to over 1 million features by the , driven by synthesis techniques that allow customizable, high-density layouts for either whole-genome or targeted applications. As of 2025, commercial platforms like Agilent's SurePrint technology utilize over 28 million validated probes, achieving resolutions down to the level for enhanced detection of small CNVs. Commercial platforms dominate aCGH implementation, including Agilent's oligonucleotide arrays, which offer 60-mer probes on glass slides for resolutions down to 5-10 kb genome-wide, and Illumina's BeadChip technology, featuring bead-immobilized probes for SNP-inclusive designs with up to 850,000 markers. Custom arrays enable region-specific focusing, such as high-density tiling over exons or microdeletion hotspots, contrasting with standardized whole-genome formats. Probes are immobilized via spotting (for BAC/PAC) or direct synthesis on substrates like glass or silicon slides, often using silane-based linker chemistry to attach DNA covalently and ensure uniform hybridization surfaces. Compared to conventional CGH, array platforms provide superior resolution (from sub-megabase to kilobase levels), eliminate the need for chromosome preparation, and offer quantitative detection of low-level mosaicism (as low as 10-20%), facilitating identification of subtle genomic imbalances previously undetectable.

Hybridization, Scanning, and Data Processing

In array comparative genomic hybridization (array CGH), the hybridization process involves denaturing the fluorescently labeled test and reference DNA samples and applying them to the surface for binding to immobilized probes. The labeled DNAs, typically with Cy3 (green) for the test sample and Cy5 (red) for the reference, are mixed with blocking agents like Cot-1 DNA to suppress non-specific hybridization, then denatured at approximately 95°C for 5 minutes before cooling. Hybridization occurs in a controlled chamber or oven at 65°C for 16 to 40 hours, depending on array density and sample type, with gentle rotation (e.g., 20 rpm) to ensure even distribution and contact across the array slide. This extended incubation allows for specific annealing of DNA fragments to their complementary probes, such as bacterial artificial chromosomes () or , on the array. Coverslips or automated hybridization stations, like the Agilent SureHyb chamber, are used to maintain a stable liquid film and prevent evaporation during the process. Following hybridization, the arrays undergo stringent washing to remove unbound or loosely bound DNA and reduce . Washes typically include an initial rinse in 0.1% /2× SSC at , followed by in 0.1× SSC/0.1% at 37°C for 5 minutes to dissociate non-specific hybrids, and a final rinse in 0.1× SSC. For ozone-sensitive environments (>5 ppb), additional steps with and stabilization solutions prevent Cy5 degradation. The washed slides are then dried by at 1,000 × g for 3 minutes before scanning. Scanning employs laser-based systems, such as the Agilent G2565BA Scanner, exciting Cy3 at 532 nm and Cy5 at 635 nm to capture emitted intensities. Scans are performed at a resolution of 5 to 10 μm per , generating high-resolution images of the array grid for subsequent . This step quantifies the relative binding efficiency of test versus reference DNA to each probe spot. Data extraction begins with image processing using specialized software, such as Agilent's Feature Extraction (version 10.5 or later), which identifies and segments individual spots through adaptive circle or irregular segmentation algorithms. For each , foreground intensities for Cy3 and Cy5 channels are measured, with local background subtraction applied to correct for non-specific . Poor-quality spots—those with low signal-to-noise ratios, saturation, or spatial defects—are flagged and excluded based on metrics like scores below 0.1 or flags less than 1, ensuring over 95% of spots typically pass . The software computes log2 ratios (test/reference intensity) for each , producing raw ratio files in text format for further analysis. These ratios reflect copy number variations, enabling detection of focal amplifications or deletions as small as 50 in resolution, depending on probe density. Preprocessing of the extracted addresses technical artifacts, primarily through to correct for intensity-dependent dye bias and spatial variations. Lowess (locally weighted scatterplot smoothing) or linear is applied across the , adjusting log2 ratios so that the ratio for autosomal probes equals zero, thereby balancing systematic differences between channels. Quality assessment includes metrics such as the derivative log ratio spread (DLRS), which measures noise as the standard deviation of differences between consecutive probe log ratios; a DLRS value below 0.3 indicates high-quality suitable for downstream copy number calling. Additional checks, like signal-to-noise ratios above 20 and below 0.2, validate the dataset. The normalized ratio profiles are then exported as raw files for segmentation and aberration detection in tools like CGH Analytics, facilitating genome-wide copy number analysis.

Applications

In Cancer Research

Comparative genomic hybridization (CGH) has played a pivotal role in oncology by enabling the detection of somatic copy number variations (CNVs) in tumor genomes, which are hallmarks of cancer progression and often drive oncogenesis. Introduced in the early 1990s, CGH allows for genome-wide screening of DNA gains and losses without the need for tumor cell culturing, making it particularly valuable for analyzing solid tumors where karyotyping is challenging. In cancer research, it has revealed non-random chromosomal aberrations that correlate with tumor aggressiveness, such as amplifications at 17q12-q21 encompassing the ERBB2 (HER2) gene in approximately 20% of breast cancers, which is associated with poor prognosis and response to targeted therapies like trastuzumab. Similarly, losses at 18q, including the DCC tumor suppressor locus, are frequent in colorectal cancers and linked to advanced staging and metastasis risk. Early CGH studies in the demonstrated its utility in urologic malignancies, identifying recurrent imbalances like gains of 6p, 7q, and 20q and losses of 9q and 11p in transitional cell carcinomas, highlighting non-random changes that distinguish low- from high-grade tumors. In , CGH uncovered frequent losses at 8p22-p21 and 13q14, as well as gains at 7q and 8q24, which are early events in tumorigenesis and correlate with Gleason score and progression to hormone-refractory disease. These findings established CGH as a tool for mapping cancer-specific genomic landscapes, influencing subsequent research on driver mutations. The advent of array-based CGH (aCGH) in the 2000s enhanced resolution to detect submicroscopic alterations, revealing intratumor heterogeneity in multiforme (GBM), where focal amplifications of and PDGFRA vary across tumor regions, contributing to therapeutic resistance and subclonal evolution. In lung adenocarcinoma, aCGH identifies amplifications in up to 20% of cases, often co-occurring with activating mutations and guiding therapy, with amplified tumors showing differential survival outcomes. Clinically, aCGH applied to formalin-fixed paraffin-embedded (FFPE) archival samples has enabled retrospective analyses of historical cohorts, facilitating the correlation of CNVs with long-term prognosis in diverse cancers. A key advantage of aCGH in heterogeneous tumors is its ability to uncover low-level copy number gains and losses that reflect subclonal dynamics, such as progressive accumulation of alterations during tumor evolution in , where high-level MYCN amplification at 2p24 is detected in 20-25% of cases and strongly predicts adverse outcomes independent of stage. In these tumors, CGH also delineates (LOH) patterns, like 1p and 11q deletions, which interact with MYCN status to refine risk stratification. Overall, CGH's integration into has transformed the of actionable genomic , supporting precision approaches.

In Constitutional and Prenatal Genetics

In constitutional genetics, array comparative genomic hybridization (array CGH) serves as a primary diagnostic tool for identifying heritable or copy number variations (CNVs) associated with developmental disorders and intellectual disabilities. It excels at detecting submicroscopic deletions and duplications that are below the resolution of traditional karyotyping, such as the 22q11.2 deletion in , which affects approximately 1 in 4,000 live births and leads to conotruncal heart defects, , and immune deficiencies. Similarly, array CGH has identified duplications in the 17p12 region encompassing the PMP22 gene in Charcot-Marie-Tooth disease type 1A, a common inherited neuropathy characterized by progressive muscle weakness and sensory loss. Since the early 2010s, array CGH has largely replaced G-banded karyotyping as the first-line test for patients with unexplained congenital anomalies, autism spectrum disorder, or developmental delays, offering a genome-wide assessment with resolutions down to 50-100 kb. In prenatal genetics, array CGH is routinely applied to samples obtained via or (CVS) to evaluate fetuses with abnormalities, such as structural malformations or growth restrictions. It identifies pathogenic CNVs in approximately 6% of cases with normal karyotypes but abnormal , providing an incremental diagnostic yield over conventional . For instance, array CGH has facilitated the prenatal detection of the , which manifests with , seizures, and distinctive facial features, as well as the 4p deletion in Wolf-Hirschhorn syndrome, associated with growth failure, , and craniofacial anomalies. Additionally, array CGH is used to analyze in cases of recurrent pregnancy loss, detecting chromosomal abnormalities at rates up to 38%, compared to 23% with conventional karyotyping. The management of variants of unknown (VOUS), which occur in about 1-2% of prenatal cases, involves parental testing, correlation with fetal , and multidisciplinary counseling to assess potential risks, emphasizing the need for cautious interpretation to avoid unnecessary anxiety. Array CGH's advantages in these contexts include its ability to perform a whole-genome scan without the resolution limitations of banding (typically 5-10 ), enabling the detection of mosaicism levels as low as 10-20% that might be missed by karyotyping. The American College of and Genomics (ACMG) has recommended array CGH as a first-tier test for prenatal diagnosis since its 2010 addendum and 2013 revision, particularly for cases with fetal anomalies, integrating it with (FISH) for targeted confirmation of suspected CNVs when needed. This approach enhances diagnostic accuracy while minimizing the need for multiple invasive procedures.

Limitations and Advances

Technical Limitations

Conventional comparative genomic hybridization (CGH) is constrained by a low of approximately 5-10 , which limits its ability to identify small-scale genomic imbalances. This technique also fails to detect balanced chromosomal rearrangements, such as translocations and inversions, because it solely measures copy number variations through fluorescence ratio differences. Additionally, conventional CGH requires high-quality chromosomes from cultured cells, which can be challenging to obtain and introduces variability dependent on rates. The method has a high threshold for mosaicism detection, typically requiring imbalances present in at least 25% of cells to produce reliable signals. Array-based CGH improves but introduces other technical challenges, including artifacts from probe cross-hybridization, particularly in oligonucleotide arrays where similarity leads to non-specific binding and false positives. of data is complicated in samples with or , as global copy number shifts can distort ratios and obscure regional imbalances. Interpretation of variants of unknown (VOUS) remains difficult, often necessitating additional clinical correlation or parental testing due to the lack of established pathogenicity for many copy number . Furthermore, imbalances in GC-rich regions are prone to bias, as probe hybridization efficiency varies with , leading to uneven signal intensities. Both conventional and array CGH suffer from dye bias, where differential labeling efficiency between fluorophores (e.g., Cy3 and Cy5) skews log2 ratios and requires dye-swap experiments for correction. Sample preparation for conventional CGH is labor-intensive, involving chromosome spreads and lengthy hybridization times. High-density array CGH platforms with over 1 million probes incur significant costs, often exceeding $1,000 per assay due to array fabrication and processing expenses. CGH methods inherently miss point mutations and epigenetic alterations, as they focus exclusively on DNA copy number changes rather than sequence variants or patterns. Prior to the , CGH performed poorly with low-input samples, such as those from single cells, due to insufficient DNA yield and amplification biases that amplified noise over signal. Findings from CGH require validation using orthogonal techniques like (FISH) or quantitative (qPCR) to confirm copy number alterations and rule out artifacts.

Recent Technological Improvements

Since the early 2010s, array-based comparative genomic hybridization (array CGH) has seen significant advancements in probe density, transitioning from bacterial artificial chromosome (BAC) arrays to high-density platforms with over 1 million probes, enabling resolutions as fine as 2-10 kb for detecting copy number variations across the . By 2015-2016, commercial platforms like Agilent's SurePrint offered customizable targeted arrays with up to 1 million probes focused on specific genomic regions of interest, such as disease-associated loci, improving sensitivity for rare or small structural variants while reducing off-target noise. These higher-density designs have facilitated more precise mapping of aberrations in clinical samples, surpassing the limitations of earlier lower-resolution arrays. Integration of array CGH with next-generation sequencing (NGS) technologies has further enhanced its capabilities, particularly through hybrid approaches like optical genome mapping (OGM) introduced in the 2020s, which provides long-range structural information to phase copy number variants and resolve complex rearrangements not easily detected by traditional array CGH alone. OGM, using nanofluidic chips to image high-molecular-weight DNA molecules labeled at specific motifs, achieves genome-wide resolution down to 500 bp for structural variants and complements array CGH by confirming copy-neutral events like inversions and translocations in a single assay. This integration has been particularly valuable in cancer genomics, where it aids in distinguishing tumor-specific alterations from germline variants with higher accuracy. Single-cell adaptations of CGH, known as scCGH, emerged prominently around 2012 with optimized protocols for whole-genome amplification and enzymatic labeling from limited input material, allowing detection of copy number changes as small as 100 kb in individual cells to study tumor heterogeneity. These methods, refined in subsequent years, support low-input applications from fine-needle biopsies or circulating tumor cells, enabling high-resolution profiling without culturing, which is critical for capturing intratumor diversity in heterogeneous samples like solid tumors. By , standardized workflows for scCGH on 60K-1M probe arrays demonstrated reliable aberration calling in single lymphoblasts and fibroblasts, expanding its utility in . Automation and have streamlined array CGH workflows, with automated hybridization stations and autoloading scanners introduced post-2010 reducing processing time from days to hours for high-throughput clinical labs. algorithms, such as the FACETS method developed in 2016, improve aberration calling by integrating allele-specific copy number estimation and clonal heterogeneity modeling, enhancing accuracy in noisy array data from tumor samples. More recent approaches, like those using hidden Markov models or for segmentation, further automate variant prioritization, minimizing false positives in large-scale datasets. Multiplexed techniques in scanning have also accelerated data acquisition, supporting faster turnaround in diagnostic settings. Emerging innovations include hybrid OGM-NGS pipelines for functional validation of CGH-detected variants, with cost reductions in CGH reagents and enabling routine clinical adoption by 2025, where diagnostic yields have increased to over 10-15% in neurodevelopmental disorders without prohibitive expenses. While have been explored for single-cell isolation in related sequencing assays, their direct application to CGH remains investigational for high-throughput aberration screening. Similarly, CRISPR-based editing integrated with CGH-like readouts is under development for validating copy number impacts in model systems, though not yet standardized for clinical use. These advances collectively address prior resolution and throughput gaps, positioning CGH as a cornerstone in modern cytogenomics.

References

  1. [1]
    Comparative genomic hybridization for molecular cytogenetic ...
    Oct 30, 1992 · Comparative genomic hybridization produces a map of DNA sequence copy number as a function of chromosomal location throughout the entire genome.Missing: original paper
  2. [2]
    Comparative Genomic Hybridization for Molecular Cytogenetic ...
    Abstract. Comparative genomic hybridization produces a map of DNA sequence copy number as a function of chromosomal location throughout the entire genome.
  3. [3]
    Array comparative genomic hybridization analysis of products of ...
    Array comparative genomic hybridization (aCGH) does not require live cells; it can be used on DNA from tissue samples. It is a high-resolution genome analysis ...
  4. [4]
    Comparative genomic hybridization: an overview - PMC - NIH
    Comparative genomic hybridization (CGH) is a newly described molecular-cytogenetic assay that globally assays for chromosomal gains and losses in a genomic ...
  5. [5]
    Review Controversies and challenges of array comparative genomic ...
    Comparative genomic hybridization-array analysis enhances the detection of aneuploides and submicroscopic imbalances in spontaneous miscarriages.
  6. [6]
    Comparative genomic hybridisation - PMC - PubMed Central - NIH
    Comparative genomic hybridisation (CGH) is a technique that permits the detection of chromosomal copy number changes without the need for cell culturing.
  7. [7]
    Comparative Genomic Hybridization: DNA labeling ... - NIH
    As a first step, two genomic DNA samples - one test and one reference - are labeled separately using two different fluorescent dyes: generally Cyanine 3 (Cy3; ...
  8. [8]
    arrays pave the way for identification of novel cancer-related genes
    However, CGH to metaphase chromosomes can provide only limited resolution at the 5-10 Mb level. To circumvent this limitation, array-based CGH has been ...
  9. [9]
    High resolution analysis of DNA copy number variation using ...
    Comparative genomic hybridization (CGH) was developed for genome-wide analysis of DNA sequence copy number in a single experiment.
  10. [10]
    Agilent Technologies Introduces First Commercial Mouse Microarray ...
    Aug 18, 2005 · Agilent launched its breakthrough oligo aCGH platform in January 2005. The company recently introduced major platform enhancements that include ...
  11. [11]
  12. [12]
    Development and validation of a CGH microarray for clinical ...
    There were 82 instances of “false-positive” signals defined as a T/R ratio exceeding the threshold of ± 0.2 in a region that was karyotypically normal. This ...
  13. [13]
    Comparative analysis of copy number detection by whole-genome ...
    Microarray-based comparative genomic hybridization (aCGH) is a powerful diagnostic tool for the detection of DNA copy number gains and losses associated ...
  14. [14]
    A stepwise framework for the normalization of array CGH data - PMC
    Both ICC and Correlation coefficient attain their highest values after the three-step normalization method. This applies to both the ratios of non-background ...
  15. [15]
    faster circular binary segmentation algorithm for the analysis of array ...
    We developed the circular binary segmentation (CBS) algorithm to divide the genome into regions of equal copy number.
  16. [16]
    Comparative Genomic Hybridization: microarray design and data ...
    1. Introduction. Comparative Genomic Hybridization (CGH) was developed in the early nineties to screen for chromosomal deletions and duplications along whole ...
  17. [17]
    Single-cell copy number variation detection | Genome Biology
    Aug 19, 2011 · Global loess normalization regresses the log2 ratios between test and reference samples on intensities using all probes [8]. The snapCGH package ...
  18. [18]
    [PDF] Hidden Markov Model Inference Copy Number Change in Array ...
    In our analysis, an array-CGH log2 ratio of +/-0.2 is often used as a threshold to make the call of changed clone, for most platform and samples with a profile ...
  19. [19]
    [PDF] DNAcopy: A Package for Analyzing DNA Copy Data - Bioconductor
    This document presents an overview of the DNAcopy package. This package is for analyzing array DNA copy number data, which is usually (but not always) ...
  20. [20]
    [PDF] BlueFuse Multi v4.5 - Software Guide - Illumina Support
    BlueFuse Multi v4.5 Software Guide. Page 58. Genotype Analysis. Two genotype ... If you select CGH summary only, BlueFuse Multi creates separate reports for each ...
  21. [21]
    Development of a DNA-Labeling System for Array-Based ... - NIH
    The two primary methods employed to label genomic DNA for CGH analysis are the nick translation4 and the random prime methods.
  22. [22]
    None
    ### Protocol for Conventional CGH Sample Preparation and Labeling
  23. [23]
    (PDF) Comparative genomic hybridization analysis - ResearchGate
    Download full-text PDF Read full-text · Download full-text PDF · Read full-text ... (1992) Comparative genomic hybridization for molecular. cytogenetic ...
  24. [24]
    [PDF] Pretreatment of Chromosome Slides for FISH/CGH/SKY
    The time of pepsin treatment and amount of pepsin stock solution to be used is dependent on (a) the amount of cytoplasm surrounding the metaphase spreads ...
  25. [25]
    A comparative genomic hybridization study in a 46,XX male
    The 1:1 probe mixture (500 ng) was hybridized in combination with unlabeled human Cot-1 DNA on normal male metaphase spreads. Slides were analyzed using a ...
  26. [26]
    [PDF] Development and Application of Comparative Genomic ...
    DNA probes and the target DNA were allowed to hybridise at 37°C for 48-72 hours in a metal tray floating in a water bath. 2.6.4 Washing steps. Solutions ...
  27. [27]
    Automated Screening for Genomic Imbalances using Matrix-Based ...
    Jan 1, 2002 · Analysis of chromosomal imbalances by comparative genomic hybridization (CGH) (Du Manoir et al, 1993; Joos et al, 1993; Kallioniemi et al, 1992) ...
  28. [28]
    The Potential Value of Comparative Genomic Hybridization Analysis ...
    ... Zeiss Axioskop 2 epifluorescence microscope using filter-set for ... Our experience with CGH in cytogenetic analysis of malignant effusions is that CGH ...
  29. [29]
    CGH Analysis Shows Genetic Similarities and Differences in ...
    The software program QUIPS (Vysis) was used to calculate average green-to-red ratio profiles for each chromosome. At least six observations per autosome and ...
  30. [30]
    A comparison of different metaphase CGH methods for the detection ...
    Mar 17, 2004 · In the CGH analysis using the Quips software, chromosome regions were considered to be lost if the hybridization green:red ratios were <0.85 or ...
  31. [31]
    Array-based comparative genomic hybridization: clinical contexts for ...
    Unlike BAC arrays, oligonucleotide arrays do not have the same underlying design concept. ... Some oligonucleotide array platforms are based on SNP probes.
  32. [32]
    Oligonucleotide arrays for high-resolution analysis of copy number ...
    Higher-density arrays are now available in both preconfigured and custom formats from Agilent for array CGH applications. A customized array created on the ...
  33. [33]
    Design optimization methods for genomic DNA tiling arrays - PMC
    The methods proposed below are designed to obtain optimal tile paths for repeat-masked genomes, maximizing the coverage of non-repetitive DNA while ...Missing: CGH polymorphisms
  34. [34]
  35. [35]
    [PDF] Cytogenetic Analysis by High-Resolution SNP Array - Illumina
    The Infinium CytoSNP-850K BeadChip Array is a high-resolution, high-performance array that interrogates over 850,000 single nucleotide polymorphisms (SNPs) ...
  36. [36]
    BAC to the future! or oligonucleotides: a perspective for micro array ...
    The array CGH technique (Array Comparative Genome Hybridization) has been developed to detect chromosomal copy number changes on a genome-wide and/or high- ...
  37. [37]
    [PDF] Agilent Oligonucleotide Array-Based CGH for Genomic DNA Analysis
    This solution contains material which causes damage to the following organs: kidneys, liver, cardiovascular system, upper respiratory tract, skin, central ...
  38. [38]
    Human and mouse oligonucleotide-based array CGH - PMC - NIH
    The presented oligo array CGH protocol provides a highly sensitive and reproducible platform both for human and mouse samples applicable to DNA isolated from ...
  39. [39]
    Normalization of array-CGH data: influence of copy number ...
    Oct 22, 2007 · A parallel can be made to the derivative log ratio spread (DLR) value calculated by the Agilent CGH Analytics software. The DLR-value can be ...
  40. [40]
    Optimizing comparative genomic hybridization for analysis of DNA ...
    Comparative genomic hybridization (CGH) is a powerful new method for molecular cytogenetic analysis of cancer. In a single hybridization, CGH provides an ...
  41. [41]
    Identification of gains and losses of DNA sequences in ... - PubMed
    In this study, CGH analysis was applied to the identification of genomic imbalances in 26 bladder cancers in order to gain information on the genetic events ...Missing: 1990s prostate
  42. [42]
    Intratumoral heterogeneity of receptor tyrosine kinases EGFR and ...
    Feb 8, 2012 · We investigated the pattern of heterogeneity of RTK amplification and functional RTK dependence in GBM tumor cell subpopulations.
  43. [43]
    Detection of DNA copy number alterations in cancer by array ...
    In this review, we briefly discuss specific technical issues relevant for array comparative genomic hybridization analysis in cancer tissues.
  44. [44]
  45. [45]
    Array-based technology and recommendations for utilization in ... - NIH
    Array CGH cannot identify balanced chromosomal rearrangements, such as translocations or inversions, or differentiate free trisomies from unbalanced ...
  46. [46]
    Comparative genomic hybridization: Uses and limitations
    Based on its technical limitations, there is little reason to apply CGH to chromosomes of metaphase cells in routine diagnostic settings. However, the new ...
  47. [47]
    The cytogenetic view of standard comparative genomic hybridization ...
    The limits of the resolving power of comparative genomic hybridization (CGH) have been given as 10-20 Mbp if at least 50% of the studied neoplastic cell ...
  48. [48]
    Custom Array Comparative Genomic Hybridization - MDPI
    Mar 10, 2017 · Array-based comparative genomic hybridization (aCGH) has provided a new impulse to cytogenetic diagnostics and has proved to be a valuable tool ...
  49. [49]
    Prenatal diagnosis: the clinical usefulness of array comparative ...
    Array comparative genomic hybridization (aCGH) is a useful technique for the detection of DNA submicroscopic rearrangements, known as copy number variants (CNVs) ...<|control11|><|separator|>
  50. [50]
    A GC-Wave Correction Algorithm that Improves the Analytical ...
    Array-based comparative genome hybridization (aCGH) is a powerful, data-intensive technique used to identify genomic copy number variation throughout the ...Missing: dye | Show results with:dye
  51. [51]
    [PDF] Agilent 2024 Pricing
    SurePrint G3 Unrestricted CGH Array (1x1M). 1 slide. 895.00. $. G4825A. SurePrint G3 Human CNV Array, 2x400K. 1 slide. 1,145.00. $. G4826A. SurePrint G3 CGH ...
  52. [52]
    The use of ultra-dense array CGH analysis for the discovery of micro ...
    Jan 27, 2011 · Moreover, the recent development of the one million (1 M) probe array CGH platform by Agilent offers an ultra-high (2.1 kb) resolution ...
  53. [53]
    Comprehensive performance comparison of high-resolution array ...
    Apr 24, 2017 · We systematically compared the genome-wide CNV detection power of all 17 available array designs from the Affymetrix, Agilent, and Illumina platforms.
  54. [54]
    Optical Genome Mapping: A New Tool for Cytogenomic Analysis - NIH
    Jul 31, 2025 · Background/Objectives: Optical genome mapping (OGM) has recently emerged as a new technology in the clinical cytogenomics laboratories.
  55. [55]
    Optical genome mapping enables constitutional chromosomal ...
    This proof-of-principle study demonstrates the ability of OGM to detect nearly all types of chromosomal aberrations.
  56. [56]
    Optical genome mapping enhances cytogenetic analysis in ...
    May 29, 2025 · Optical genome mapping (OGM) is a next-generation cytogenetic technique that may be beneficial for detecting subtle structural chromosomal alterations.
  57. [57]
    [PDF] Agilent Oligonucleotide Array-Based CGH for Single Cell Analysis
    May 2, 2012 · This chapter describes the steps to hybridize, wash and scan. Agilent CGH microarrays and to extract data using the Agilent. Feature Extraction ...
  58. [58]
    A Robust Method to Analyze Copy Number Alterations of Less than ...
    Jun 25, 2013 · Here, we provide a protocol for array comparative genomic hybridization (aCGH) of single cells. The protocol is based on an established adapter- ...Missing: post- | Show results with:post-
  59. [59]
    Analysis of Copy‐Number Alterations in Single Cells Using ...
    Dec 1, 2014 · In this unit, we describe a workflow that enables array comparative genomic hybridization (aCGH) of single cells. The unit first describes ...
  60. [60]
    Automation in an Array Comparative Genomic Hybridization ...
    We investigate an automated hybridization station to improve microarray hybridization, an autoloading scanner to facilitate the rapid scanning of multiple ...
  61. [61]
    FACETS: allele-specific copy number and clonal heterogeneity ...
    First, it provides a much more comprehensive identification of copy number aberrations including copy-neutral loss-of-heterozygosity (LOH) events not detectable ...
  62. [62]
    Methods for copy number aberration detection from single-cell DNA ...
    Aug 17, 2020 · In this review, we review eight methods that have been developed for detecting CNAs in scDNAseq data, and categorize them according to the steps of a seven- ...
  63. [63]
    The clinical benefit of array-based comparative genomic ...
    Jul 23, 2019 · Our results confirm the benefit of array-CGH in the current clinical genetic diagnostics leading to identification of the genetic cause of ID/DD in affected ...
  64. [64]
    Droplets for Gene Editing Using CRISPR-Cas9 and Clonal Selection ...
    Mar 19, 2024 · This study presents a novel method that seamlessly integrates CRISPR-Cas9 gene editing techniques with single-cell isolation methods in induced pluripotent ...