Clique problem
In graph theory, the clique problem, often referred to as the maximum clique problem, is the computational task of determining whether an undirected graph contains a clique—a subset of vertices in which every pair of distinct vertices is adjacent—of size at least a given integer k.[1] The decision version asks whether the graph contains a complete subgraph isomorphic to the complete graph K_k, while the optimization variant seeks the largest such clique.[1] The problem is defined on finite, simple undirected graphs, that is, graphs without self-loops or multiple edges between vertices.[2]
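A candidate subset can be checked against this definition in time quadratic in its size, which is the verification step underlying the problem's membership in NP (discussed below). The following minimal Python sketch illustrates the check; the adjacency-dictionary representation and the name is_clique are illustrative assumptions, not taken from the cited sources:

    from itertools import combinations

    def is_clique(adj, subset):
        # adj maps each vertex to the set of its neighbors (simple,
        # undirected graph: symmetric adjacency, no self-loops).
        # Checks all O(|subset|^2) pairs, the NP verification step.
        return all(v in adj[u] for u, v in combinations(subset, 2))

    # Toy graph: vertices 0..3, with {0, 1, 2} forming a triangle.
    adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
    print(is_clique(adj, {0, 1, 2}))  # True  -> "yes" for k = 3
    print(is_clique(adj, {0, 1, 3}))  # False -> not a clique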
The clique problem holds a central place in computational complexity theory, as its decision version was proven NP-complete by Richard Karp in 1972 through a polynomial-time reduction from the satisfiability problem (SAT).[1] This places it among the first 21 problems Karp identified as NP-complete, highlighting its intractability for general graphs, where no polynomial-time algorithm is known unless P = NP.[1] The optimization version is NP-hard, and exact solutions typically require exponential-time algorithms, such as branch-and-bound methods or dynamic programming approaches that enumerate potential cliques.[2] Approximation algorithms exist, but no polynomial-time algorithm can approximate the maximum clique within a factor of n^{1-\epsilon} for any \epsilon > 0 unless P = NP.[2]
Due to its hardness, practical solutions for the clique problem often rely on heuristics, metaheuristics like genetic algorithms or tabu search, and exact solvers tailored to specific graph classes, such as perfect graphs, in which the clique number equals the chromatic number for every induced subgraph.[2] The complement of the clique problem is the independent set problem, and the two are duals in many graph optimization contexts.[2] Variants include the maximum clique enumeration problem, which identifies all maximum-sized cliques, and the clique cover problem, which partitions the vertices into the minimum number of cliques.[3]
The clique problem finds diverse applications across fields, modeling scenarios where fully interconnected groups are sought. In bioinformatics, it aids in protein structure prediction by identifying densely connected residue clusters and analyzing metabolic or gene regulatory networks to detect functional modules.[3] In social network analysis, cliques represent tightly knit communities or fault-tolerant clusters in communication graphs.[2] Other uses include chemoinformatics for molecular similarity searches, scheduling problems in operations research, and fault diagnosis in electronics, where cliques correspond to minimal sets of interacting components.[2]
Definitions
Basic concepts
In graph theory, an undirected simple graph G = (V, E) consists of a finite set V of vertices and a set E of edges, where each edge is an unordered pair of distinct vertices from V, with no multiple edges between the same pair of vertices and no loops joining a vertex to itself.[4] A clique in such a graph G is a subset of vertices in which every pair of distinct vertices is connected by an edge, equivalently forming a complete induced subgraph.[5] A k-clique refers specifically to a clique containing exactly k vertices.[6]
A clique is maximal if it is not properly contained in any larger clique, meaning no additional vertex from V can be added while preserving the complete subgraph property.[6] In contrast, a maximum clique is a clique of largest possible size in G, and the clique number \omega(G) denotes this maximum size.[5] Cliques relate to independent sets through the graph complement: the complement graph \overline{G} has the same vertex set V but edges between pairs of vertices that are non-adjacent in G, and a set of vertices forms a clique in G if and only if it forms an independent set in \overline{G}, where an independent set is a subset of vertices with no edges between them.[5]
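The complement correspondence can be made concrete. The short Python sketch below (the helper names are illustrative, not from the cited sources) builds the complement of a toy graph and checks the clique/independent-set equivalence:

    def complement(adj):
        # Same vertices; edges between exactly the non-adjacent pairs.
        vertices = set(adj)
        return {u: vertices - adj[u] - {u} for u in adj}

    def is_independent_set(adj, subset):
        # No two vertices of `subset` may be adjacent.
        subset = list(subset)
        return all(subset[j] not in adj[subset[i]]
                   for i in range(len(subset))
                   for j in range(i + 1, len(subset)))

    adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}, 3: set()}  # triangle + isolated vertex
    comp = complement(adj)
    # {0, 1, 2} is a clique in adj, hence an independent set in comp:
    print(is_independent_set(comp, {0, 1, 2}))  # True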
The maximum clique problem is formally defined as follows: given an undirected simple graph G = (V, E), find a clique in G of maximum cardinality, or equivalently, compute \omega(G).[7]
Problem variants
The decision version of the clique problem, known as the k-clique problem, asks whether an undirected graph G = (V, E) contains a clique of size at least k. The input consists of the graph G and a positive integer k, and the output is "yes" if such a clique exists and "no" otherwise.[1]
The optimization version, called the maximum clique problem, seeks to identify the largest clique in the graph G, either by determining its size (the clique number ω(G)) or by finding the actual vertex set forming that clique. A clique is defined as a subset of vertices where every pair is adjacent, and the objective is to maximize the cardinality of this subset.[8]
The search version of the clique problem requires explicitly outputting the vertices of a maximum clique or a k-clique if one exists in the graph G. This formulation extends the decision version by demanding a constructive solution rather than a mere yes/no answer.[9]
A related variant is the clique cover problem, specifically the vertex clique cover, which aims to find the minimum number of cliques whose union covers all vertices of the graph G. Each vertex must belong to at least one clique in the cover, and the goal is to minimize the number of such cliques. The clique cover number equals the chromatic number of the complement graph.[10]
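This equality suggests computing clique covers by coloring the complement. The Python sketch below does exactly that on a toy graph; the names are illustrative, and greedy coloring only upper-bounds the clique cover number rather than computing it exactly:

    def greedy_coloring(adj, order):
        # Assign each vertex the smallest color unused by its neighbors.
        color = {}
        for v in order:
            taken = {color[u] for u in adj[v] if u in color}
            c = 0
            while c in taken:
                c += 1
            color[v] = c
        return color

    def clique_cover(adj):
        # Each color class of the complement induces a clique in adj.
        vertices = set(adj)
        comp = {u: vertices - adj[u] - {u} for u in adj}
        color = greedy_coloring(comp, sorted(adj))
        cover = {}
        for v, c in color.items():
            cover.setdefault(c, set()).add(v)
        return list(cover.values())

    # Path 0-1-2-3: covered by two cliques (edges), e.g. {0,1} and {2,3}.
    adj = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2}}
    print(clique_cover(adj))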
The clique problem is closely related to the independent set problem through the complement graph: a clique in G corresponds exactly to an independent set (a set of vertices with no edges between them) in the complement graph Ḡ, where edges exist between non-adjacent vertices of G. Thus, the maximum clique in G is equivalent to the maximum independent set in Ḡ.[11]
The decision and search versions of the clique problem are equivalent via self-reducibility, meaning the search version can be solved in polynomial time using an oracle for the decision version. To find a k-clique, repeatedly attempt to delete a vertex and query the oracle on the remaining graph: keep the deletion whenever a k-clique survives, and retain the vertex otherwise. The vertices that can never be deleted form exactly a k-clique, and the procedure makes only polynomially many oracle calls, so it runs in polynomial time whenever the oracle does, as sketched below.[9]
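A minimal Python rendering of this search-to-decision reduction, using a brute-force stand-in for the hypothetical polynomial-time oracle (all names here are illustrative; only the pattern of oracle calls matters):

    from itertools import combinations

    def has_k_clique(adj, k):
        # Brute-force decision oracle used only for illustration.
        return any(all(v in adj[u] for u, v in combinations(s, 2))
                   for s in combinations(sorted(adj), k))

    def find_k_clique(adj, k):
        # Delete a vertex whenever a k-clique still survives; every vertex
        # that can never be deleted lies in every remaining k-clique, so
        # the survivors are exactly a k-clique. O(n) oracle calls.
        if not has_k_clique(adj, k):
            return None
        adj = {u: set(vs) for u, vs in adj.items()}   # work on a copy
        for v in sorted(adj):
            without_v = {u: vs - {v} for u, vs in adj.items() if u != v}
            if has_k_clique(without_v, k):
                adj = without_v                        # v is expendable
        return set(adj)

    adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
    print(find_k_clique(adj, 3))  # {0, 1, 2}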
Background
Historical development
The study of cliques in graph theory traces its origins to early 20th-century work on combinatorial structures. In 1930, Frank P. Ramsey published a foundational result in his paper "On a Problem of Formal Logic," demonstrating that sufficiently large structures inevitably contain monochromatic cliques under edge colorings, laying groundwork for understanding guaranteed clique existence in dense graphs.[12] This theorem, which has a natural graph-theoretic interpretation involving monochromatic cliques in edge-colored complete graphs, highlighted cliques as key objects in extremal combinatorics. Building on these ideas, Pál Turán's 1941 theorem addressed the maximum number of edges in graphs avoiding cliques of a given size, introducing Turán graphs as extremal constructions and initiating systematic analysis of clique-free graphs.[13] The term 'clique' was introduced to graph theory by R. Duncan Luce and Albert D. Perry in 1949, who used it to describe complete subgraphs in their analysis of social networks.[14]
The mid-20th century saw further advancements in quantifying cliques. In 1965, J.W. Moon and L. Moser established tight bounds on the maximum number of maximal cliques in an n-vertex graph, proving it is at most 3^{n/3}, which provided critical insights into clique enumeration complexity.[15] The clique problem was formally recognized as a computational challenge in 1972 when Richard M. Karp included it in his seminal list of 21 NP-complete problems, showing its reducibility from other hard problems like 3-SAT and emphasizing its role in complexity theory.[16] This classification was solidified in 1979 by Michael R. Garey and David S. Johnson's influential textbook "Computers and Intractability," which cataloged the clique problem as NP-complete and discussed its implications for algorithm design.
Algorithmic milestones emerged in the 1970s, with Coen Bron and Joep Kerbosch introducing in 1973 a backtracking procedure for enumerating all maximal cliques, which remains a benchmark for exact enumeration due to its efficiency on sparse graphs.[17] The 1990s marked the rise of parameterized complexity, where Rodney G. Downey and Michael R. Fellows developed a framework analyzing clique problems with respect to parameters like clique size k, classifying k-clique as W[1]-complete and distinguishing tractable cases.[18] Recent progress includes hybrid branching approaches for maximal clique enumeration, as proposed in 2024, enhancing scalability for large graphs in practical solvers.[19]
Applications
In social network analysis, cliques represent tightly knit groups or communities where every member is connected to every other, such as friend circles in platforms like Facebook, enabling the detection of cohesive subgroups for influence maximization or recommendation systems.[20] For instance, enumerating maximal cliques in large-scale social graphs helps identify collaborative teams or event participants by revealing dense substructures that reflect strong interpersonal ties.
In bioinformatics, finding cliques in protein-protein interaction networks identifies functional modules or protein complexes, where a clique corresponds to a set of proteins that physically interact with each other, aiding in the prediction of biological pathways and drug targets.[21] Algorithms that detect dense cliques, often extended to near-cliques via transitive closures, have been applied to sparse interaction data to uncover overlapping complexes in yeast and human networks.[22]
In VLSI design, the maximum clique problem arises in channel routing, where the horizontal net compatibility graph's clique cover determines the minimum number of tracks needed to route nets without overlaps, optimizing chip layout efficiency.[23] Similarly, in register allocation, the interference graph's maximum clique size provides a lower bound on the minimum registers required, as mutually interfering variables cannot share a register, influencing compiler optimizations for high-performance circuits.[24]
In coding theory, cliques in the Johnson graph correspond to constant-weight binary codes, where the size of the maximum clique yields upper bounds on the maximum number of codewords with fixed weight and minimum distance, directly impacting the design of error-correcting codes for reliable data transmission.[25] This connection has led to heuristic constructions using clique-finding techniques to generate optimal or near-optimal constant-weight codes for applications like combinatorial designs.[26]
In scheduling, the clique number of a task conflict graph serves as a lower bound on the minimum number of processors required in multiprocessor systems, as it represents the maximum set of mutually incompatible tasks that cannot run concurrently, guiding resource allocation in parallel computing environments.[27]
Recent applications include clique interdiction in network security, where removing a limited number of vertices minimizes the largest remaining clique to disrupt tightly connected threat groups, such as terrorist cells in communication networks, with formulations showing NP-hardness and branch-and-bound solutions.[28] Additionally, in 2020s data mining, summarization of maximal cliques compresses the output of enumeration algorithms by merging overlapping cliques into representative patterns, facilitating scalable analysis in large graphs for anomaly detection and pattern discovery.[29]
Algorithms
Exact algorithms for maximum cliques
Exact algorithms for the maximum clique problem aim to compute the largest clique in a graph by exhaustively searching the solution space while employing pruning strategies to eliminate unpromising branches. Branch-and-bound techniques form the cornerstone of these methods, where the search tree is constructed by sequentially adding vertices to a partial clique, ensuring each added vertex is adjacent to all previous ones in the clique. To prune effectively, upper bounds on the possible clique size in subproblems are computed, often using the size of the candidate vertex set or more sophisticated estimates like the minimum number of colors needed in a greedy coloring of the induced subgraph on remaining vertices.[30]
A classic implementation is the branch-and-bound algorithm of Carraghan and Pardalos (1990), which orders vertices by degree and applies a depth-first search, updating the maximum clique size whenever a larger clique is found; this simple yet effective approach has served as a baseline for subsequent improvements. Later refinements, such as Tomita and Seki's MCQ algorithm, add greedy-coloring upper bounds to tighten pruning, and bit-parallel variants represent neighborhoods as bitsets, allowing rapid set intersections during clique extension and strong performance in practice.
Dynamic programming over subsets provides an alternative exact method suitable for small graphs, adapting principles similar to the Held-Karp algorithm for the traveling salesman problem. In this framework, states represent subsets of vertices, and the recurrence computes the maximum clique size by considering whether a subset induces a clique, with transitions based on adding vertices adjacent to the current subset; verification of clique status requires checking all pairs, leading to a time complexity of O(2^n n^2) in the naive form, though optimizations via adjacency bitsets can reduce practical constants. Inclusion-exclusion principles can further refine this by avoiding redundant subset checks, but the approach remains exponential and is typically limited to graphs with up to around 40 vertices.[31]
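A minimal bitmask sketch of this subset dynamic programming in Python, feasible only for very small n (the function name and the bitset representation are illustrative assumptions):

    def max_clique_bitmask(n, edges):
        # nbr[v] is v's neighborhood as a bitmask. A set S is a clique iff
        # S minus its lowest vertex v is a clique and nbr[v] contains the
        # rest, giving an O(2^n) scan with O(1) work per subset.
        nbr = [0] * n
        for u, v in edges:
            nbr[u] |= 1 << v
            nbr[v] |= 1 << u
        is_clique = [False] * (1 << n)
        is_clique[0] = True
        best, best_set = 0, 0
        for s in range(1, 1 << n):
            v = (s & -s).bit_length() - 1    # lowest-indexed vertex in s
            rest = s & (s - 1)               # s without v
            if is_clique[rest] and rest & nbr[v] == rest:
                is_clique[s] = True
                if s.bit_count() > best:     # int.bit_count(): Python 3.10+
                    best, best_set = s.bit_count(), s
        return [v for v in range(n) if best_set >> v & 1]

    # 4-cycle plus the chord 0-2: the maximum clique is the triangle {0, 1, 2}.
    print(max_clique_bitmask(4, [(0, 1), (1, 2), (2, 3), (3, 0), (0, 2)]))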
Recent advancements include hybrid solvers that integrate metaheuristics like tabu search with exact branching to enhance exploration and pruning. For instance, a 2020 method embeds an exact branch-and-bound solver within a tabu search framework to solve subproblems arising from heuristic moves, improving solution quality on challenging instances through diversified searches followed by provable optimality checks. Improvements in pruning often involve graph reductions, such as vertex elimination schemes that remove vertices with degree less than the current best clique size minus one or dominated vertices whose neighborhoods are subsets of others, thereby shrinking the search space before branching.[32]
Theoretically, the best-known exact algorithms achieve time complexities of O(1.20^n), as in optimized branch-and-bound variants with advanced reduction rules and bounding functions. In practice, these methods excel on DIMACS benchmark instances, where MCQ and its derivatives solve many graphs with hundreds of vertices in seconds, though hard instances like brock400_1 remain challenging, requiring hours or more on modern hardware.[33]
A simple backtracking algorithm for the maximum clique can be outlined as follows, assuming vertices are ordered v_1, v_2, \dots, v_n in decreasing degree:
function MaxClique(current_clique, candidate_set, best_clique):
    if |current_clique| + |candidate_set| <= |best_clique|:
        return  // prune: even taking every candidate cannot beat the best
    if |current_clique| > |best_clique|:
        best_clique = current_clique
    for each v in candidate_set (in order):
        // candidates that follow v in the ordering and are adjacent to v
        new_candidates = {u in candidate_set : u after v} ∩ N(v)
        MaxClique(current_clique ∪ {v}, new_candidates, best_clique)
This pseudocode initializes with an empty clique and full vertex set, pruning via a basic upper bound and updating the global best; enhancements add tighter bounds and reductions for efficiency.[34]
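A runnable Python version of the backtracking scheme above, under the assumption that the graph is given as a symmetric adjacency dictionary (function and variable names are illustrative):

    def max_clique(adj):
        # Branch and bound: `clique` is the current partial clique, `cands`
        # the vertices adjacent to all of it, kept in decreasing-degree order.
        order = sorted(adj, key=lambda v: len(adj[v]), reverse=True)
        best = []

        def expand(clique, cands):
            nonlocal best
            if len(clique) + len(cands) <= len(best):   # upper-bound prune
                return
            if len(clique) > len(best):
                best = list(clique)
            for i, v in enumerate(cands):
                # Candidates after v, restricted to v's neighborhood.
                expand(clique + [v], [u for u in cands[i + 1:] if u in adj[v]])

        expand([], order)
        return best

    adj = {0: {1, 2, 3, 4}, 1: {0, 2, 3}, 2: {0, 1, 3}, 3: {0, 1, 2}, 4: {0}}
    print(max_clique(adj))  # the 4-clique {0, 1, 2, 3}, in some order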
Enumeration of maximal cliques
The enumeration of maximal cliques involves listing all subsets of vertices that induce complete subgraphs and cannot be extended by adding another vertex while preserving completeness. A seminal approach is the Bron–Kerbosch algorithm, introduced in 1973, which employs a recursive backtracking procedure to systematically explore potential cliques without duplicates. The algorithm maintains three sets: R (the current clique), P (vertices adjacent to all in R that could extend it), and X (vertices already excluded). In its basic form without pivoting, it reports R as maximal when both P and X are empty, then recursively branches on vertices in P to build larger candidates, updating the sets by intersections with neighborhoods.
To enhance efficiency, the original algorithm incorporates pivoting: at each step, a pivot vertex u is selected from P ∪ X, and only the vertices in P outside the neighborhood N(u) need to be branched on, since cliques avoiding all of them are found within the pivot's branch. With a well-chosen pivot, the worst-case running time is O(3^{n/3}), matching the Moon–Moser bound on the number of maximal cliques and hence optimal for enumeration. Pseudocode for the pivoting version is as follows:
procedure BronKerbosch(R, P, X)
    if P = ∅ and X = ∅ then
        report R as a maximal clique
    choose a pivot u ∈ P ∪ X
    for each v ∈ P \ N(u) do
        BronKerbosch(R ∪ {v}, P ∩ N(v), X ∩ N(v))
        P := P \ {v}
        X := X ∪ {v}
A common optimization preprocesses the graph with a degeneracy ordering, where vertices are ordered v_1, ..., v_n such that each v_i has at most d (the degeneracy) neighbors later in the order; the algorithm then processes in reverse order, restricting candidates to subsequent vertices, which bounds branching and improves practical performance on sparse graphs.[17][35]
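A runnable Python rendering of the pivoting procedure above, without the degeneracy-ordering preprocessing (identifiers are illustrative; the pivot rule maximizing |P ∩ N(u)| is the common heuristic choice):

    def bron_kerbosch(adj):
        # R: current clique, P: candidate extensions, X: already-processed
        # vertices (prevents reporting non-maximal or duplicate cliques).
        def recurse(R, P, X):
            if not P and not X:
                yield set(R)
                return
            # Pivot u maximizing |P ∩ N(u)| minimizes the branching below.
            u = max(P | X, key=lambda w: len(P & adj[w]))
            for v in list(P - adj[u]):
                yield from recurse(R | {v}, P & adj[v], X & adj[v])
                P = P - {v}
                X = X | {v}
        return recurse(set(), set(adj), set())

    # Two triangles sharing the edge 1-2: maximal cliques {0,1,2} and {1,2,3}.
    adj = {0: {1, 2}, 1: {0, 2, 3}, 2: {0, 1, 3}, 3: {1, 2}}
    for clique in bron_kerbosch(adj):
        print(sorted(clique))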
Output-sensitive algorithms measure time relative to the number of maximal cliques produced. A fundamental bound, established by Moon and Moser in 1965, is that an n-vertex graph has at most 3^{n/3} maximal cliques, with the bound attained by the Moon–Moser graphs (the complete multipartite graphs K_{3,3,\dots,3}); thus, enumeration algorithms aim for O(3^{n/3} poly(n)) total time. Recent improvements in the 2020s, such as hybrid branching methods, achieve better practical delays on large real-world graphs by combining pivoting with graph reductions, though worst-case bounds remain tied to the Moon–Moser limit.[36]
The reverse search technique provides an alternative framework for implicit enumeration, avoiding explicit storage of all cliques and ensuring no duplicates via a parent-child tree structure over the output set. Introduced in 1996 for general combinatorial enumeration, it applies to maximal cliques by defining a local search from each clique to a unique parent (e.g., via removing a vertex with minimal degree), enabling traversal in constant amortized time per output after an initial setup. This method underpins efficient implementations for large outputs, such as in matrix-multiplication-based algorithms achieving O(M(n)) delay per clique, where M(n) is the matrix multiplication time.[37]
Enumeration of maximal cliques overlaps with frequent itemset mining in data mining, where transactions form a bipartite graph and maximal frequent itemsets correspond to maximal bicliques (or cliques in the one-sided projection); algorithms like Bron–Kerbosch variants are adapted for this, reducing the need to store all subsets. For handling large outputs (potentially exponential), streaming techniques process cliques on-the-fly without full storage, using bounded memory and polynomial delay to output them sequentially, as in reverse search or partition-based methods.[38]
In terms of complexity, maximal clique enumeration is worst-case exponential due to the 3^{n/3} output size, but polynomial-delay algorithms exist that spend O(poly(n)) time per clique, such as those using fast matrix multiplication or lexicographic ordering; for example, one variant lists them with O(n^ω) delay, where ω < 2.373 is the matrix exponent. In sparse graphs or bounded degeneracy, delays can be near-linear per output.[39]
Algorithms for special graph classes
In graphs belonging to certain special classes with structural restrictions, the maximum clique problem can be solved exactly in polynomial time, in contrast to the general case where it is NP-hard. These classes often exhibit properties like perfectness or bounded treewidth that allow exploitation of the graph structure for efficient algorithms. Seminal results leverage optimization techniques or graph decompositions to compute the clique number ω(G) or identify a maximum clique directly.
For perfect graphs, where the clique number equals the chromatic number for every induced subgraph, the maximum clique can be computed in polynomial time using the ellipsoid method applied to the separation problem for the stable set polytope. This approach, developed by Grötschel, Lovász, and Schrijver, yields an algorithm running in time polynomial in the input size, roughly O(n^{6.5}) for n-vertex graphs based on the ellipsoid method's complexity at the time. Alternatively, semidefinite programming formulations based on the Lovász theta function, building on the same theoretical foundation, provide practical implementations for computing ω(G) in perfect graphs, exploiting the equality χ(G) = ω(G) without explicit enumeration.
Chordal graphs, a subclass of perfect graphs characterized by the absence of induced cycles of length four or more, admit linear-time algorithms for finding a maximum clique. These rely on a perfect elimination ordering (PEO), which can be obtained via maximum cardinality search in O(n + m) time, where n is the number of vertices and m the number of edges. Gavril's algorithm processes vertices in reverse PEO order, maintaining the neighborhood intersection to identify the largest clique containing each vertex, achieving overall O(n + m) time complexity. Clique trees, junction trees representing the maximal cliques and their intersections, further enable linear-time computation by traversing the tree structure to aggregate clique sizes.
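A compact Python sketch of this approach, combining maximum cardinality search with Gavril's reverse-PEO scan (names are illustrative, and the input is assumed to be chordal; no validity check is performed):

    def max_clique_chordal(adj):
        # MCS: repeatedly number an unnumbered vertex with the most numbered
        # neighbors; the reverse numbering is a perfect elimination ordering.
        weight = {v: 0 for v in adj}
        order, numbered = [], set()
        for _ in range(len(adj)):
            v = max((u for u in adj if u not in numbered),
                    key=lambda u: weight[u])
            order.append(v)
            numbered.add(v)
            for w in adj[v]:
                if w not in numbered:
                    weight[w] += 1
        peo = order[::-1]                 # reverse MCS order = PEO
        pos = {v: i for i, v in enumerate(peo)}
        best = set()
        for i, v in enumerate(peo):
            # In a PEO, each vertex plus its later neighbors is a clique,
            # and some vertex realizes the maximum clique (Gavril).
            clique = {v} | {u for u in adj[v] if pos[u] > i}
            if len(clique) > len(best):
                best = clique
        return best

    # A chordal graph: triangle {0,1,2} with pendant edges 2-3 and 3-4.
    adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2, 4}, 4: {3}}
    print(sorted(max_clique_chordal(adj)))  # [0, 1, 2]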
Interval graphs, which represent intersections of intervals on a line and are a subclass of chordal graphs, support a specialized O(n log n) greedy algorithm for the maximum clique. Given interval representations, sort the 2n endpoints and perform a sweep-line traversal: increment a counter for left endpoints and decrement for right endpoints, tracking the maximum overlap, which equals ω(G). This directly identifies the size and a point of maximum overlap to extract the clique, with the sorting step dominating the time. As interval graphs are chordal, the general linear-time PEO method also applies if no representation is provided, but the endpoint-based approach is more direct.
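The sweep can be written in a few lines of Python; the representation of intervals as (left, right) pairs and the function name are illustrative:

    def interval_clique_number(intervals):
        # +1 at a left endpoint, -1 at a right endpoint; the running count
        # is the number of intervals covering the sweep point, and its
        # maximum equals the clique number of the interval graph. At equal
        # coordinates, left endpoints sort first (key 0 < 1), so touching
        # closed intervals count as overlapping.
        events = []
        for lo, hi in intervals:
            events.append((lo, 0))
            events.append((hi, 1))
        events.sort()
        depth = best = 0
        for _, kind in events:
            depth += 1 if kind == 0 else -1
            best = max(best, depth)
        return best

    # Three mutually overlapping intervals and one disjoint: clique number 3.
    print(interval_clique_number([(0, 5), (1, 6), (2, 3), (8, 9)]))  # 3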
In bipartite graphs, the maximum clique size is at most 2, since a triangle is an odd cycle and bipartite graphs contain no odd cycles. The problem is therefore trivial: ω(G) = 2 if the graph has at least one edge and 1 otherwise, which can be read off in constant time given the edge count, or in O(m) time by scanning the edge list. No dedicated algorithm is needed.
For planar graphs, the maximum clique problem is solvable in polynomial, indeed linear, time: a planar graph contains no K_5 subgraph, so ω(G) ≤ 4, and all cliques on at most four vertices can be enumerated in O(n) time by exploiting the bounded degeneracy of planar graphs. More generally, for fixed k, a k-clique can be detected in linear time using color-coding. Alon, Yuster, and Zwick's method colors vertices randomly with k colors and searches for a colorful K_k subgraph, derandomizable via dynamic programming on the color classes; in planar graphs, sparsity and separator properties yield O(n) expected time for fixed k.
Recent characterizations of clique-sparse graph classes, in which the number of maximal cliques is O(n^c) for a constant c, enable faster enumeration of all maximal cliques than the general exponential bound. Defined via forbidden induced subgraphs, these classes include graphs with limited overlap among maximal cliques, allowing output-sensitive algorithms that run in time polynomial in n and in the number of cliques produced.
Approximation and heuristics
The maximum clique problem lacks polynomial-time approximation schemes, motivating the development of heuristics and approximation algorithms that deliver high-quality solutions efficiently on general graphs. These methods are particularly valuable for large-scale instances where exact algorithms falter due to exponential time complexity. Greedy heuristics provide simple, fast baselines, while metaheuristics explore broader solution spaces; approximation algorithms offer theoretical guarantees, albeit weak ones given the problem's inapproximability. Practical implementations often integrate these with modern techniques like machine learning for enhanced performance on benchmarks such as DIMACS graphs.
Greedy heuristics construct cliques incrementally by prioritizing vertices likely to extend the current set. A standard sequential greedy approach begins with a single vertex and repeatedly adds the candidate with the highest degree within the induced subgraph of remaining candidates, terminating when no addition is possible; this often yields a large clique quickly but may become trapped in a local optimum.[40] Local search variants improve upon this by permitting swaps or deletions of vertices in the current clique to explore nearby solutions, with randomized restarts to diversify initialization and boost solution quality.[41] For instance, adaptive randomized greedy methods adjust selection probabilities based on prior iterations, achieving empirically near-optimal ratios on sparse graphs.[42]
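A minimal Python sketch of such a randomized greedy heuristic with restarts (the names, restart count, and tie-breaking are illustrative choices; no approximation guarantee is implied):

    import random

    def greedy_clique(adj, restarts=50, seed=0):
        # Grow a clique from a random start vertex, always adding the
        # candidate with the most neighbors among the remaining candidates;
        # restarts diversify the start to escape poor local optima.
        rng = random.Random(seed)
        best = []
        for _ in range(restarts):
            v = rng.choice(list(adj))
            clique, cands = [v], set(adj[v])
            while cands:
                u = max(cands, key=lambda w: len(adj[w] & cands))
                clique.append(u)
                cands &= adj[u]
            if len(clique) > len(best):
                best = clique
        return best

    adj = {0: {1, 2, 3, 4}, 1: {0, 2, 3}, 2: {0, 1, 3}, 3: {0, 1, 2}, 4: {0}}
    print(greedy_clique(adj))  # finds the 4-clique {0, 1, 2, 3}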
Metaheuristics extend greedy and local search by incorporating global optimization strategies to escape local maxima. Genetic algorithms represent cliques as bitstrings or vertex sets, evolving populations through selection, crossover (e.g., combining subsets of parent cliques), and mutation (random vertex flips), with fitness measured by clique size; seminal implementations demonstrate robustness on random and structured graphs. [43] Tabu search builds on local search by maintaining a tabu list of prohibited moves to prevent cycling, iteratively perturbing the current clique while respecting short-term memory; a hybrid tabu search incorporating exact solvers for subproblems has shown superior runtime on DIMACS instances compared to pure heuristics. [44] Recent 2024 hybrids, such as elite genetic algorithms fused with tabu search, outperform standalone metaheuristics on benchmark suites by balancing exploration and intensification. [45]
Approximation algorithms provide worst-case guarantees, though these are necessarily weak given the hardness results. Coloring-based methods that select vertices from the color classes of a heuristic coloring achieve factors on the order of O(n / \log^2 n), a bound due to Boppana and Halldórsson; the ratio, while polynomial, remains close to trivial on dense graphs.[46] Semidefinite programming (SDP) relaxations offer improved empirical approximations by formulating the clique problem as a quadratic program over the unit sphere and relaxing integrality to a convex semidefinite cone solvable via interior-point methods; rounding the SDP solution yields lower bounds competitive with greedy methods on moderate instances, though scalability limits their use.[47]
Practical tools leverage branch-and-bound (BnB) frameworks with heuristic pruning for near-exact results in feasible time. The BBMC algorithm uses bitstring representations for efficient neighborhood checks and coloring bounds to prune search trees, excelling on sparse DIMACS graphs; recent variants integrate machine learning, such as MCQD-ML, which applies dynamic neural networks to predict and prune unpromising branches based on subgraph features. [48] [31] Unsupervised ML for vertex ordering further accelerates BnB by prioritizing high-potential nodes, reducing explored nodes by up to 50% on hard benchmarks. [49]
Empirical evaluations on DIMACS benchmarks highlight the efficacy of these approaches, with metaheuristics like dynamic local search achieving approximation ratios of 0.95–0.99 (i.e., within 1–5% of optimal) on challenging instances such as c-fat200-5 and brock400_2, often in seconds versus hours for exact methods. [41] [50] No constant-factor approximation algorithm exists unless P=NP, underscoring the heuristic nature of these solutions. [51]
Complexity
NP-completeness
The decision version of the clique problem—determining whether an undirected graph G = (V, E) contains a clique of size at least k—is in NP. A certificate consists of a subset S \subseteq V with |S| = k, which can be verified in polynomial time by checking that every pair of vertices in S is adjacent in G, requiring O(k^2) edge queries.[1]
The problem is NP-hard, as established by a polynomial-time many-one reduction from the vertex cover problem, which is known to be NP-complete. Given a vertex cover instance consisting of a graph G = (V, E) and an integer l, construct the complement graph G_C = (V, E_C), where E_C = \{(u, v) \mid u, v \in V, u \neq v, (u, v) \notin E\}, and set k = |V| - l = n - l; this construction takes O(n^2) time by enumerating all vertex pairs. The reduction preserves yes-instances: G has a vertex cover of size l if and only if G_C has a clique of size k. To see this, suppose U is a vertex cover of size l in G; then no edge of G joins two vertices of V \setminus U, so V \setminus U is an independent set in G and hence a clique of size n - l in G_C. Conversely, if K is a clique of size k in G_C, then K is independent in G, so every edge of G is incident to at least one vertex of V \setminus K, making V \setminus K a vertex cover of size n - k = l. Since vertex cover reduces to clique in polynomial time and clique is in NP, clique is NP-complete; this equivalence between clique, independent set, and vertex cover via complementation appears in Richard Karp's seminal 1972 paper.[1]
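The complementation step is mechanical; the Python sketch below builds the paired instance and checks the equivalence on a toy graph with a brute-force vertex cover test (all names are illustrative, and the brute-force check is only for demonstration):

    from itertools import combinations

    def to_vertex_cover_instance(adj, k):
        # Karp-style complementation: (G, k) for CLIQUE pairs with
        # (complement of G, n - k) for VERTEX COVER.
        vertices = set(adj)
        comp = {u: vertices - adj[u] - {u} for u in adj}
        return comp, len(vertices) - k

    def has_vertex_cover(adj, budget):
        # Brute force, for illustrating the equivalence on toy inputs only.
        edges = {frozenset((u, v)) for u in adj for v in adj[u]}
        return any(all(e & set(s) for e in edges)
                   for size in range(budget + 1)
                   for s in combinations(sorted(adj), size))

    # Triangle plus an isolated vertex: it has a 3-clique, so the complement
    # must have a vertex cover of size n - k = 4 - 3 = 1.
    adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}, 3: set()}
    comp, budget = to_vertex_cover_instance(adj, 3)
    print(has_vertex_cover(comp, budget))  # True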
An equivalent NP-hardness proof follows from reducing the independent set problem (also NP-complete) to clique via the graph complement, as a clique in G corresponds to an independent set in G_C. Vertex cover itself reduces from 3-SAT, but the direct link via complement establishes the hardness chain.[1]
The NP-completeness of clique implies significant theoretical and practical consequences. If P = NP, then clique (and all NP problems) would be solvable in polynomial time on a deterministic Turing machine, collapsing the complexity classes. In practice, the intractability for large n (e.g., graphs with thousands of vertices) necessitates exponential-time algorithms or approximations, as no subexponential deterministic algorithm is known unless the exponential time hypothesis fails.[1]
The complementary decision problem—whether G has no clique of size at least k—is co-NP-complete, since its language is the complement of the NP-complete clique language. No polynomial-size certificate for the absence of a k-clique is known; the existence of one would place the problem in NP and imply NP = co-NP.[1]
Parameterized and fixed-parameter complexity
In parameterized complexity theory, a parameterized problem is fixed-parameter tractable (FPT) if there exists an algorithm solving it in time f(k) \cdot n^c, where n is the input size, k is the parameter, f is a computable function depending only on k, and c is a constant independent of k.[52] The k-Clique problem, which asks whether a graph contains a clique of size k and is parameterized by k, is a canonical example in this framework: Downey and Fellows showed it to be W[1]-complete, so it is widely believed not to be fixed-parameter tractable, and an f(k) \cdot n^{O(1)}-time algorithm for it would collapse the W-hierarchy. The color-coding technique of Alon, Yuster, and Zwick, which randomly colors the vertices with k colors and searches for a colorful copy of a pattern via dynamic programming over color subsets (derandomizable through universal hashing at a minor efficiency cost), yields 2^{O(k)} n^{O(1)}-time FPT algorithms for detecting paths and other bounded-treewidth subgraphs, but not for cliques.[53] Algorithms for k-Clique therefore rely on running times whose exponent depends on k.
Combinatorial algorithms avoiding fast matrix multiplication achieve running times such as O(n^k / \log^{k-1} n), improving on the brute-force O(n^k).[54] Algorithms leveraging fast matrix multiplication yield better bounds still, such as O(n^{\omega k / 3}) for k divisible by 3, where \omega \approx 2.37155 is the matrix multiplication exponent.[54] These algorithms lie in XP (solvable in n^{f(k)} time) rather than FPT, as the exponent grows with k, but they substantially outperform exhaustive enumeration for small k.
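For k = 3 this reduces to triangle detection via the trace of A^3. The NumPy sketch below (function name illustrative) shows the base case; the general k-clique algorithms apply the same test to an auxiliary graph whose vertices are the (k/3)-cliques of the input:

    import numpy as np

    def has_triangle(adj_matrix):
        # A graph has a triangle iff trace(A^3) > 0, since the i-th diagonal
        # entry of A^3 counts closed walks of length 3 through vertex i.
        # Fast matrix multiplication drives the O(n^omega) bound.
        a = np.asarray(adj_matrix)
        return np.trace(a @ a @ a) > 0

    square = [[0, 1, 0, 1],   # 4-cycle: bipartite, hence triangle-free
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [1, 0, 1, 0]]
    print(has_triangle(square))              # False
    square[0][2] = square[2][0] = 1          # add a chord, creating {0, 1, 2}
    print(has_triangle(square))              # True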
Certain variants share this fixed-parameter intractability. The Multicolored k-Clique problem, in which the vertices are pre-colored with k colors and a rainbow clique (one vertex per color) is sought, is likewise W[1]-complete and serves as a standard starting point for parameterized hardness reductions.[55] Both problems belong to the broader class XP, since a k-clique can be found by exhaustive enumeration in n^{O(k)} time, but under the standard assumption FPT \neq W[1] neither admits an FPT algorithm.
Recent advancements extend these techniques to restricted graph classes. For H-minor-free graphs, where H is a fixed excluded minor, structural decomposition theorems enable subexponential algorithms for k-Clique, running in 2^{O(\sqrt{k} \log k)} n^{O(1)} time when k is small relative to the exclusion bound.[56] In 2024, robust contraction decompositions for H-minor-free graphs further refined these results, yielding improved parameters for clique detection in sparse and bounded-expansion settings.[57]
Approximation hardness
The approximation hardness of the maximum clique problem refers to the theoretical barriers preventing efficient algorithms from achieving good approximation ratios for the size of the largest clique in a graph. Early results leveraging probabilistically checkable proofs (PCPs), beginning with the PCP theorem of Arora and Safra, established strong inapproximability: unless NP ⊆ ZPP, no polynomial-time algorithm approximates the maximum clique within a factor of n^{1-\epsilon} for some constant \epsilon > 0.[58] These results stem from reductions using interactive proofs and the PCP framework, and they rule out even modestly sublinear approximation factors.
Subsequent work strengthened these bounds. Håstad proved that for every \epsilon > 0, approximating the maximum clique within n^{1-\epsilon} is hard unless NP = ZPP.[59] This near-optimal inapproximability (up to the trivial factor of n) arises from tailored reductions from 3-SAT instances, incorporating long-code tests that amplify gaps in satisfiability into near-linear gaps in clique size. Further work based on label-cover problems, as in Feige's analysis, showed that no polynomial-time approximation better than n^{1 - o(1)} exists unless NP ⊆ DTIME(n^{O(\log \log n)}).[60] Zuckerman later derandomized Håstad's construction using linear-degree extractors, showing that approximating the maximum clique within n^{1-\epsilon} is NP-hard, assuming only P ≠ NP, for every \epsilon > 0.[61]
These results have significant implications for heuristic and approximation algorithm design. They establish that no polynomial-time method can guarantee better than nearly linear approximation ratios, setting realistic expectations for practical solvers, which often target O(n / \mathrm{polylog}\, n) performance on sparse or structured graphs. In the 2020s, ongoing research links these hardness bounds to the Unique Games Conjecture (UGC), suggesting that assuming UGC could yield even tighter inapproximability thresholds, potentially closing gaps between known algorithms and theoretical limits for clique-related problems.
Other lower bounds
In circuit complexity, the clique problem exhibits significant hardness for monotone computations. Razborov demonstrated that any monotone circuit computing the clique function requires superpolynomial size, a bound that Alon and Boppana strengthened to exponential, \exp(\Omega((n/\log n)^{1/3})), for detecting cliques of size \Theta(n^{2/3}) in n-vertex graphs.[62] These results imply that the problem is not solvable by polynomial-size monotone circuits, with implications for parallel computation models; lower bounds of this type were also used to separate monotone NC from monotone P.[63]
In decision tree models, lower bounds on query complexity highlight the cost of accessing graph edges to detect cliques. For k-clique detection, in the context of average-case analysis of bounded-depth circuits, Rossman established an \Omega(n^{k/4}) lower bound on the size of constant-depth circuits solving k-clique on random graphs, which extends to query-like models where edge probes simulate tree branches. In the sparse graph model, testing for triangles requires \Omega(\sqrt{m}) edge queries, where m is the number of edges, a tight bound in property-testing frameworks for graphs far from triangle-free.[64][65]
Time-space tradeoffs for clique detection leverage communication complexity to derive lower bounds on resource usage. For the CLIQUE-GAP problem—distinguishing graphs with a large clique from those without—new bounds using graph norms show that any deterministic streaming algorithm requires space \Omega(n^{1/2}) in the worst case, derived from \Omega(\sqrt{n}) communication lower bounds in multiparty protocols.[66] These tradeoffs imply that algorithms balancing time and space cannot achieve sublinear space without exponential time increases, as communication barriers prevent efficient coordination in distributed or memory-constrained settings.[67]
In quantum complexity, Grover's algorithm provides a quadratic speedup for unstructured search problems, including clique detection. By formulating maximum clique as a search over vertex subsets, a quantum implementation achieves O(2^{n/2}) time complexity compared to the classical O(2^n), though exact solutions remain exponential.[68] Recent efforts in the 2020s explore adiabatic quantum approaches; for instance, a 2024 parameter-independent algorithm using Seidel continuous-time quantum walks as a Hamiltonian solves maximum clique on random graphs of order up to 100 with accuracy converging to theoretical maxima and time complexity under O(n^2), outperforming classical simulated adiabatic methods by identifying larger cliques (e.g., size 14.5 vs. 13.75 on 100-vertex dense graphs).[69]
Geometric lower bounds address inapproximability in Euclidean spaces, where cliques correspond to sets of points within fixed diameter. In dimension d = \Theta(\log n), approximating the largest such set (equivalent to maximum clique in unit disk graphs) within factors (95/94 - \epsilon, \sqrt{4/3} - \epsilon) is NP-hard for any \epsilon > 0, unless P = NP.[70] This hardness persists in high dimensions, contrasting with polynomial-time solvability in fixed low dimensions like 2D, and underscores the computational barriers for geometric graph classes.[70]