Fact-checked by Grok 2 weeks ago

Subgraph

In , a subgraph is a whose set and edge set are subsets of the set and edge set, respectively, of a given . If one H is a subgraph of another G, then G is called a supergraph of H. This concept allows for the analysis of smaller portions of larger networks while preserving the structural relationships defined by the original 's connections. Subgraphs are classified into several types based on how vertices and edges are selected. An is formed by taking a of vertices from the original and including all edges that connect those vertices in the original. In contrast, a spanning subgraph includes all vertices of the original but may omit some edges, such as in the case of a . Other variants include edge-induced subgraphs, which are formed by selecting a of edges and including all vertices incident to them. These distinctions are crucial for studying properties like , cycles, and within specific components. The study of subgraphs plays a foundational role in and its applications, enabling algorithms for , network optimization, and complexity analysis. For instance, subgraph isomorphism problems determine whether one graph contains a copy of another as a subgraph, which is central to computational challenges like in and . Subgraphs also facilitate the decomposition of complex graphs into simpler structures, aiding in proofs of theorems such as Kuratowski's theorem on planarity.

Definition and Basic Concepts

Formal Definition

In graph theory, a graph is formally defined as an ordered pair G = (V, E), where V is a finite set of vertices and E is a set of edges, each edge being a pair of vertices. A subgraph H = (V', E') of G is a graph such that V' \subseteq V and E' \subseteq E, with the incidence function of H being the restriction of that of G to E', ensuring that every edge in E' connects vertices in V'. This definition presupposes familiarity with the basic structure of graphs, where edges in undirected graphs are unordered pairs \{u, v\} with u, v \in V, while in directed graphs (digraphs), edges are ordered pairs (u, v) and subgraphs must preserve this directionality by including only arcs whose tails and heads lie in V'. Trivial subgraphs include the empty graph, which has no vertices and no edges (V' = \emptyset, E' = \emptyset), and single-vertex subgraphs, where V' = \{v\} for some v \in V and E' = \emptyset. These cases satisfy the subgraph criteria and are valid instances, though the empty subgraph is sometimes disregarded in analyses unless explicitly considered.

Notation and Terminology

In , the relation where H is a of G is denoted H \subseteq G, signifying that the vertex set of H, denoted V(H), is a of the vertex set of G, or V(H) \subseteq V(G), and the edge set of H, denoted E(H), is a of the edge set of G, or E(H) \subseteq E(G). A proper subgraph of G is defined as a subgraph H where either V(H) is a proper of V(G) or E(H) is a proper of E(G), ensuring H \neq G. A spanning subgraph maintains the full vertex set of the original , so V(H) = V(G), while including only a subset of the edges. A minor of G extends this by allowing edge contractions alongside vertex and edge deletions to form H, representing a more contracted substructure. Standard conventions dictate that adjacency in a subgraph H directly inherits from G: an edge between two vertices exists in H if and only if it exists in G between the same pair. In the case of multigraphs, multiple edges between vertices in H and loops at vertices are included only if they appear correspondingly in G.

Types of Subgraphs

Induced Subgraph

An induced subgraph of a graph G = (V, E) is formed by selecting a subset S \subseteq V of vertices and including all edges from E that connect pairs of vertices in S. This construction ensures that the subgraph captures the complete neighborhood relationships among the chosen vertices without any edge omissions. The standard notation for an induced subgraph is G[S], denoting the induced by the set S, or alternatively, the subgraph of G induced by S. A key property of induced subgraphs is their maximal edge inclusion: for any two vertices in S, if an edge exists between them in G, it must be present in G[S]. This makes induced subgraphs particularly useful for studying vertex-induced structures, such as cliques or independent sets, where preserving all inter- connections is essential to analyze local properties without alteration. In contrast to a general subgraph, which may omit some edges between selected vertices, an induced subgraph requires the full set of edges present in the original among those vertices, ensuring no arbitrary deletions. For example, in the C_5 with vertices labeled 1 through 5 and edges forming a pentagon, selecting the non-adjacent vertices {1, 3} yields an consisting of two isolated vertices, as no edge connects them in C_5.

Spanning Subgraph

A spanning subgraph H of a G is defined as a subgraph where the set V(H) equals V(G), and the edge set E(H) is a of E(G). This construction preserves the entire set of the original while allowing the removal of edges, resulting in potentially sparser structures that retain the same number of vertices but fewer connections. Prominent examples of spanning subgraphs are and spanning forests. A is a connected, acyclic spanning subgraph of a connected G, containing exactly |V(G)| - 1 edges and providing the minimal structure to connect all vertices without cycles. For disconnected graphs, a spanning forest extends this concept as an acyclic spanning subgraph consisting of a for each of G. These structures are fundamental for identifying minimal connected frameworks within graphs. Spanning subgraphs play a key role in studies by enabling the exploration of minimal sets that maintain inclusion. They facilitate the discovery of efficient connecting structures, such as minimum spanning trees, which minimize total weight while ensuring . Algorithms like Kruskal's, which sorts by weight and adds non-cycle-forming greedily, and Prim's, which grows a tree from an arbitrary by incorporating the cheapest connecting , generate such minimum spanning trees as specific spanning subgraphs. Unlike induced subgraphs, which fix between selected , spanning subgraphs prioritize preservation over retention.

Edge Subgraph

An subgraph, also referred to as an edge-induced subgraph, of a G = (V, E) is obtained by selecting a nonempty F \subseteq E of edges, with the resulting subgraph H having edge set E(H) = F and vertex set V(H) consisting precisely of the endpoints of the edges in F. This construction ensures that H includes only those vertices incident to at least one edge in F, implicitly excluding any isolated vertices from the original G. The notation G[F] is commonly used to denote this subgraph. Unlike subgraphs that prioritize vertex selection, edge subgraphs emphasize the chosen edges and their immediate , making them suitable for examining edge-centric structures such as matchings, where no two edges share a . For instance, in the K_4 with vertices labeled a, b, c, d and all possible edges, selecting the non-adjacent edges \{ab, cd\} yields an edge subgraph that is a 2-matching covering all four vertices. This example illustrates how the vertex set automatically spans the endpoints without requiring explicit inclusion of non-incident vertices. Properties of edge subgraphs highlight their focus on edge preservation and potential for disconnected components if the selected edges form disjoint unions. They may not span all vertices of G, distinguishing them from spanning subgraphs that retain the full vertex set. In applications, edge subgraphs serve as a foundational concept for line graphs, where the adjacency of edges in the original graph is modeled by vertices in the line graph.

Properties and Operations

Subgraph Isomorphism

Subgraph isomorphism is a fundamental problem in that involves determining whether a given H, known as the pattern graph, is isomorphic to some subgraph of a larger G, the target graph. Formally, a subgraph isomorphism from H = (V_H, E_H) to G = (V_G, E_G) is an injective mapping f: V_H \to V_G such that for every pair of vertices u, v \in V_H, (u, v) \in E_H implies (f(u), f(v)) \in E_G. This mapping ensures that the structure of H is preserved in the image under f within G, but G may contain additional edges or vertices not involved in the mapping. In contrast, induced subgraph isomorphism requires stricter preservation of the graph structure. Here, the mapping f must satisfy the edge preservation condition above and additionally ensure that for every u, v \in V_H, (f(u), f(v)) \in E_G implies (u, v) \in E_H. This means no extra edges are allowed between the mapped vertices in G; the induced subgraph on f(V_H) must exactly match H. Subgraph isomorphism thus permits extraneous edges in G between the mapped vertices, making it a more permissive variant, while induced subgraph isomorphism enforces exact structural fidelity, including the absence of edges. The of subgraph is NP-complete when both graphs are part of the input, as it generalizes problems like finding cliques or independent sets. However, the problem admits polynomial-time algorithms for restricted cases, such as when the pattern graph H is a , where dynamic programming techniques can efficiently verify the . For induced subgraph , the NP-completeness holds similarly, though solvability in polynomial time extends to additional classes like outerplanar graphs under certain conditions. A representative example is detecting a in a , equivalent to finding the K_3 (three vertices all pairwise connected) as a subgraph of G. This involves searching for any three vertices in G that form such a configuration, illustrating how subgraph captures basic for motifs like cycles or cliques. In practice, exhaustive search is infeasible for large graphs, but heuristics prune candidates based on degrees or neighborhoods. Key algorithms for solving subgraph isomorphism include Ullmann's 1976 method, which builds partial mappings iteratively while using look-up and refinement rules to eliminate incompatible vertices early, achieving practical performance on moderate-sized graphs. A more advanced approach is the VF2 algorithm, introduced in 2004, which enhances with state-space traversal, feasibility checks on partial matches, and look-ahead to detect inconsistencies sooner, making it effective for larger instances in domains like . These methods trade completeness for efficiency on real-world graphs, often outperforming naive enumeration.

Density and Connectivity Preservation

In , the density of an undirected subgraph H = (V(H), E(H)) is formally defined as the ratio of the number of edges in H to the maximum possible number of edges on its vertex set, given by \rho(H) = \frac{|E(H)|}{\binom{|V(H)|}{2}}. This measure quantifies how closely H approaches a on |V(H)| , with \rho(H) \leq 1 always holding. For a general subgraph H of a parent graph G, the \rho(H) is not necessarily bounded above by \rho(G); dense subgraphs can exhibit higher density than G if G contains clusters of high edge concentration amid sparser regions, as the reduced vertex set allows the edge count relative to possible pairs to increase. However, for spanning subgraphs—those retaining all of G but possibly fewer edges—the same denominator \binom{|V(G)|}{2} applies, ensuring \rho(H) \leq \rho(G) since |E(H)| \leq |E(G)|. Connectivity properties in subgraphs reflect how paths and separators from the parent are inherited or altered. A connected subgraph H of G preserves the existence of paths between its own vertices that were present in G using only edges and vertices within H, but it may exclude longer paths relying on vertices outside V(H). For k-connected subgraphs, where k \geq 1, the structure maintains at least k vertex-disjoint paths between any pair of its vertices, mirroring the parent graph's separation properties within the ; this implies that the minimum cut in H is at least k if H is chosen to be k-connected. Induced subgraphs, formed by selecting a and including all edges between them from G, preserve local in the sense of adjacency and immediate neighborhoods, ensuring that direct connections (or their absence) between selected vertices remain unchanged, though global paths may be severed if they traverse excluded vertices. In contrast, spanning subgraphs can reduce overall by edge deletions; for instance, a connected spanning subgraph like a maintains 1-connectivity (basic connectedness) but drops higher k-connectivity to 1, potentially disconnecting the graph entirely if edges are removed to create isolated components. Theorems on Hamiltonicity provide key insights into connectivity preservation under degree conditions. Dirac's theorem states that an n-vertex G with minimum \delta(G) \geq n/2 contains a cycle, a connected spanning subgraph visiting each exactly once. Implications for subgraphs arise in robustness results: Dirac graphs remain Hamiltonian after removal of a constant fraction of edges or vertices, ensuring that certain spanning subgraphs preserve the Hamiltonian property despite reductions in or . Brief extensions, such as those to random subgraphs, show that with high probability, a p-random spanning subgraph of a Dirac retains Hamiltonicity for p above a threshold, highlighting how is preserved probabilistically in edge-sparse reductions. Average serves as a practical for assessing sparsity in subgraphs, defined as d(H) = \frac{2|E(H)|}{|V(H)|}, which equals twice the for undirected graphs. Subgraphs with low average indicate sparsity, and the maximum average over all subgraphs of G, denoted \mathrm{mad}(G), quantifies the densest possible local structure; graphs with bounded \mathrm{mad}(G) < d are considered sparse, as no subgraph exceeds this , aiding of in . For example, in induced subgraphs, the average reflects the local sparsity of the subset, often lower than G's global average if the subset avoids high- regions.

Applications

Graph Algorithms

Graph algorithms for subgraphs encompass methods to construct, identify, and optimize subgraphs within larger graphs, addressing both theoretical and practical challenges in graph computation. Construction algorithms often leverage traversal techniques to extract specific subgraphs, such as connected components. (DFS) is a foundational method for identifying connected subgraphs, where the algorithm explores as far as possible along each branch before , thereby delineating the vertices and edges forming each in linear time relative to the graph size. This approach ensures that all edges between visited vertices are included, producing induced connected subgraphs efficiently for undirected graphs. methods extend construction to approximate maximum subgraphs, such as selecting vertices iteratively based on local density criteria to build large induced subgraphs with high edge counts, though these heuristics do not guarantee optimality. Finding subgraphs involves enumeration techniques, particularly in data mining contexts where frequent patterns are sought. The , introduced in 2002, exemplifies this by employing a strategy on a DFS code tree to enumerate frequent subgraphs without generating candidate sets, avoiding redundant checks through canonical labeling. Introduced for graph-based substructure , gSpan efficiently handles labeled graphs by infrequent extensions, making it suitable for large datasets in cheminformatics and bioinformatics. Subgraph may serve as a subroutine in such enumeration to verify matches, but broader algorithms focus on scalable listing over exhaustive verification. Optimization problems in subgraph algorithms often center on extremal structures, with the maximum representing the search for the largest complete . This problem is NP-hard, as established through from 3-SAT, implying no efficient exact exists unless P=. Practical approximations, including greedy vertex addition based on neighborhood overlap, provide near-optimal solutions for sparse graphs, highlighting the computational trade-offs in subgraph optimization. In practice, libraries like NetworkX facilitate subgraph extraction through functions such as induced_subgraph, which generates a view of the induced subgraph on a specified node set, preserving edges between them for further analysis. These tools integrate construction and finding operations, enabling rapid prototyping of subgraph algorithms in Python environments. Historically, early efforts in the 1970s by Read and Corneil surveyed algorithms for subgraph-related problems, including listing techniques tied to isomorphism testing, laying groundwork for modern enumeration methods.

Network Theory

In network theory, subgraphs play a central role in analyzing the structure and dynamics of complex systems, such as social and biological networks, by identifying modular components that reveal underlying organizational principles. Community detection leverages subgraphs to uncover densely connected clusters within networks, where these subgraphs represent groups of nodes with stronger internal ties compared to external connections. The Louvain method, introduced in 2008, iteratively optimizes modularity to partition networks into such modular subgraphs, enabling the identification of communities in large-scale systems like social media graphs. This approach highlights how subgraphs preserve higher internal density, facilitating the study of group cohesion in social networks. Motif analysis focuses on small induced subgraphs that recur significantly more often than expected by chance, serving as fundamental building blocks in biological networks. In gene regulatory networks, the feed-forward loop—a motif consisting of three nodes where one regulates the other two, and the first two jointly regulate a target—appears frequently and performs functions like signal filtering and response acceleration. These motifs provide insights into evolutionary pressures shaping , with overrepresentation indicating functional significance in processes such as transcription regulation. Subgraphs also inform assessments of network robustness, particularly through , which examines the connectivity of random subgraphs formed by removing nodes or edges to simulate failures. In scale-free networks, percolation on random subgraphs reveals a where a giant emerges only above a critical removal , underscoring the of real-world systems like the against random disruptions. This framework quantifies by analyzing the size and persistence of percolating subgraphs under adversarial conditions. In social networks, cliques manifest as induced subgraphs where every pair of nodes is directly connected, representing tightly knit groups such as friendship circles that enhance and influence propagation. In local area networks, subgraphs provide loop-free paths connecting all devices, as implemented in protocols like the developed by in 1985 that select a minimal set of edges to ensure efficient data transmission. Interdisciplinary applications extend to epidemiology, where contact subgraphs—induced by interactions between individuals—model disease transmission pathways in human mobility networks. These subgraphs capture clusters of potential spread, allowing simulations to predict outbreak sizes and inform intervention strategies like targeted quarantines.

References

  1. [1]
    Subgraph -- from Wolfram MathWorld
    A subgraph of a graph is a graph whose vertex set and edge set are subsets of those of . If is a subgraph of , then is said to be a supergraph of.
  2. [2]
    Graph Theory - Subgraphs - Tutorials Point
    In graph theory, a subgraph is a graph formed from a subset of the vertices and edges of another graph. Subgraphs plays an important role in understanding ...
  3. [3]
    subgraph
    Definition: A graph whose vertices and edges are subsets of another graph. Formal Definition: A graph G'=(V', E') is a subgraph of another graph G=(V, E) iff.
  4. [4]
    Subgraph in Graph Theory: Analyse Part of a Network - Symbio6
    A subgraph is a little replica of a larger graph. Selecting some points (vertices) from the larger graph and connecting them with lines (edges) creates it.
  5. [5]
    [PDF] The Basics
    1.1.3. A graph G with subgraphs G0 and G00: G0 is an induced subgraph of G, but G00 is not. If G0 ✓ G and G0 contains all the edges xy 2 E with x, y 2 V 0, ...
  6. [6]
    [PDF] Chapter 2. Subgraphs
    Sep 22, 2022 · A supergraph of a graph G is a graph H which contains G as a subgraph, H ⊇ G. Subgraphs F 6= G and supergraphs H 6= G of G are called proper.Missing: formal | Show results with:formal
  7. [7]
    [PDF] 5 Directed Graphs
    Directed Graph: A directed graph, or digraph, D, consists of a set of vertices V (D), a set of edges E(D), and a function which assigns each edge e an ...
  8. [8]
    [PDF] GRAPH THEORY WITH APPLICATIONS
    This book is intended as an introduction to graph theory. Our aim has been to present what we consider to be the basic material, together with a wide.
  9. [9]
    [PDF] Groupwork: An Introduction to Graphs
    Jan 22, 2019 · A subgraph H of a graph G is said to be an induced subgraph of G if (a) H is a subgraph of G and (b) whenever u, v ∈ V (H) and u is adjacent to ...
  10. [10]
    [PDF] Section 2.2. Spanning and Induced Subgraphs
    Sep 28, 2020 · Definition. A spanning subgraph of a graph G is a subgraph obtained by edge deletions only (so that a spanning subgraph is a subgraph of G ...
  11. [11]
    Graph Minor -- from Wolfram MathWorld
    A graph H is a minor of a graph G if a copy of H can be obtained from G via repeated edge deletion and/or edge contraction.
  12. [12]
    14. Some Graph Theory - MIT Mathematics
    There is a standard notation for several kinds of graphs that are of general interest. The graph Ck is a k length cycle, consisting of k vertices and k ...
  13. [13]
    [PDF] 5 – Graph Theory Basics - William T. Trotter
    Nov 14, 2017 · Definition A graph H = (V', E') is an induced subgraph of a graph G = (V, E) if V' ⊆ V and xy is an edge in H whenever x and y are distinct.
  14. [14]
    [PDF] 11. Graph Theory
    A subgraph G′ of a graph G = (V, E) is called an induced subgraph if there exists U⊆V such that G′ = <U>. Note: An induced subgraph is a subgraph. But a ...
  15. [15]
    [PDF] Chapter 1: Graphs, Subgraphs, Degrees, and Connections
    A subgraph of a graph has some of the vertices and some of the edges. An induced subgraph is one which contains all possible edges for its vertices. A.
  16. [16]
    Spanning Tree -- from Wolfram MathWorld
    A spanning tree of a graph on n vertices is a subset of n-1 edges that form a tree (Skiena 1990, p. 227). For example, the spanning trees of the cycle graph ...
  17. [17]
    [PDF] 5. Spanning Trees - FSU Mathematics
    Definition 5.5.1. A spanning forest of a graph G is the union of a collection of one spanning tree from each connected component of G.Missing: theory | Show results with:theory
  18. [18]
    [PDF] On the Shortest Spanning Subtree of a Graph ... - Utah State University
    On the Shortest Spanning Subtree of a Graph and the Traveling Salesman Problem. Author(s): Joseph B. Kruskal, Jr. Source: Proceedings of the American ...
  19. [19]
    BSTJ 36: 6. November 1957: Shortest Connection Networks And ...
    Jan 19, 2013 · November 1957: Shortest Connection Networks And Some Generalizations. (Prim, R.C.) ... PDF WITH TEXT download · download 1 file · SINGLE PAGE ...
  20. [20]
    Edge-Induced Subgraph -- from Wolfram MathWorld
    An edge-induced subgraph is a subset of the edges of a graph G together with any vertices that are their endpoints. The subgraph induced by a set of edges ...
  21. [21]
    [PDF] Week 1-2: Graphs and Subgraphs - HKUST Math Department
    Sep 19, 2020 · An empty graph is a graph with possible vertices but no edges. • A complete graph is a simple graph that every pair of vertices are adjacent.
  22. [22]
    Line Graph -- from Wolfram MathWorld
    G is obtained by associating a vertex with each edge of the graph and connecting two vertices with an edge iff the corresponding edges of G have a vertex in ...
  23. [23]
    [PDF] 1 Subgraph Isomorphism - Stanford CS Theory
    Sep 28, 2016 · A subgraph isomorphism from H to G is a function f : VH → V such that if (u, v) ∈ EH, then (f(u),f(v)) ∈ E. f is an induced subgraph ...Missing: formal | Show results with:formal
  24. [24]
    [PDF] Subtree Isomorphism Revisited 1 Introduction - People | MIT CSAIL
    Abstract. The Subtree Isomorphism problem asks whether a given tree is contained in another given tree. The problem is of fundamental importance and has ...
  25. [25]
    An Algorithm for Subgraph Isomorphism | Journal of the ACM
    In this paper a new algorithm is introduced that attains efficiency by inferentially eliminating successor nodes in the tree search.
  26. [26]
  27. [27]
    [PDF] Finding a Maximum Density Subgraph
    Since the density of a graph is half the average degree, the problem is equivalent to that of finding a maximum aver- age degree subgraph of a graph. Note that ...
  28. [28]
    [PDF] Connectivity
    In this section we describe how every 3-connected graph can be ob- tained from a K4 by a succession of elementary operations preserving. 3-connectedness. We ...
  29. [29]
    [PDF] Dirac's theorem
    Dirac's Theorem. Recall that a Hamiltonian cycle in a graph G = (V,E) is a cycle that visits each vertex exactly once. Unlike for Euler cycles, no simple ...
  30. [30]
    [PDF] Robust Hamiltonicity of Dirac graphs - Mathematical Sciences
    A classical theorem of Dirac from 1952 asserts that every graph on n vertices with minimum degree at least n/2 is Hamiltonian. We refer to such graphs as Dirac ...
  31. [31]
    Cyclic Subsets in Regular Dirac Graphs - Oxford Academic
    These stronger theorems include showing that Dirac graphs have many Hamiltonian cycles [8, 30] and remain Hamiltonian even after passing to a p -random subgraph ...Missing: implications | Show results with:implications
  32. [32]
    Maximum average degree and relaxed coloring - ScienceDirect.com
    The maximum average degree, m a d ( G ) , of a graph G is the maximum average degree over all subgraphs of G . This parameter is used to measure how sparse a ...
  33. [33]
    [PDF] Chapter 1: Measuring sparsity - mimuw
    Oct 24, 2017 · Sparsity can be measured by bounded average degree, where the average degree of a graph is bounded, or by degeneracy, where subgraphs have a ...
  34. [34]
    [PDF] reducibility among combinatorial problems - CS@Purdue
    (Plenum Press, 1972). REDUCIBILITY AMONG COMBINATORIAL PROBLEMS. +. Richard M. Karp. University of California at Berkeley. Abstract: A large class of ...
  35. [35]
    Graph.subgraph — NetworkX 3.5 documentation
    Returns a SubGraph view of the subgraph induced on nodes. The induced subgraph of the graph contains the nodes in nodes and the edges between those nodes.
  36. [36]
    The graph isomorphism disease - Read - 1977 - Wiley Online Library
    ” This paper surveys the present state of the art of isomorphism testing ... 9 D. G. Corneil, Graph Isomorphism. Ph.D. Thesis. University of Toronto ...Missing: survey | Show results with:survey
  37. [37]
    Network robustness and fragility: Percolation on random graphs - arXiv
    Jul 18, 2000 · In this paper we study percolation on graphs with completely general degree distribution, giving exact solutions for a variety of cases.Missing: subgraphs | Show results with:subgraphs
  38. [38]
    Contact networks have small metric backbones that maintain ...
    Thus, the metric backbone is an important subgraph with regard to epidemic spread, the robustness of social networks, and any communication dynamics that depend ...