Fact-checked by Grok 2 weeks ago

Breadth-first search

Breadth-first search (BFS) is a fundamental graph traversal algorithm that systematically explores the vertices of a graph or tree data structure level by level, starting from a designated source vertex and visiting all adjacent vertices at the current depth before advancing to vertices at the next depth.^[1] It operates using a queue data structure to manage the order of exploration, ensuring that closer vertices (in terms of edge count) are processed before farther ones.^[2] The algorithm traces its origins to early work in graph theory and maze-solving problems; it was independently developed by Konrad Zuse in 1945 for counting connected components in graphs, though not published until 1972, and reinvented by Edward F. Moore in 1957 as "Algorithm A" for computing the shortest path through a maze.^[3] Moore's description, published in 1959, formalized BFS as a method to find paths of minimal length in unweighted graphs by expanding layers outward from the source.^[4] Later, C. Y. Lee extended its application in 1961 to wire routing in circuit design, further establishing its utility in computational problems.^[3] In practice, BFS begins by enqueueing the source vertex and marking it as visited, then repeatedly dequeues a vertex, explores its unvisited neighbors by enqueuing them, and assigns each a level or distance equal to the dequeued vertex's level plus one.^[1] This process builds a breadth-first tree via parent pointers, which reconstructs shortest paths from the source to all reachable vertices in terms of the fewest edges.^[1] A typical implementation in pseudocode, as described in standard algorithms texts, is as follows:

BFS(G, s):
    Create a queue Q
    Enqueue s into Q
    Mark s as visited
    While Q is not empty:
        Dequeue u from Q
        For each neighbor v of u:
            If v is not visited:
                Enqueue v into Q
                Mark v as visited
                Set parent[v] = u  // For path reconstruction
BFS(G, s):
    Create a queue Q
    Enqueue s into Q
    Mark s as visited
    While Q is not empty:
        Dequeue u from Q
        For each neighbor v of u:
            If v is not visited:
                Enqueue v into Q
                Mark v as visited
                Set parent[v] = u  // For path reconstruction

This structure guarantees completeness in finite graphs and optimality for unweighted shortest paths.^[5] BFS exhibits linear time complexity of O(V + E), where V is the number of vertices and E is the number of edges, making it efficient for sparse graphs represented via adjacency lists.^[1] It requires \Theta(V) space for the queue and visited markers in the worst case.^[1] Notable applications include finding shortest paths in unweighted networks, such as social network connections or puzzle solving (e.g., sliding tiles); computing connected components; and serving as a building block for more advanced algorithms like bidirectional search.^[3] Unlike depth-first search, BFS prioritizes breadth over depth, avoiding deep recursion but potentially consuming more memory for wide graphs.^[5]

Introduction

Definition

Breadth-first search (BFS) is a fundamental graph traversal algorithm that systematically explores the vertices of a graph or tree structure level by level, beginning from a specified source vertex and expanding outward to visit all adjacent vertices before proceeding to the next layer of vertices. This level-order exploration ensures that vertices are processed in increasing order of their shortest-path distance (measured by the number of edges) from the source, making BFS particularly effective for discovering the structure of connected components in undirected graphs or reachable sets in directed graphs.^[6]^[7] The algorithm maintains a frontier of vertices to explore using a first-in, first-out (FIFO) queue, where the source vertex is initially enqueued and marked as visited to avoid cycles. As each vertex is dequeued, its unvisited neighbors are enqueued and labeled with the current distance plus one, effectively building layers of vertices at distances 0, 1, 2, and so on from the source. This process continues until the queue is empty, ensuring every reachable vertex is visited exactly once. BFS operates on both directed and undirected graphs without modification, though it assumes the graph is represented in a way that allows efficient neighbor access, such as adjacency lists.^[6]^[8]^[7] A defining property of BFS is its optimality in unweighted graphs, where it guarantees the discovery of shortest paths in terms of edge count, as vertices are always expanded in non-decreasing distance order. The time complexity is linear in the size of the graph, specifically O(|V| + |E|), where |V| is the number of vertices and |E| is the number of edges, since each vertex and edge is processed at most once. Space complexity is O(|V|) due to the queue and visited tracking structures. These characteristics position BFS as an archetype for uninformed search strategies in graph theory and computer science.^[7]^[8]

History

The breadth-first search (BFS) algorithm traces its origins to 1945, when Konrad Zuse developed an early implementation as part of his work on Plankalkül, the first high-level programming language. In his rejected Ph.D. thesis, Zuse used BFS to count and label connected components in undirected graphs, providing one of the earliest algorithmic descriptions of level-by-level graph traversal. Although not published until 1972, this formulation demonstrated BFS's utility for combinatorial problems predating widespread graph theory adoption.^[9] The algorithm was independently rediscovered in 1957 by Edward F. Moore, who applied it to find the shortest path through a maze modeled as an unweighted graph. Moore presented BFS, termed "Algorithm A," at the International Symposium on the Theory of Switching, emphasizing its ability to explore vertices layer by layer using a queue. The proceedings, published in 1959, formalized BFS as a method for computing shortest paths in graphs with unit edge weights, establishing its theoretical foundation in switching theory and maze-solving contexts.^[4] In 1961, C. Y. Lee further advanced BFS by adapting it for practical path connection problems in circuit board design. Lee's algorithm, published in the IRE Transactions on Electronic Computers, employed BFS to route wires between terminals while avoiding obstacles, treating the board as a grid graph. This extension highlighted BFS's efficiency for shortest-path computations in discrete spaces and influenced early applications in electrical engineering and automated routing. These seminal contributions by Zuse, Moore, and Lee solidified BFS as a core graph traversal technique, paving the way for its integration into broader computer science methodologies by the 1970s.^[9]

Comparison to Other Algorithms

Depth-First Search

Depth-first search (DFS) is a fundamental graph traversal algorithm that explores as deeply as possible along each branch of a graph before backtracking, contrasting with the level-by-level exploration of breadth-first search (BFS).^[10] In DFS, the algorithm starts at a source vertex and recursively visits an unvisited neighbor, continuing this process until reaching a dead end, at which point it backtracks to explore alternative paths. This depth-oriented approach makes DFS particularly suitable for scenarios where the goal is to find any path or detect connectivity, rather than the shortest one.^[11] A key difference between DFS and BFS lies in their traversal order and underlying data structures. BFS processes vertices in order of increasing distance from the source using a queue (FIFO), ensuring that all vertices at a given level are visited before moving deeper. In contrast, DFS employs a stack (LIFO) or recursion to prioritize the most recently discovered vertices, leading to a traversal that delves deeply into one path before exploring siblings. This results in DFS producing a spanning tree that is typically "long and stringy," while BFS yields a "short and bushy" tree with shortest-path properties from the root.^[12] Both algorithms exhibit the same time complexity of O(V + E), where V is the number of vertices and E is the number of edges, as they each visit every vertex and edge exactly once in connected graphs. However, their space complexities differ based on graph structure: DFS generally requires less memory, storing only the current path (up to O(V) in the worst case for deep graphs), whereas BFS may need O(V) space to hold an entire level of the queue in wide, shallow graphs. DFS's recursive implementation can risk stack overflow in very deep graphs, but iterative versions using explicit stacks mitigate this.^[10]^[11] In terms of optimality and completeness, BFS is complete and optimal for finding the shortest path in unweighted graphs, guaranteeing the minimal number of edges. DFS, while complete in finite graphs, does not ensure shortest paths and may find longer routes first, making it non-optimal for distance-based queries. DFS excels in applications like topological sorting, cycle detection via back edges, and finding strongly connected components, where deep exploration suffices without needing breadth. Conversely, BFS is preferred for shortest-path problems in unweighted graphs, such as social network degrees of separation or puzzle solving like the 15-puzzle.^[12]^[10] The choice between DFS and BFS often depends on the graph's shape and problem requirements. In deep, narrow graphs (e.g., expression trees), DFS is more space-efficient and faster in practice for path-finding tasks. In wide graphs (e.g., web graphs), BFS avoids the potential exponential path lengths of DFS but at higher memory cost. For example, in a binary tree traversal, DFS mimics preorder traversal for deep dives, while BFS performs level-order for balanced processing.^[11]

Aspect	BFS	DFS
Traversal Strategy	Level-by-level (breadth-first)	Branch-by-branch (depth-first)
Data Structure	Queue (FIFO)	Stack or recursion (LIFO)
Path Optimality	Shortest path in unweighted graphs	Any path, not necessarily shortest
Space Usage	Higher in wide graphs (O(V))	Lower in deep graphs (O(V))
Typical Applications	Shortest paths, bipartite checking	Cycle detection, topological sort

^[10]

Iterative Deepening Depth-First Search

Iterative deepening depth-first search (IDDFS) is a state-space search strategy that combines the space efficiency of depth-first search (DFS) with the completeness and optimality of breadth-first search (BFS). Introduced by Richard E. Korf in 1985, IDDFS performs a series of DFS traversals, each limited to a successively deeper level, until a goal state is found. This iterative approach simulates the level-by-level expansion of BFS while avoiding the exponential memory requirements of the latter.^[13] In contrast to BFS, which explores all nodes at a given depth before moving deeper and requires storing the entire frontier (potentially up to O(b^d) space, where b is the branching factor and d is the solution depth), IDDFS re-expands nodes at shallower depths multiple times across iterations but limits its memory to the current path (O(d) space). This makes IDDFS particularly advantageous in memory-constrained environments, such as large search spaces in artificial intelligence planning or puzzle solving, where BFS might become infeasible due to space explosion. However, the repeated expansions introduce a time overhead, resulting in a total time complexity of O(b^d) with a multiplicative constant factor of approximately (1 + 1/b + 1/b^2 + ...), which is only slightly higher than BFS's O(b^d) in practice for typical branching factors.^[13] Both algorithms are complete in finite spaces—guaranteeing a solution if one exists—and optimal for unweighted graphs, finding the shallowest goal node. Unlike standard DFS, which can fail to find optimal solutions or terminate without exploring all possibilities, IDDFS inherits BFS's guarantees by systematically increasing the depth bound until convergence. Korf proved that IDDFS achieves the optimal running time among admissible brute-force tree searches, as it minimizes redundant work while ensuring no shallower solutions are missed. This hybrid nature positions IDDFS as a practical alternative to BFS when memory is the primary bottleneck, though BFS remains preferable for scenarios with abundant resources and where the slight time savings matter.^[13]

Algorithm Description

Pseudocode

The breadth-first search (BFS) algorithm can be expressed in pseudocode that outlines its use of a queue to explore graph vertices level by level, starting from a designated source vertex. This formulation assumes an undirected or directed graph represented as G = (V, E) with an adjacency-list structure, where V is the set of vertices and E is the set of edges. The algorithm maintains attributes for each vertex: a color to track visitation status (WHITE for undiscovered, GRAY for discovered but not fully explored, BLACK for fully explored), a distance d from the source, and a predecessor \pi for path reconstruction. A canonical pseudocode for BFS, as presented in the seminal textbook Introduction to Algorithms (3rd edition) by Cormen, Leiserson, Rivest, and Stein, is as follows:

BFS(G, s)
1  for each [vertex](/page/Vertex) u ∈ G.V - {s}
2      u.color ← WHITE
3      u.d ← ∞
4      u.π ← NIL
5  s.color ← GRAY
6  s.d ← 0
7  s.π ← NIL
8  Q ← ∅
9  ENQUEUE(Q, s)
10 while Q ≠ ∅
11     u ← DEQUEUE(Q)
12     for each v ∈ G.Adj[u]
13         if v.color == WHITE
14             v.color ← GRAY
15             v.d ← u.d + 1
16             v.π ← u
17             ENQUEUE(Q, v)
18     u.color ← BLACK
BFS(G, s)
1  for each [vertex](/page/Vertex) u ∈ G.V - {s}
2      u.color ← WHITE
3      u.d ← ∞
4      u.π ← NIL
5  s.color ← GRAY
6  s.d ← 0
7  s.π ← NIL
8  Q ← ∅
9  ENQUEUE(Q, s)
10 while Q ≠ ∅
11     u ← DEQUEUE(Q)
12     for each v ∈ G.Adj[u]
13         if v.color == WHITE
14             v.color ← GRAY
15             v.d ← u.d + 1
16             v.π ← u
17             ENQUEUE(Q, v)
18     u.color ← BLACK

This pseudocode initializes all vertices except the source s as undiscovered (lines 1–4), sets the source as discovered with distance 0 (lines 5–7), and begins with s in an empty queue Q (lines 8–9). The main loop (lines 10–18) dequeues a vertex u and examines its adjacent vertices; any undiscovered neighbor v is marked discovered, assigned a distance one greater than u's, linked as u's child in the BFS tree, and enqueued. Once all neighbors of u are processed, u is marked fully explored. The result is a breadth-first spanning tree for the connected component containing s, with d giving the shortest-path distance from s to each reachable v in unweighted graphs.^[14] Variations of this pseudocode exist for specific applications, such as omitting color attributes in favor of a simple visited flag and distance array when only traversal (not shortest paths) is needed. For instance, an alternative formulation uses an explored array instead of colors and focuses solely on discovery and distance computation:

procedure BFS(G, s)
    for each [vertex](/page/Vertex) v ∈ V[G]
        explored[v] ← false
        d[v] ← ∞
    end for
    explored[s] ← true
    d[s] ← 0
    Q ← a [queue](/page/Queue), initialized with s
    while Q ≠ ∅
        u ← DEQUEUE(Q)
        for each v adjacent to u
            if not explored[v]
                explored[v] ← true
                d[v] ← d[u] + 1
                ENQUEUE(Q, v)
            end if
        end for
    end while
end procedure
procedure BFS(G, s)
    for each [vertex](/page/Vertex) v ∈ V[G]
        explored[v] ← false
        d[v] ← ∞
    end for
    explored[s] ← true
    d[s] ← 0
    Q ← a [queue](/page/Queue), initialized with s
    while Q ≠ ∅
        u ← DEQUEUE(Q)
        for each v adjacent to u
            if not explored[v]
                explored[v] ← true
                d[v] ← d[u] + 1
                ENQUEUE(Q, v)
            end if
        end for
    end while
end procedure

This version, drawn from academic course materials, simplifies tracking by avoiding the three-color scheme while preserving the core queue-based level-order exploration. It is suitable for basic graph traversal tasks, such as finding connected components, and still achieves O(|V| + |E|) time complexity with adjacency lists. Both formulations ensure that vertices are visited in order of increasing distance from the source, a key property of BFS.^[15]

Implementation Details

Breadth-first search (BFS) is typically implemented using a first-in, first-out (FIFO) queue to manage the order of vertex exploration, ensuring that vertices are processed level by level from the source. The algorithm requires a graph representation, commonly as an adjacency list for efficient neighbor access, where each vertex points to a list of its adjacent vertices; this uses \Theta(V + E) space, with V as the number of vertices and E as the number of edges. An array or set tracks visited vertices to prevent revisiting and infinite loops in graphs with cycles. In the standard formulation, a color array categorizes vertices as white (undiscovered), gray (discovered but not fully explored), or black (fully explored), allowing detection of the current processing state during traversal. The initialization step marks the source vertex s as gray, assigns it a distance of 0 from itself, sets its predecessor to null, and enqueues it. While the queue is non-empty, the algorithm dequeues a vertex u, processes its neighbors by iterating through the adjacency list, and for each unvisited (white) neighbor v, marks v as gray, updates its distance to d + 1, sets its predecessor to u, and enqueues v. After processing all neighbors, u is marked black. This process builds a breadth-first spanning tree via predecessor pointers and computes shortest-path distances in unweighted graphs. The color mechanism ensures each vertex and edge is examined exactly once, yielding O(V + E) time complexity with adjacency lists.^[16] For graphs with multiple connected components, BFS can be extended by iterating over all vertices and invoking the procedure on each unvisited (white) vertex as a new source, though the core implementation remains focused on single-source traversal. In implicit graphs, such as those generated on-the-fly (e.g., puzzle states), the adjacency function replaces explicit lists, maintaining zero additional space beyond the frontier and tracking structures. The queue can be implemented as a linked list or array-based deque for O(1) amortized enqueue and dequeue operations, with the frontier size peaking at the widest level of the graph. Practical considerations include using a boolean visited array instead of colors for simpler applications without need for discovery timing, reducing space to O(V). For very large graphs, bit vectors or hash sets optimize the visited structure, trading access time for memory efficiency, though the standard color array supports additional analyses like cross-edge detection in directed graphs.

Examples

Graph Traversal Example

To illustrate breadth-first search (BFS) in graph traversal, consider an undirected graph with vertices A, B, C, D, E, and F, and edges connecting A to B, A to C, A to D, C to E, and both C and D to F.^[17] This graph forms a structure where A is the starting vertex at level 0, connected to B, C, and D at level 1, and E and F at level 2. BFS begins at vertex A and explores the graph level by level using a queue to manage the order of visitation, ensuring all neighbors of a vertex are visited before moving to the next level.^[17] The traversal process initializes an empty queue and enqueues the starting vertex A, marking it as visited. The algorithm then dequeues A and enqueues its unvisited neighbors B, C, and D in the order they appear in the adjacency list, resulting in the queue containing B, C, D. Next, it dequeues B, which has no unvisited neighbors, leaving the queue as C, D. Dequeuing C adds its unvisited neighbors E and F to the queue, now D, E, F. Dequeuing D attempts to add F, but F is already visited, so the queue remains E, F. Finally, dequeuing E and F processes their neighbors, but none are unvisited, emptying the queue. The visitation order is A, B, C, D, E, F, which corresponds to the level-order traversal of the graph.^[17] This example demonstrates BFS's use of a first-in, first-out (FIFO) queue to prioritize nearer vertices, producing a breadth-first spanning tree where parent-child relationships reflect the discovery order. In practice, implementations track visited vertices with a set or array to avoid revisiting, and the queue ensures systematic exploration without backtracking. Such traversals are foundational for applications like finding shortest paths in unweighted graphs or computing connected components.^[17]

Step	Dequeued Vertex	Queue State Before Enqueue	Enqueued Neighbors	Queue State After	Visited Order So Far
1	A	[A]	B, C, D	[B, C, D]	A
2	B	[B, C, D]	None	[C, D]	A, B
3	C	[C, D]	E, F	[D, E, F]	A, B, C
4	D	[D, E, F]	None (F visited)	[E, F]	A, B, C, D
5	E	[E, F]	None	[F]	A, B, C, D, E
6	F	[F]	None	[]	A, B, C, D, E, F

This table summarizes the queue dynamics and progression, highlighting how BFS maintains level integrity during traversal.^[17]

Tree Level-Order Example

In breadth-first search (BFS) applied to trees, the algorithm performs a level-order traversal, visiting all nodes at the current depth before proceeding to the next deeper level. This process begins at the root (level 0) and uses a queue to enqueue child nodes level by level, ensuring a systematic exploration from top to bottom and left to right within each level. Unlike graphs, trees lack cycles, so BFS on trees guarantees visitation of all nodes without revisiting, making it ideal for applications like printing tree structures or finding shortest paths in unweighted tree edges.^[18] Consider a binary search tree with the following structure: the root is 38; its left child is 5 (with left child 1 and right child 9); 9 has left child 8 and right child 15; 15 has left child 13; the right child of 38 is 45, which has right child 47; and 47 has left child 46. This tree spans multiple levels, illustrating how BFS processes nodes hierarchically.^[18] The BFS traversal starts by enqueuing the root (38) and dequeuing it to visit level 0. It then enqueues 38's children (5 and 45) for level 1. Dequeuing 5 enqueues its children (1 and 9) while dequeuing 45 enqueues 47, forming level 2 (1, 9, 47). Continuing, level 3 consists of 8, 15 (from 9), and 46 (from 47), followed by level 4 with 13 (from 15). The complete visitation order is: 38, 5, 45, 1, 9, 47, 8, 15, 46, 13. This order reflects the queue's FIFO nature, prioritizing shallower nodes.^[18] To visualize the levels:

Level 0: 38
Level 1: 5, 45
Level 2: 1, 9, 47
Level 3: 8, 15, 46
Level 4: 13

This example demonstrates BFS's utility in trees for operations like serialization or layer-wise processing, where the level structure provides clear separation of depths.^[18]

Analysis

Time and Space Complexity

The time complexity of breadth-first search (BFS) on a graph with V vertices and E edges is O(V + E).^[19]^[20] This arises because each vertex is enqueued and dequeued at most once, accounting for O(V) operations, while each edge is examined at most twice (once from each endpoint in an undirected graph), contributing O(E) work. In practice, for sparse graphs where E = O(V), the algorithm runs in linear time relative to the input size.^[19] The space complexity of BFS is O(V).^[19]^[20] This is dominated by the queue, which in the worst case holds up to O(V) vertices (e.g., all vertices at the current level in a broad graph), and the visited set, which tracks all V vertices to avoid revisiting. For trees without cycles, the space remains O(V) but can be optimized to O(w) where w is the maximum width of a level if parent pointers are used instead of a full visited array.^[19] In the context of search problems with branching factor b and solution depth d, the time and space complexities are O(b^d) in the worst case, as BFS explores all nodes up to depth d.^[21] However, for explicit graph representations, the O(V + E) bound is more precise and commonly used.^[20]

Completeness and Optimality

Breadth-first search (BFS) is complete, meaning it guarantees to find a solution if one exists in the search space, under appropriate conditions. Specifically, if the shallowest goal node is at some finite depth d, BFS will eventually reach it after expanding all shallower nodes, provided the branching factor b (the number of successors from any node) is finite.^[22] This property holds because BFS systematically explores the state space level by level, ensuring no shallower solutions are overlooked before deeper ones. In infinite state spaces, completeness requires that step costs are bounded below by some positive \epsilon > 0 and the branching factor remains finite.^[22] Regarding optimality, BFS finds the optimal solution—defined as the path with the lowest cost—only when all actions have equal step costs, such as in unweighted graphs where the goal is to minimize the number of edges. In this scenario, the algorithm's level-by-level expansion ensures it discovers the shallowest goal node first, which corresponds to the least-cost path since path cost increases uniformly with depth.^[22] More generally, BFS is optimal if the path cost is a nondecreasing function of the node's depth; however, when step costs vary, it may not yield the lowest-cost path, and alternatives like uniform-cost search are required for optimality.^[22]

Properties

BFS Ordering

In breadth-first search (BFS), vertices are visited in non-decreasing order of their distance from the source vertex, where distance is defined as the minimum number of edges in a path connecting them. This level-by-level ordering ensures that all vertices at distance d are explored before any vertex at distance d+1.^[7] The process begins with the source vertex at level 0, followed by its adjacent vertices at level 1, then the unvisited neighbors of level-1 vertices at level 2, and continues outward until all reachable vertices are processed or the graph is exhausted.^[6] The queue data structure enforces this breadth-first ordering by operating on a first-in, first-out (FIFO) principle: newly discovered vertices are enqueued after their parent's neighbors, delaying their processing until all vertices at the current level have been dequeued and expanded. This mechanism guarantees that closer vertices are visited before farther ones, distinguishing BFS from depth-first approaches that prioritize deeper exploration.^[23] Within the same level, the visitation order among vertices depends on the sequence in which they are enqueued, which typically reflects the order of iteration over the adjacency lists of previously visited vertices.^[24] This ordering property makes BFS particularly suitable for discovering the structure of graphs layer by layer, enabling applications such as computing shortest paths in unweighted graphs, where the discovery time of a vertex directly corresponds to its distance from the source. In directed or undirected graphs, the ordering remains consistent as long as the graph is traversed without revisiting marked vertices, preserving the FIFO discipline.^[25] For instance, in a simple path graph with vertices labeled 0 to 6 starting from 0, BFS would visit in the sequence 0, 1, 2, 3, 4, 5, 6, reflecting strict level progression.^[24]

Level Structure

In breadth-first search (BFS), the level structure organizes the vertices of a graph into discrete layers based on their shortest path distance from a designated source vertex s. The source vertex itself forms level 0, while level i (for i \geq 1) comprises all vertices reachable from s via a path of exactly i edges, but no shorter path.^[1] This partitioning ensures that BFS explores the graph outward in a concentric manner, visiting all vertices at a given distance before proceeding to the next.^[6] The construction of this level structure relies on a queue data structure, which maintains the frontier of unvisited vertices in first-in, first-out order. Upon dequeuing a vertex from level i-1, BFS enqueues its unvisited neighbors into level i, systematically building each subsequent level from the previous one.^[26] Edges in the graph connect vertices either within the same level or between consecutive levels, but never skip levels, preserving the layered hierarchy.^[1] This property holds for undirected, unweighted graphs, where the levels directly correspond to distance metrics.^[6] In practice, the level structure can be explicitly tracked using an array or map that assigns a level number to each vertex during traversal, often alongside a parent pointer for path reconstruction. For instance, in a simple graph with source s connected to vertices a and b, followed by a to c and b to d, the levels would be: level 0 = {s}, level 1 = {a, b}, level 2 = {c, d}.^[27] This organization not only facilitates shortest-path computation but also enables efficient level-wise operations, such as counting vertices per level to assess graph density at varying distances.^[26]

Variants

Bidirectional BFS

Bidirectional breadth-first search (BFS) is a graph traversal algorithm that enhances standard BFS by initiating simultaneous searches from both the source node and the target node to locate the shortest path between them more efficiently. This approach leverages two separate BFS processes: a forward search expanding from the source and a backward search expanding from the target in the reversed graph. Each search explores nodes level by level, maintaining queues for pending nodes and sets to track visited nodes in their respective directions. The algorithm continues alternating expansions between the two searches until a node is encountered that has been visited by both, indicating a meeting point where the paths can be combined to form the complete shortest path.^[28]^[29] The core mechanism relies on the uniform cost property of BFS in unweighted graphs, ensuring that levels represent equal distances. Upon detecting an overlap in the visited sets—typically checked after each expansion—the shortest path length is the sum of the distances from the source to the meeting node and from the target to the meeting node. Path reconstruction involves backtracking from the meeting node to the source via parent pointers in the forward tree and from the meeting node to the target in the backward tree. This bidirectional strategy was first formalized by Ira Pohl in his 1969 technical report on bi-directional search in path problems, extending uniform-cost search techniques like BFS to dual-direction expansion.^[28]^[28] In terms of complexity, bidirectional BFS achieves a time and space complexity of O(b^{d/2}) in tree-like structures or graphs with constant branching factor b and solution depth d, as each search only needs to explore roughly half the depth before meeting, exponentially reducing the number of nodes expanded compared to the O(b^d) of unidirectional BFS. This improvement holds under the assumption of symmetric graphs where forward and backward expansions are balanced; in practice, strategies like cardinality comparison—selecting the frontier with fewer nodes for expansion—optimize performance by minimizing redundant work. Pohl's analysis demonstrated that such decision rules can reduce expected node expansions to about one-fourth of unidirectional methods in random graphs with branching factors of 3 to 15.^[28]^[29]^[28] The algorithm remains complete and optimal for unweighted, undirected graphs, provided the backward search uses the graph's transpose to simulate reverse traversal. It is particularly advantageous in large-scale applications like social network analysis or puzzle solving, where the target is known and the graph is expansive, often expanding 70-90% fewer nodes than unidirectional BFS in empirical tests on maze-like domains. However, in highly asymmetric or directed graphs, careful handling of edge directions is required to maintain correctness, and improper intersection detection can lead to suboptimal paths. Later refinements, such as those in Holte et al.'s 1997 reconsideration of bidirectional heuristic search, confirmed these gains even without heuristics, attributing early implementation pitfalls to flawed direction-switching rules rather than inherent flaws in the approach.^[29]^[28]^[29]

Parallel BFS

Parallel breadth-first search (BFS) extends the sequential algorithm to multiprocessor environments, addressing challenges such as load imbalance, nondeterminism in frontier expansion, and communication overhead in shared and distributed memory systems. In shared-memory multicore settings, parallel BFS must manage concurrent access to the frontier—a set of vertices at the current level—while ensuring work efficiency comparable to the serial O(V + E) complexity. A seminal approach, the parallel BFS (PBFS) algorithm, achieves this by replacing traditional FIFO queues with "bags," which are multisets supporting efficient insertion, union, and splitting operations, thereby coping with the nondeterminism arising from variable out-degrees and race conditions during vertex visitation.^[30] The PBFS algorithm employs layer synchronization to process levels sequentially and uses a recursive divide-and-conquer strategy for frontier partitioning across threads. To handle nondeterministic ordering, it leverages "pennants"—balanced binary trees of size 2^k—as internal nodes in bags, enabling amortized O(1) insertions and O(lg n) splits that distribute work evenly. This design ensures work efficiency nearly matching serial BFS on a single processor and a time complexity of O((V + E)/P + D lg^3(V/D)) on P processors for graphs with diameter D, where the parallelism scales linearly until memory bandwidth constraints dominate. On benchmarks like synthetic R-MAT graphs with up to 2 million vertices, PBFS demonstrated speedups of approximately 5x on 8 cores of an Intel Core i7 processor.^[30] In distributed-memory systems, parallel BFS partitions the graph across nodes, often using vertex-centric or matrix-centric decompositions to minimize inter-node communication. Single-direction BFS (1DBFS) adopts a level-synchronous approach with vertex-based partitioning and all-to-all messaging for edge aggregation, scaling to large clusters via hybrid MPI/OpenMP implementations. It achieves high throughput, such as 17.8 billion traversed edges per second (GTEPS) on a 4.3 billion-vertex graph using 40,000 cores of the Hopper supercomputer.^[31] Direction-optimizing BFS (2DBFS) improves upon this by employing a 2D sparse matrix representation, reducing communication volume by up to 3.5x through optimized data locality and block decomposition, particularly benefiting power-law graphs like web crawls. On the uk-union graph (133 million vertices, 5.5 billion edges), 2DBFS outperformed 1DBFS by 1.5–1.8x on systems like Franklin, while both variants exceeded prior libraries like Parallel Boost Graph Library (PBGL) by 2.7–4.1x in traversal rates.^[32] These parallel variants prioritize scalability for massive graphs in applications like social network analysis, where sequential BFS becomes infeasible due to memory limits. Optimizations such as bitmap-based visited sets and hybrid traversals further enhance performance on modern architectures, though challenges like irregular memory access persist in GPU-accelerated extensions.

Applications

Graph Connectivity Problems

Breadth-first search (BFS) serves as a foundational algorithm for addressing graph connectivity problems, particularly in undirected graphs, where it efficiently determines reachability between vertices and identifies isolated substructures. By exploring vertices level by level from a starting node using a queue data structure, BFS systematically visits all reachable vertices from the source, marking them as part of the same connected component. This process ensures that no vertex within the component is overlooked, as the algorithm exhaustively examines all adjacent edges before proceeding to deeper levels.^[1] To determine if an entire undirected graph is connected, BFS can be initiated from any single vertex; if all vertices are visited upon completion, the graph forms a single connected component. Conversely, for graphs with multiple components, the algorithm is repeated iteratively: after exhausting the reachable set from one starting vertex, select an unvisited vertex and perform another BFS traversal. Each such traversal delineates one connected component, partitioning the graph into disjoint subsets where vertices within a subset are mutually reachable via undirected paths, while no paths exist between subsets. This method achieves a total time complexity of O(V + E), where V is the number of vertices and E is the number of edges, as each vertex and edge is processed at most once across all traversals.^[33]^[34] In practice, BFS-based connectivity analysis finds application in diverse domains, such as verifying network integrity in computer systems or identifying clusters in social graphs. For instance, in puzzle-solving contexts like the Rubik's Cube, BFS from an initial configuration reveals reachable states, highlighting separate connected components that represent distinct solvability classes—such as the 12 parity-based components in the cube's state graph.^[35] This approach underscores BFS's role in providing a complete and efficient characterization of graph structure without requiring additional preprocessing.^[36]

Shortest Path Finding

BFS serves as a fundamental algorithm for computing shortest paths in unweighted graphs, where path length is defined by the number of edges rather than weights. Starting from a source vertex, it systematically explores the graph level by level using a queue data structure, visiting all vertices at distance k before those at distance k+1. This level-order traversal guarantees that the first discovery of any vertex occurs via a path of minimal edge count, as any shorter path would belong to an earlier level and thus be processed first.^[37] The approach was initially described by Edward F. Moore in 1959, who applied it to model and solve maze navigation problems as graph traversals, marking the formal introduction of BFS for shortest path computation in such contexts.^[38] To reconstruct paths, BFS maintains a parent array during traversal: each vertex records the predecessor from which it was first enqueued, forming a shortest path tree rooted at the source upon completion. The time complexity is O(|V| + |E|), making it efficient for sparse graphs common in pathfinding scenarios.^[39] The optimality of BFS in unweighted graphs stems from its FIFO queue mechanism, which prevents revisiting vertices and ensures distances are non-decreasing. Formally, for any vertex v, the distance \delta(s, v) computed by BFS equals the true shortest path length, as proven by induction on the level: base case holds for the source, and assuming it for level k, all paths to level k+1 must pass through level k without shorter alternatives.^[40] In applications, BFS enables shortest path solutions in domains like robotics for obstacle-avoiding navigation on grid maps, where edges represent unit movements, and in game AI for efficient agent routing in discrete environments. It also underpins network diagnostics, such as tracing minimal hop counts in unweighted communication topologies, and puzzle solvers like the 8-queens or sliding tile problems, where states form an implicit unweighted graph.^[41] For directed acyclic graphs (DAGs), BFS adapts seamlessly to topological order, preserving shortest path guarantees without cycles complicating exploration.^[42]

pseudocode
BFS-SHORTEST-PATH(G, s)
1  Initialize distance d[v] = ∞ for all v ∈ V[G], d[s] = 0
2  Initialize parent π[v] = NIL for all v ∈ V[G]
3  Enqueue Q = {s}, mark s as visited
4  While Q not empty
5      u = DEQUEUE(Q)
6      For each neighbor v of u (unvisited)
7          d[v] = d[u] + 1
8          π[v] = u
9          ENQUEUE(Q, v)
10         Mark v as visited
11 Return [shortest path tree](/page/Shortest-path_tree) via π
BFS-SHORTEST-PATH(G, s)
1  Initialize distance d[v] = ∞ for all v ∈ V[G], d[s] = 0
2  Initialize parent π[v] = NIL for all v ∈ V[G]
3  Enqueue Q = {s}, mark s as visited
4  While Q not empty
5      u = DEQUEUE(Q)
6      For each neighbor v of u (unvisited)
7          d[v] = d[u] + 1
8          π[v] = u
9          ENQUEUE(Q, v)
10         Mark v as visited
11 Return [shortest path tree](/page/Shortest-path_tree) via π

This pseudocode illustrates the core procedure, highlighting distance updates and parent tracking essential for path recovery.^[43]

Modern Network Analysis

In modern network analysis, breadth-first search (BFS) serves as a foundational tool for exploring the structure and dynamics of complex networks, including social, biological, and web graphs. By traversing nodes level by level from a starting vertex, BFS enables the computation of shortest paths in unweighted graphs, which is crucial for quantifying network properties such as average path length and diameter. These metrics reveal characteristics like the small-world phenomenon, where networks exhibit short average distances despite high clustering. For instance, in the seminal study on small-world networks, Watts and Strogatz applied BFS to calculate the characteristic path length L—the mean shortest path between all pairs of nodes—demonstrating that real-world networks like neural systems and power grids have L values comparable to random graphs but with greater local clustering. This approach has become standard in network science for assessing connectivity and information propagation efficiency.^[44] In social network analysis, BFS is employed to uncover relational structures and influence pathways, such as identifying mutual connections or verifying the "six degrees of separation" hypothesis in large-scale platforms. Researchers use BFS to generate spanning trees from seed users, allowing measurement of reach within k hops, which informs recommendation systems and community detection. A key application is in computing betweenness centrality approximations, where BFS from multiple sources helps rank nodes by their role in shortest paths, aiding identification of influencers or bottlenecks in information flow. For example, in analyzing Twitter or Facebook graphs, BFS traversals have quantified how rumors or trends spread level by level, with studies showing average path lengths of 4-6 in networks of millions of users.^[45] This level-order exploration contrasts with depth-first methods, providing unbiased sampling of local neighborhoods essential for scalable analysis on massive graphs. Biological network analysis leverages BFS for pathway discovery and module detection in protein-protein interaction (PPI) and metabolic graphs. In PPI networks, BFS identifies densely connected subgraphs representing functional complexes by expanding from seed proteins and pruning low-density branches, as in the ClusterBFS algorithm, which outperforms traditional clustering by incorporating weighted edges for biological relevance. This method has detected complexes in yeast and human interactomes with higher precision than random sampling, revealing modules involved in signaling pathways. Similarly, BFS computes shortest paths to model disease propagation or drug targeting, where the level structure highlights intermediate metabolites or proteins in unweighted models of genetic networks. Quantitative evaluations show BFS achieving up to 20% better recall in complex prediction compared to density-based alternatives on datasets like DIP.^[46] In web network analysis, BFS underpins crawler designs and structural mapping, as demonstrated in early studies of the World Wide Web graph, where multi-source BFS traversals estimated the bow-tie structure—core, in, out, and tendril components—across billions of pages, informing PageRank optimizations and link analysis. These applications underscore BFS's role in handling irregular, scale-free topologies prevalent in modern networks.

References

[1]
[PDF] 6.006 Lecture 13: Breadth-first search (BFS) - MIT OpenCourseWare
Breadth-First-Search Algorithm. BFS (V,Adj,s):. See CLRS for queue-based implementation level = { s: 0 } parent = {s : None } i = 1 frontier = [s]. # previous ...Missing: definition | Show results with:definition
[2]
[PDF] Graph Algorithms - Stanford Computer Science
Feb 25, 2015 · Breadth-first search begins at the start node (n1), then does the one-hops (n2 and n6), then the two hops (n3, n5, and n7) and finally the three ...
[3]
[PDF] Shortest Paths
search. Breadth-first search is often attributed to Edward Moore, who described it in. (as “Algorithm A”) as the first published method to find the shortest.
[4]
The Shortest Path Through a Maze - Edward F. Moore - Google Books
Title, The Shortest Path Through a Maze Volume 3523 of Bell Telephone System. Technical publications. monograph. Author, Edward F. Moore.
[5]
Algorithms - CS Stanford
An unintelligent searching algorithm, breadth-first searching expands a node and checks each of its successors for a goal state before expanding any of the ...
[6]
[PDF] Chapter 14 Breadth-First Search
The breadth-first algorithm is a particular graph-search algorithm that can be applied to solve a variety of problems such as finding all the vertices ...
[7]
[PDF] Lecture 8 8.1 Introduction Background: Breadth First Search
Breadth First Search (BFS) is a way of exploring the part of the graph that is reachable from a particular vertex (s in the algorithm below). Procedure BFS (G( ...
[8]
[PDF] Lecture 11: Breadth-First Search - Stony Brook Computer Science
We want to visit every vertex and every edge exactly once in some well- defined order. Breadth-first search is appropriate if we are interested in shortest ...<|control11|><|separator|>
[9]
[PDF] Basic Graph Algorithms
In fact, the very first implementation of breadth-first search, written around 1945 by Konrad Zuse in his proto-language Plankalkül, was developed for the ...
[10]
[PDF] CS 106X, Lecture 22 Graphs; BFS; DFS
depth-first search (DFS): Finds a path between two vertices by exploring ... DFS uses less memory than BFS, easier to reconstruct the path once found ...
[11]
Lecture 10: Depth-First Search | Introduction to Algorithms
This class builds on the previous lecture of breadth-first search (BFS) by introducing depth-first search (DFS) and full-BFS and full-DFS.Missing: CLRS comparison
[12]
BFS and DFS - UC Irvine
BFS and DFS are graph traversals. BFS uses a queue, while DFS uses a stack, both constructing spanning trees. BFS is like Dijkstra's, but with same edge ...
[13]
[PDF] Depth-First Iterative-Deepening: An Optimal Admissible Tree Search*
The standard breadth-first and depth-first algorithms will be shown to be inferior to the depth-first iterative-deepening algorithm. We will prove that this ...
[14]
None
Below is a consolidated response summarizing the Breadth-First Search (BFS) pseudocode and explanations from Chapter 22 of "Introduction to Algorithms" by Cormen et al., based on the provided segments. Since many segments lacked Chapter 22 content, I have focused on the segments that explicitly provide BFS details (e.g., page 594, page 595, and standard references) and included a table to organize the information comprehensively. For segments without Chapter 22 content, I’ve noted their focus and omitted irrelevant details to keep the response concise yet thorough.
[15]
[PDF] BFS Algorithm Pseudocode
BFS Algorithm Pseudocode procedure BFS(G,s) for each vertex v ∈ V[G] do explored[v] ← false d[v] ← ∞ end for explored[s] ← true d[s] ← 0. Q:= a queue ...
[16]
[PDF] Chapter 22
Sep 17, 2017 · Since breadth first search always considers shorter paths first, this is not possible. 2. Suppose that (u, v) is a tree edge. Then, this ...
[17]
[PDF] Breadth-first search (BFS)
A BFS traversal of a graph returns the nodes of the graph level by level. L ... Example. Breadth-First Search. 21. Queue: B C D. Dequeue B → Queue: C D. A.
[18]
Operations on Binary Search Tree's
In the above example, the level-order traversal is 38, 5, 45, 1, 9, 47, 8, 15, 46, 13. We can implement the level-order traversal using the API's LinkedList ...
[19]
Graph traversals - CS 2112/ENGRD 2112 Fall 2018
Time and space complexity So the asymptotic time complexity of BFS and DFS is linear, i.e. O(|V| + |E|). During execution, the BFS and DFS algorithms maintain ...
[20]
Lecture 13: Breadth-First Search (BFS) | Introduction to Algorithms
This lecture begins with a review of graphs and applications of graph search, discusses graph representations such as adjacency lists, and covers breadth-first ...
[21]
[PDF] Uninformed Search - cs.wisc.edu
Breadth-First Search (BFS). ○ Time and space complexity: O(bd) (i.e., exponential). – d is the depth of the solution. – b is the branching factor at each non ...
[22]
[PDF] 3 SOLVING PROBLEMS BY SEARCHING
– Breadth-first search expands the shallowest nodes first; it is complete, optimal ... Chapters 3 and 4 from Russell/Norvig, Artificial Intelligence, 3e ...
[23]
CS 225 | BFS & DFS
BFS and DFS are two simple but useful graph traversal algorithms. In this article, we will introduce how these two algorithms work and their properties. BFS.
[24]
CS302 Lecture Notes - Breadth First Search - UTK-EECS
Mar 28, 2022 · The order of the nodes is now 0, 1, 3, 2, 4, 5, 6, 7. BFS still visits all nodes and edges, but it does so in order of distance from the ...
[25]
[PDF] Lecture 4 Breadth-First Search
BFS visits vertices in order of increasing distance from s. In fact, our BFS algorithm above labels each vertex with the distance from s, or the number of edges ...
[26]
[PDF] Breadth-First Search
... and Tamassia. Breadth-First Search. 2. Breadth-First Search. ❑ Breadth-first search. (BFS) is a general technique for traversing a graph. ❑ A BFS traversal of a.
[27]
CS140 Final Project: Breadth-First Search (Cilk++)
BFS explores the vertices and edges of a graph, beginning from a specified "starting vertex" that we'll call s. It assigns each vertex a "level" number, which ...
[28]
[PDF] slac-104 uc-32 (misc) bi-directional and heuristic search in path ...
IRA POHL. STANFORD LINEAR ... The extension of the class of uni-directional admissible algorithms A*3o to bidirectional algorithms at first appears simple.
[29]
[PDF] Bidirectional Heuristic Search Reconsidered - arXiv
While the original algorithm BHPA proposed by Pohl (1971) may actually show such ine cient performance, the missile metaphor is wrong and misleading. We ...
[30]
Parallel Breadth-First Search Optimization Strategies
May 2, 2025 · In this paper, we introduce a set of optimization strategies for parallel BFS on multicore systems, including hybrid traversal, bitmap-based visited set, and a ...
[31]
https://arxiv.org/pdf/1104.4518
[32]
[PDF] Graph Search
Given G and node u, find all nodes that are connected to u. 3. Find all connected components of G. Can be accomplished in O(m + n) time using BFS or DFS. BFS ...
[33]
[PDF] Graphs Breadth-first Search and Search Uses Carol Zander
The breadth-first algorithm visits all adjacent nodes (children) before visiting their adjacent nodes, and so on. As with depth-first, when we visit each vertex ...<|control11|><|separator|>
[34]
[PDF] Breadth First Search, Dijkstra's Algorithm for Shortest Paths
Mar 28, 2023 · BFS solves single-source shortest path problems in unweighted graphs (both undirected and directed) in O(n + m) time. BFS is obtained from Basic ...
[35]
Moore, E.F. (1959) The Shortest Path through a Maze ... - Scirp.org.
Moore, E.F. (1959) The Shortest Path through a Maze. Proceedings of the International Symposium on the Theory of Switching, 285-292.
[36]
[PDF] Shortest Paths CSE 373: Data Structures and Algorithms - Washington
- BFS works to find the shortest path summary because BFS traverses the graph level by level outwards from the start -- because we're making sure we look at ...
[37]
[PDF] Readings Review: Graph Representations Review: BFS (Breadth ...
... Breadth First Search ... Proof of Correctness: The correctness of this algorithm follows from the lemma that BFS outputs the shortest paths in an unweighted graph ...
[38]
Shortest path algorithms
Applications: robot's path planning; game AI. ... Algorithms for single source shortest path problem: BFS, Dijkstra's, Bellman-Ford. BFS (for unweighted graphs)
[39]
Graph Traversal: BFS and DFS
BFS can be used to find the single-source shortest-path(s) in unweighted graphs (where each edge has a unit cost), which is also known as “uniform cost” search ...Missing: breadth- | Show results with:breadth-
[40]
[PDF] Chapter 12 Shortest Paths
The reason why BFS works on unweighted graphs is quite interesting and helpful for understanding other shortest path algorithms. The key idea behind BFS is ...
[41]
Chapter 2 - Network Science by Albert-László Barabási
Breadth-First Search (BFS) Algorithm. BFS is a frequently used algorithms in network science. Similar to throwing a pebble in a pond and watching the ripples ...
[42]
https://web.engr.oregonstate.edu/~huanlian/algorithms_course/3-graph/bfsdfs.html
[43]
A Novel Algorithm for Detecting Protein Complexes with the Breadth ...
ClusterBFS derives from the breadth first search method and constitutes protein complexes that originate from a protein seed based on the weighted density. In ...