
Branch and bound

Branch and bound (B&B) is an algorithmic framework for solving optimization problems, particularly those involving integer or discrete variables, by systematically partitioning the feasible solution space into subproblems (branching) and using relaxation-based bounds to prune branches that cannot contain the global optimum, thereby guaranteeing an exact solution while avoiding exhaustive enumeration. Introduced in 1960 by Ailsa H. Land and Alison G. Doig for mixed-integer linear programming problems, the method constructs a search tree where each node represents a subproblem, and fathoming criteria—such as bound domination or infeasibility—eliminate non-promising paths to focus computational effort on viable candidates. The core components of B&B include branching, which divides a subproblem by imposing constraints (e.g., forcing a variable to integer values above or below its fractional value in a relaxation solution); bounding, which computes optimistic estimates of the optimal value for each subproblem to assess its potential; and pruning, which discards subtrees based on rules such as a lower bound exceeding the current best upper bound or the subproblem being infeasible. Node selection strategies, such as best-first (prioritizing the most promising bound) or depth-first, further influence efficiency, with the overall process continuing until all leaves are fathomed or the optimality gap closes to zero. While originally designed for linear relaxations, B&B has been extended to nonlinear, global, and spatial variants for nonconvex problems. B&B is foundational in operations research and computer science, powering solvers for mixed-integer programming (MIP), the traveling salesman problem (TSP), scheduling, and combinatorial optimization, where it efficiently handles NP-hard instances by leveraging problem-specific relaxations and heuristics. Early extensions, such as Little et al.'s 1963 application to TSP, demonstrated its versatility beyond pure integer programs, while modern advances incorporate machine learning for branching decisions and parallel computing to scale to larger instances.
Despite exponential worst-case complexity, empirical performance often yields practical solution times for real-world applications.

Fundamentals

Definition and Purpose

Branch and bound is a systematic enumeration algorithm designed for solving optimization problems, particularly those involving integer or discrete variables. It operates by constructing a tree of partial solutions, where each node represents a subproblem, and systematically explores the solution space while discarding branches that provably cannot produce better results than the best solution found, based on computed upper or lower bounds. This approach was first formalized as a method for discrete programming problems, extending linear programming techniques to handle constraints where variables must take indivisible values. The primary purpose of branch and bound is to address NP-hard challenges, such as integer programming, by significantly reducing the search space compared to exhaustive brute-force enumeration. Unlike heuristic methods that may yield approximate solutions without optimality guarantees, branch and bound provides a provably optimal solution by ensuring that pruned branches cannot contain better outcomes than the current best feasible solution found. It is especially suited for problems where the objective is to minimize or maximize a function subject to constraints, offering efficiency in scenarios where the number of possible solutions grows exponentially. At its core, branch and bound relies on distinguishing between continuous and discrete optimization problems; while continuous problems allow variables to take any real values and can often be solved efficiently with methods like simplex, discrete problems—common in operations research and computer science—require handling finite, often integer-based, feasible sets that lead to combinatorial explosion. The algorithm's mathematical foundation lies in the state-space tree, a conceptual structure where the root node embodies the full problem, internal nodes denote partial assignments of variables, and edges represent decisions that fix variable values, progressively refining the search toward complete, feasible solutions at the leaves.
This tree-based partitioning enables targeted exploration, guided by bounds to fathom suboptimal paths early.

Historical Development

The branch and bound method originated in the field of operations research during the post-World War II era, when efforts to solve complex optimization problems in military and industrial planning contexts spurred advancements in mathematical programming. It was first formalized in 1960 by Ailsa H. Land and Alison G. Doig as an automated approach to discrete programming problems, particularly integer linear programs, where traditional relaxations often yielded non-integer solutions. Their seminal paper introduced the core idea of systematically partitioning the feasible region into subproblems (branching) while using bounds to prune infeasible or suboptimal branches, marking a foundational milestone in exact optimization techniques. In the 1970s, the method expanded through contributions that refined its integration with linear programming relaxations, enhancing its applicability to mixed-integer problems. E. M. L. Beale and J. A. Tomlin developed key extensions, including special ordered sets, which improved the handling of nonconvex functions and related combinatorial constraints within general mathematical programming systems. Concurrently, advancements in bounding techniques emerged via Lagrangian relaxation, pioneered by A. M. Geoffrion, who demonstrated its use to generate tighter dual bounds for branch and bound trees in integer programming. These developments, building on the 1960 framework, solidified branch and bound as a robust exact framework for practical applications. By the 1980s and 1990s, branch and bound became integral to commercial solvers, with early implementations in commercial mathematical programming systems (such as those from SCICON) enabling practical use for large-scale mixed-integer programs. This evolution culminated in its adoption by leading optimizers such as CPLEX (introduced in 1988) and later Gurobi (2008), which incorporated advanced branch and bound engines with cutting planes and heuristics to handle industrial-scale problems efficiently.
Post-2010 trends have further adapted the method for modern computing, including parallel branch and bound algorithms that distribute tree exploration across multi-core processors and machine learning-assisted bounding to predict strong relaxations and reduce search times. As of 2025, further advancements include the integration of machine learning for automated branching decisions in solvers, improving performance on complex instances.

Algorithm Mechanics

Core Procedure

The branch and bound algorithm operates by systematically exploring the solution space of an optimization problem through a search tree, where each node represents a subproblem obtained by adding constraints to the original problem. The process begins with solving a relaxation of the original problem—typically by ignoring integrality constraints—to obtain an initial upper or lower bound on the optimal value, serving as the root of the search tree. If this relaxed solution satisfies all constraints (e.g., all variables are integer-valued), it is the optimal solution, and the algorithm terminates; otherwise, the algorithm proceeds to branch and bound iteratively. The core steps involve: (1) selecting an unexplored node from the list of live subproblems and computing a bound for its subproblem using a relaxation or heuristic; (2) branching by partitioning the subproblem into two or more child subproblems, often by fixing a fractional variable to its floor or ceiling value; (3) evaluating bounds for these child nodes; (4) pruning (fathoming) any child whose bound indicates it cannot yield a better solution than the current best feasible solution (incumbent); and (5) repeating the process until all nodes are fathomed or the incumbent is proven optimal. Branching typically fixes one variable at a time, creating a tree where each level adds one more decision, progressing toward complete solutions at the leaves. The tree has a root representing the full relaxed problem, internal nodes denoting partial constraints, and leaves corresponding to complete feasible solutions or infeasible subproblems. Fathoming occurs under three main criteria: reaching a feasible solution for the subproblem (all variables fixed and constraints satisfied); declaring a subproblem infeasible (no feasible solution exists); or bound domination (the subproblem's bound is worse than the incumbent's value, eliminating further exploration). Pruning mechanisms rely on updating the incumbent whenever a better feasible solution is found during traversal, which tightens bounds for subsequent nodes and discards dominated branches.
Traversal can be depth-first (exploring one branch deeply before backtracking, which is memory-efficient), breadth-first (processing levels sequentially, requiring more memory), or best-bound-first (prioritizing the most promising node by bound value to find good solutions quickly). In practice, depth-first traversal is common for its simplicity, though hybrid approaches may be used. The algorithm has worst-case exponential time complexity, as the search tree can grow to encompass all possible variable combinations in the absence of pruning, but effective bounding often reduces the explored space significantly, leading to tractable performance on many structured instances.

Pseudocode Representation

The branch and bound algorithm is commonly implemented using a queue-based structure to manage open subproblems, with fathoming applied via bound comparisons to prune unpromising branches. This representation assumes a minimization problem where lower bounds are computed for subproblems and an incumbent solution provides an upper bound on the optimal value. Branching typically involves selecting a fractional variable and creating child subproblems by fixing it to floor and ceiling values. The following pseudocode outlines a generic eager variant, where bounds are computed immediately after branching (Clausen 1999).
pseudocode
Algorithm BranchAndBound(P0)  // P0 is the initial problem
Initialize:
    incumbent := ∞  // Best known solution value (upper bound for minimization)
    Live := {(P0, LB(P0))}  // Queue of open nodes, each with lower bound LB
    // Compute initial lower bound LB(P0) using relaxation or heuristic

While Live ≠ ∅:
    Select and remove node P from Live  // e.g., best-first by lowest LB(P)
    If P is a leaf (feasible solution X found):
        If f(X) < incumbent:  // f is the objective function
            incumbent := f(X)
            bestSolution := X
    Else:
        Branch on P: Select variable (e.g., most fractional) and generate children P1, ..., Pk
        For each child Pi:
            Compute LB(Pi)  // Lower bound, e.g., via LP relaxation
            If LB(Pi) ≥ incumbent:  // Fathoming: prune by bound
                Discard Pi  // Upper bound on optimum (incumbent) dominates
        Else If the relaxation solution at Pi is an integer-feasible X with f(X) < incumbent:
                incumbent := f(X)
                bestSolution := X
                // Pi is solved by its relaxation; do not add it to Live
            Else:
                Add (Pi, LB(Pi)) to Live

Return bestSolution, incumbent  // Optimal if tree fully explored
Key elements include the initialization of the incumbent and live node queue, fathoming when a subproblem's lower bound meets or exceeds the incumbent, and recursive subdivision until leaves yield feasible solutions or pruning occurs (Land and Doig 1960). The traversal order, such as depth-first or best-first search, influences efficiency but is not specified here (Clausen 1999). Placeholders for bound computation (e.g., LP relaxation) and variable selection allow adaptation to specific problems like integer linear programming.
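The pseudocode above can be realized as a short runnable program. The sketch below (Python, with illustrative function names not taken from any solver) specializes it to the 0-1 knapsack problem; because knapsack is a maximization, the bound directions flip relative to the minimization pseudocode: each live node carries an upper bound from the fractional relaxation and is fathomed when that bound cannot beat the incumbent.

```python
import heapq

def fractional_bound(values, weights, capacity, level, value, weight):
    """Upper bound: fill remaining capacity greedily, allowing one fractional
    item. Assumes items are sorted by decreasing value density."""
    bound, room = value, capacity - weight
    for i in range(level, len(values)):
        if weights[i] <= room:
            room -= weights[i]
            bound += values[i]
        else:
            bound += values[i] * room / weights[i]  # fractional top-up
            break
    return bound

def branch_and_bound_knapsack(values, weights, capacity):
    # Sort by value density so the relaxation bound is valid and tight.
    order = sorted(range(len(values)), key=lambda i: values[i] / weights[i], reverse=True)
    v = [values[i] for i in order]
    w = [weights[i] for i in order]
    incumbent, best = 0, []
    # Live nodes: (-upper bound, level, value, weight, chosen original indices).
    live = [(-fractional_bound(v, w, capacity, 0, 0, 0), 0, 0, 0, [])]
    while live:
        neg_ub, level, value, weight, chosen = heapq.heappop(live)  # best-first
        if -neg_ub <= incumbent:
            continue                              # fathom by bound
        if level == len(v):
            incumbent, best = value, chosen       # leaf improves the incumbent
            continue
        if weight + w[level] <= capacity:         # include branch, if it fits
            lvl, val, wt = level + 1, value + v[level], weight + w[level]
            heapq.heappush(live, (-fractional_bound(v, w, capacity, lvl, val, wt),
                                  lvl, val, wt, chosen + [order[level]]))
        heapq.heappush(live, (-fractional_bound(v, w, capacity, level + 1, value, weight),
                              level + 1, value, weight, chosen))
    return incumbent, sorted(best)
```

The negated bound in the heap turns Python's min-heap into the best-first priority queue of the pseudocode.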

Enhancements and Variants

Branching Strategies

Branching strategies in branch and bound algorithms guide the construction of the search tree by determining which variable to partition and how to divide its domain, directly influencing the efficiency of the overall search. These strategies aim to create subproblems that tighten bounds quickly, thereby pruning unproductive branches early. Basic approaches prioritize simplicity, while advanced methods leverage lookahead computations or historical data to select variables with the greatest potential impact on the solution process. A foundational branching method selects an integer-constrained variable that takes a fractional value in the relaxation solution and splits its domain into two subintervals at the nearest integers below and above that value, typically using binary splits for integer variables (e.g., fixing to floor or ceiling). For variable selection, the most fractional rule identifies the variable whose fractional part is closest to 0.5, as this maximizes the deviation from integrality and is expected to constrain the feasible region most effectively. This heuristic, which emphasizes the variable with the highest immediate impact on feasibility, traces back to the original formulation for discrete programming problems. Advanced strategies refine this process to better anticipate bound improvements. Strong branching evaluates a set of candidate fractional variables by approximately solving the linear programming relaxations of their child subproblems and selects the one yielding the largest combined increase in bound strength (e.g., via the sum of downward and upward bound gains). Although more computationally intensive—often limited to 8-10 candidates per node—this approach produces superior variable choices that reduce tree size compared to static rules.
Pseudocost branching, in contrast, relies on empirical history: for each variable and direction (up or down), it maintains a pseudocost as the average ratio of observed objective change to the integer gap from prior branchings on that variable. The candidate with the highest predicted bound improvement, computed as the pseudocost times the current fractional distance, is selected; initial values are often set via strong branching until sufficient history accumulates. This method efficiently approximates strong branching's lookahead using low-overhead statistics, making it a staple in modern solvers. Recent advancements incorporate machine learning techniques to enhance branching decisions. For instance, reinforcement learning models can learn to select variables by imitating strong branching or optimizing long-term tree growth, achieving performance improvements on large-scale mixed-integer programs. These approaches, trained on historical solver data, have been integrated into commercial solvers to reduce node counts significantly as of 2024. In addition to variable selection, the order of traversing the search tree—via node selection rules—affects how quickly viable solutions are found and unproductive paths pruned. Depth-first traversal employs a last-in-first-out stack to dive deeply into one branch before backtracking, offering low memory use (linear in tree depth) but potentially delaying global bound updates that enable fathoming. Breadth-first traversal uses a first-in-first-out queue to process nodes level by level, yielding earlier bound improvements at the cost of exponential memory growth. Best-first traversal mitigates these trade-offs with a priority queue ordered by node bounds (e.g., lowest lower bound for minimization), systematically favoring promising subproblems to balance depth and breadth while minimizing explored nodes. 
These strategies collectively reduce the number of nodes processed in the tree; for instance, combining pseudocost branching with best-first selection can decrease evaluations by factors of 10 or more on mixed-integer problems compared to naive depth-first with most fractional rules, as demonstrated in empirical studies of solver performance.
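The two selection rules above can be made concrete in a short sketch (Python; names are illustrative, and the product combination of the two directional pseudocost estimates is one common scoring choice among several used in practice):

```python
import math

def most_fractional(x, tol=1e-6):
    """Most-fractional rule: return the index whose fractional part is closest
    to 0.5, or None when every component is (numerically) integral."""
    best_score, best_i = None, None
    for i, xi in enumerate(x):
        f = xi - math.floor(xi)
        if f < tol or f > 1 - tol:
            continue                       # already integral; not a candidate
        score = abs(f - 0.5)               # smaller score = closer to 0.5
        if best_score is None or score < best_score:
            best_score, best_i = score, i
    return best_i

def pseudocost_score(x, psi_down, psi_up, i):
    """Predicted bound improvement from branching on x_i: the directional
    pseudocost (average objective change per unit of fractionality) times the
    fractional distance to the floor (down) or ceiling (up) branch; the product
    of the two one-sided estimates rewards balanced improvement."""
    f = x[i] - math.floor(x[i])
    return (psi_down[i] * f) * (psi_up[i] * (1 - f))
```

A solver would score every fractional candidate with `pseudocost_score` and branch on the maximizer, falling back to strong branching while pseudocost history is still empty.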

Bounding Techniques

In branch and bound algorithms for minimization problems, lower bounds are computed at each node to estimate the minimum objective value in the corresponding subproblem, enabling the pruning of branches that cannot yield better solutions than the current global upper bound. The most common technique for obtaining lower bounds is linear programming (LP) relaxation, where integrality constraints are ignored, transforming the integer program into a continuous optimization problem that can be solved efficiently. For an integer linear program of the form \min \{ \mathbf{c}^T \mathbf{x} \mid A\mathbf{x} \leq \mathbf{b}, \mathbf{x} \geq \mathbf{0}, \mathbf{x} \in \mathbb{Z}^n \}, the LP relaxation drops the requirement \mathbf{x} \in \mathbb{Z}^n, allowing real-valued \mathbf{x} \geq \mathbf{0} and yielding a lower bound equal to \min \{ \mathbf{c}^T \mathbf{x} \mid A\mathbf{x} \leq \mathbf{b}, \mathbf{x} \geq \mathbf{0} \}. This relaxation provides a valid bound because the feasible region of the LP includes that of the original problem, and the minimum over a larger set can be no larger than the minimum over the smaller set. LP relaxations are typically solved using the simplex method, which not only computes the primal bound but also generates dual information for sensitivity analysis and stronger bounds in subsequent nodes. The tightness of the bound is often measured by the relative optimality gap, defined as \frac{UB - LB}{UB}, where UB is the current upper bound and LB is the lower bound; a small gap indicates that the relaxation closely approximates the true optimum. For problems where LP relaxations are weak—such as those with complicating constraints—Lagrangian relaxation offers an alternative by dualizing selected constraints into the objective function via multipliers, resulting in an unconstrained or easier subproblem whose solution provides a lower bound.
In Lagrangian relaxation, for complicating constraints A'\mathbf{x} \leq \mathbf{b}', the relaxed problem becomes \min \{ \mathbf{c}^T \mathbf{x} + \boldsymbol{\lambda}^T (A'\mathbf{x} - \mathbf{b}') \mid A''\mathbf{x} \leq \mathbf{b}'', \mathbf{x} \geq \mathbf{0}, \mathbf{x} \in \mathbb{Z}^n \}, where \boldsymbol{\lambda} \geq \mathbf{0} are multipliers; the best bound is found by maximizing over \boldsymbol{\lambda} (the Lagrangian dual). This approach is particularly effective for structured problems like the traveling salesman problem, where it has historically provided tighter bounds than LP relaxation alone. Upper bounds in minimization branch and bound are obtained by identifying feasible integer solutions, often through heuristics applied at nodes or by rounding fractional solutions from LP relaxations. Rounding techniques, such as rounding up variables in a basic feasible solution to satisfy integrality while preserving feasibility, can quickly generate candidate solutions to update the global upper bound. Heuristics like greedy construction or local search may also be invoked to explore the subproblem's feasible region, providing practical upper bounds that facilitate early pruning. Surrogate relaxations combine multiple constraints a_i^T \mathbf{x} \leq b_i into a single surrogate constraint \sum_i \alpha_i a_i^T \mathbf{x} \leq \sum_i \alpha_i b_i, where \boldsymbol{\alpha} \geq \mathbf{0} and \sum_i \alpha_i = 1, to form a simpler subproblem that yields valid bounds for the original problem. The optimal surrogate multipliers are found by solving a dual problem to maximize the bound, which can outperform standard relaxations in cases with many constraints. A key pruning rule in branch and bound relies on these bounds: a node is fathomed (pruned) if its lower bound is greater than or equal to the current global upper bound, as no solution in that subtree can improve the incumbent. This mechanism, combined with bound tightening through relaxations, significantly reduces the search space explored.
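A hedged sketch of Lagrangian bounding for the 0-1 knapsack problem (a maximization, so dualizing the capacity constraint yields an upper bound). The coarse grid search over multipliers below stands in for the subgradient methods used in practice, and all names are illustrative:

```python
def lagrangian_knapsack_bound(values, weights, capacity, lam):
    """Dualize the capacity constraint with multiplier lam >= 0:
    max_x sum(v_i x_i) + lam * (W - sum(w_i x_i)) over x in {0,1}^n.
    The relaxed problem decomposes item by item: set x_i = 1 exactly when
    v_i - lam * w_i > 0, giving a closed-form valid upper bound."""
    return lam * capacity + sum(max(0.0, v - lam * w)
                                for v, w in zip(values, weights))

def best_lagrangian_bound(values, weights, capacity, steps=4000, lam_max=10.0):
    """Approximate the Lagrangian dual (min over lam of the bound) by a grid
    search; a real implementation would use subgradient updates instead."""
    return min(lagrangian_knapsack_bound(values, weights, capacity,
                                         k * lam_max / steps)
               for k in range(steps + 1))
```

On the knapsack instance used later in this article (values 40, 25, 25; weights 20, 15, 15; capacity 30) the dual bound approaches 170/3 ≈ 56.67, matching the fractional LP bound, and dominates the integer optimum of 50.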

Practical Applications

Combinatorial Optimization

Branch and bound is widely applied to combinatorial optimization problems, particularly NP-hard search problems such as the traveling salesman problem (TSP) and graph coloring, where it systematically explores the solution space while pruning suboptimal branches to guarantee exact optimal solutions. In these contexts, the algorithm constructs a search tree representing partial solutions, using tight lower bounds to fathom branches that cannot yield better outcomes than the current best solution. This approach excels in providing provable optimality for instances where exhaustive enumeration would be infeasible, though its effectiveness depends on the quality of bounding mechanisms to manage the inherent exponential complexity of the search space. For the traveling salesman problem (TSP), branch and bound builds a search tree where nodes correspond to partial tours, branching by extending the tour with feasible next cities while ensuring no cycles or repetitions. Lower bounds are computed using the Held-Karp relaxation, which involves finding a minimum-cost 1-tree—a spanning tree on the non-root vertices with two added edges joining the root—solved via Lagrangian relaxation of the degree constraints with iterative (subgradient) multiplier updates, providing a tight underestimate of the optimal tour cost derived from the minimum spanning tree augmented for degree constraints. This bounding technique, originally developed for TSP lower bounds, enables effective pruning, as demonstrated in early implementations that solved instances with up to 40 cities efficiently. In graph coloring, the branch and bound tree represents partial color assignments to vertices, branching on possible colors for uncolored vertices while maintaining adjacency constraints. Bounds are derived from independent set relaxations, where the linear programming relaxation of the set-covering formulation over independent sets of the uncolored subgraph provides a lower bound on the number of additional colors needed, or from clique covers in the complement graph to upper-bound the chromatic number.
These relaxations leverage the fact that a valid coloring partitions the graph into independent sets, allowing the algorithm to fathom branches where the partial coloring plus the relaxation bound exceeds the current best. Seminal branch-and-price variants, which extend branch and bound by dynamically generating independent set columns, have been particularly effective for dense graphs. The primary advantage of branch and bound in these combinatorial settings is its ability to guarantee optimality for small-to-medium instances; for example, basic implementations can exactly solve TSP instances with up to 100 cities in seconds on modern hardware, while advanced solvers handle larger cases through integrated enhancements. However, the algorithm faces limitations due to exponential growth in the search tree for large instances, as the number of partial solutions scales factorially with the number of cities in the TSP or exponentially with the number of vertices in graph coloring. This is often mitigated by exploiting problem symmetries, such as rotational invariance in tours, via preprocessing or branching restrictions to reduce redundant subtrees without altering the optimal value.
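As an illustration, a plain 1-tree bound (without the Lagrangian degree multipliers that Held and Karp layer on top) can be computed in a few lines; the function name and the distance-matrix convention are assumptions:

```python
def one_tree_bound(dist):
    """Plain 1-tree lower bound for the symmetric TSP: the cost of a minimum
    spanning tree on vertices 1..n-1 (Prim's algorithm) plus the two cheapest
    edges incident to vertex 0. Every tour is a 1-tree, so no tour is cheaper.
    `dist` is a symmetric n x n distance matrix, n >= 3."""
    n = len(dist)
    mst_cost = 0.0
    best = {v: dist[1][v] for v in range(2, n)}  # cheapest connection so far
    while best:
        v = min(best, key=best.get)              # grow the tree greedily
        mst_cost += best.pop(v)
        for u in best:
            if dist[v][u] < best[u]:
                best[u] = dist[v][u]
    two_cheapest = sorted(dist[0][v] for v in range(1, n))[:2]
    return mst_cost + sum(two_cheapest)
```

In a B&B tree for the TSP, this bound would be recomputed on the subgraph consistent with the partial tour at each node and compared against the incumbent tour length.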

Integer Linear Programming

Integer linear programming (ILP) problems are formulated as minimizing c^T x subject to linear constraints A x \leq b, non-negativity x \geq 0, and integrality requirements on some or all variables, x \in \mathbb{Z}^n. Branch and bound addresses these by relaxing the integrality constraints to obtain a linear program (LP) at each node of a search tree, providing lower bounds on the optimal value for minimization problems. The process begins at the root node by solving the LP relaxation using methods like the simplex algorithm. If the solution is integer-feasible, it serves as a candidate for the global optimum. Otherwise, a fractional variable is selected for branching, creating two child nodes with tightened bounds (e.g., x_i \leq \lfloor \hat{x}_i \rfloor and x_i \geq \lceil \hat{x}_i \rceil, where \hat{x}_i is the fractional value). LP relaxations are then resolved at these children, with fathoming occurring if a node's bound is worse than the current best integer solution or the node proves infeasible. Modern implementations leverage simplex basis information from parent nodes to warm-start child solves, enhancing efficiency. Commercial solvers such as IBM's CPLEX, developed since 1987, and Gurobi, founded in 2008, integrate advanced branch and bound frameworks with presolving techniques to reduce problem size before tree exploration. These tools routinely solve ILPs with thousands of variables by employing sophisticated node selection, variable branching heuristics, and primal-dual methods for bounding. Parallel implementations exploiting multi-core processors have become standard since the 2010s, distributing tree exploration across threads to accelerate solution times for large-scale instances.
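The floor/ceiling branching step can be sketched as a small helper (illustrative, not any particular solver's API): each child inherits the parent's variable bounds with one side tightened.

```python
import math

def branch_on(x_hat, bounds, i):
    """Split on the fractional relaxation value x_hat[i]: the left child adds
    x_i <= floor(x_hat[i]) and the right child adds x_i >= ceil(x_hat[i]).
    `bounds` maps variable index -> (lower, upper); children get copies with
    the relevant bound tightened, excluding x_hat[i] from both."""
    lo, hi = bounds[i]
    left, right = dict(bounds), dict(bounds)
    left[i] = (lo, math.floor(x_hat[i]))
    right[i] = (math.ceil(x_hat[i]), hi)
    return left, right
```

Because the two children exclude only the open interval between the floor and ceiling, which contains no integers, no integer-feasible point is lost by the split.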

Connections to Other Techniques

Relation to Dynamic Programming

Branch and bound and dynamic programming share fundamental similarities as recursive, divide-and-conquer approaches to solving combinatorial optimization problems, where solutions are constructed by breaking down the problem into overlapping subproblems and systematically exploring the solution space. Both techniques leverage dominance relations or bounding mechanisms to eliminate suboptimal paths, akin to memoization in dynamic programming, which avoids redundant computations by storing intermediate results. This recursive structure allows them to address problems with optimal substructure, such as the knapsack problem or the traveling salesman problem, by evaluating partial solutions to guide the search. A key difference lies in their handling of subproblems: dynamic programming exhaustively tabulates solutions for all possible states, making it particularly effective for problems with overlapping subproblems and manageable state spaces, such as the exact 0-1 knapsack problem, where it achieves pseudo-polynomial time complexity O(nW) with n items and capacity W. In contrast, branch and bound performs dynamic pruning based on upper or lower bounds without fully enumerating the state space, which is advantageous for exponential-scale problems but can lead to higher worst-case complexity, such as O(n2^n) for knapsack without tight bounds. While dynamic programming guarantees optimality through complete computation, branch and bound relies on the quality of bounds to fathom branches early, potentially exploring far fewer nodes when bounds are tight but degenerating toward full enumeration when they are loose. Branch and bound is preferable for sparse search trees or problems where tight bounds can significantly reduce exploration, such as integer linear programs with few feasible solutions, whereas dynamic programming excels in dense state spaces amenable to tabulation, like sequence alignment or shortest paths in DAGs.
An important overlap occurs when branch and bound incorporates dynamic programming techniques to compute bounds at individual nodes, enhancing efficiency by combining the exhaustive power of dynamic programming with selective pruning, as demonstrated in hybrid approaches for the nonlinear knapsack or traveling salesman problems. This integration allows branch and bound to reduce storage and computation requirements in discrete dynamic programming settings by fathoming suboptimal states early.
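The dynamic programming side of this comparison can be made concrete with the standard O(nW) tabulation for the 0-1 knapsack (a textbook sketch; the one-dimensional table is swept right-to-left so each item is used at most once):

```python
def knapsack_dp(values, weights, capacity):
    """Pseudo-polynomial tabulation: table[c] holds the best value achievable
    with capacity c using the items processed so far. O(nW) time, O(W) space."""
    table = [0] * (capacity + 1)
    for v, w in zip(values, weights):
        for c in range(capacity, w - 1, -1):   # reverse sweep: 0-1, not unbounded
            table[c] = max(table[c], table[c - w] + v)
    return table[capacity]
```

Unlike branch and bound, every one of the (n+1)(W+1) states is touched regardless of how tight any bound is, which is exactly the trade-off described above.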

Integration with Cutting Plane Methods

The branch-and-cut algorithm integrates cutting plane methods into the branch-and-bound framework by dynamically generating and adding valid inequalities, known as cuts, to the linear programming (LP) relaxations at various nodes of the search tree, thereby tightening the bounds and potentially eliminating suboptimal branches before further subdivision occurs. This approach enhances the efficiency of solving mixed-integer programming (MIP) problems by reducing the relaxation gap without excluding any feasible integer solutions. Key cutting plane methods employed in branch-and-cut include Gomory fractional cuts, introduced in the late 1950s and early 1960s, which derive valid inequalities from the fractional parts of the optimal LP solution to eliminate non-integer points. Another prominent class is mixed-integer rounding (MIR) cuts, developed in the 1980s, which aggregate constraints and apply rounding procedures to generate stronger inequalities for mixed-integer sets, often providing a complete description for certain 0-1 polyhedra. Separation routines are crucial for identifying violated cuts; these involve heuristic or exact procedures, such as solving auxiliary optimization problems (e.g., shortest path algorithms for certain inequalities), to detect and add cuts that cut off the current fractional solution. In the overall framework, at each node, the LP relaxation is solved; if the optimality gap remains large, violated cuts are separated and added to the formulation to strengthen the bound, with branching applied only when the gap is sufficiently small or no further cuts can be found. This integration is standard in modern MIP solvers like CPLEX and Gurobi, where cutting planes contribute to substantial reductions in the number of nodes explored, often achieving 20-50% fewer nodes compared to pure branch and bound on benchmark instances.
Recent advancements incorporate machine learning techniques for cut selection and separation, such as neural networks trained to predict effective cuts or configure separators, leading to improved bound tightening in the 2020s. For instance, learning-based methods restrict the selection space and guide separation routines, enhancing performance on large-scale MIPs beyond traditional heuristics.
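The mechanics of a Gomory fractional cut can be sketched directly: given a simplex tableau row \sum_j a_j x_j = b with fractional right-hand side, the inequality \sum_j \{a_j\} x_j \geq \{b\} (braces denoting fractional parts) is valid for every nonnegative integer solution but violated by the current fractional basic solution. A minimal helper (illustrative name), assuming a pure-integer row:

```python
import math

def gomory_fractional_cut(row, rhs):
    """Return the cut coefficients and right-hand side: frac(a_j) and frac(b).
    The cut sum_j frac(a_j) x_j >= frac(b) is valid for integer x >= 0
    whenever frac(b) > 0 (a row with integral rhs yields no cut)."""
    frac = lambda t: t - math.floor(t)
    return [frac(a) for a in row], frac(rhs)
```

Note that frac(-0.25) = 0.75 here, as required by the derivation: fractional parts are taken relative to the floor, so all cut coefficients are nonnegative.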

Illustrative Example

Knapsack Problem Demonstration

The 0-1 knapsack problem seeks to maximize the total value of selected items subject to a weight capacity constraint, where each item can be included at most once. Formally, it is given by \max \sum_{i=1}^n v_i x_i subject to \sum_{i=1}^n w_i x_i \leq W, \quad x_i \in \{0, 1\} \quad \forall i = 1, \dots, n, where v_i and w_i are the value and weight of item i, and W is the knapsack capacity. To illustrate the branch and bound algorithm, consider a small instance with n=3 items, values v = (40, 25, 25), weights w = (20, 15, 15), and capacity W = 30. At the root node, representing no decisions made, the upper bound is computed via the fractional knapsack relaxation: sort items by decreasing v_i / w_i (item 1: 2.0, items 2 and 3: 1.67 each), include item 1 fully (value 40, weight used 20, remaining capacity 10), then include a fraction 10/15 of item 2 (value \approx 16.67), yielding an upper bound of approximately 56.67. Branching begins on item 1, creating two child subproblems.
  • Include item 1 (x_1=1): Current value 40, weight used 20, remaining capacity 10, remaining items 2 and 3. Upper bound: 40 plus the fractional knapsack on items 2 and 3 with capacity 10 (fraction 10/15 of item 2, value \approx 16.67), total \approx 56.67 (> current incumbent 0, explore). Branch on item 2.
  • Exclude item 1 (x_1=0): Current value 0, weight used 0, remaining capacity 30, remaining items 2 and 3. Upper bound: fractional knapsack on items 2 and 3 with capacity 30 (item 2 in full: value 25, weight 15; item 3 in full: value 25, weight 15), total 50 (> incumbent 40, explore). Branch on item 2.
The algorithm explores 6 nodes (root plus 5 decision nodes before termination by infeasibility or fathoming), identifying the optimal solution of value 50 using items 2 and 3, compared to the full enumeration of 2^3 = 8 leaves without pruning. This demonstrates how tight upper bounds enable fathoming by infeasibility (2 nodes), optimality (1 subtree), and bound domination (1 node), substantially reducing the search space.
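The instance above can be replayed with a compact depth-first variant (an illustrative sketch; this traversal's node-counting convention differs from the best-first tally described in the text, and the items are assumed already sorted by value density, as they are in this instance):

```python
def knapsack_bb(values, weights, capacity):
    """Depth-first branch and bound for 0-1 knapsack; returns the optimal value
    and the number of tree nodes visited. Assumes density-sorted items."""
    n = len(values)
    state = {"best": 0, "nodes": 0}

    def upper_bound(level, value, room):
        # Fractional relaxation over the remaining items.
        b = value
        for i in range(level, n):
            if weights[i] <= room:
                room -= weights[i]
                b += values[i]
            else:
                return b + values[i] * room / weights[i]
        return b

    def visit(level, value, room):
        state["nodes"] += 1
        if value > state["best"]:
            state["best"] = value              # every partial selection is feasible
        if level == n or upper_bound(level, value, room) <= state["best"]:
            return                             # leaf reached, or fathomed by bound
        if weights[level] <= room:             # include branch, only if it fits
            visit(level + 1, value + values[level], room - weights[level])
        visit(level + 1, value, room)          # exclude branch

    visit(0, 0, capacity)
    return state["best"], state["nodes"]
```

On this instance the depth-first variant visits 9 of the 15 possible tree nodes before proving the optimum of 50, with the exclude-exclude subtree fathomed outright by its bound of 25.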