Fact-checked by Grok 2 weeks ago

Subgame perfect equilibrium

In , a subgame perfect equilibrium (SPE), also known as subgame perfect Nash equilibrium (SPNE), is a refinement of the concept applied to extensive-form games, requiring that players' strategies constitute a not only in the overall game but in every arising from any decision node. This ensures sequential rationality, eliminating strategies supported by non-credible threats or promises that would not be optimal if the game reached those points. Introduced by economist in 1965 as part of his analysis of dynamic models, the concept was formally defined and expanded in his 1975 work on perfectness in extensive games. Selten's innovation addressed limitations in Nash equilibria for sequential games, where players might anticipate future moves but could sustain implausible off-equilibrium behavior; SPE resolves this by applying , starting from terminal nodes and working reversely to derive optimal strategies at each information set. In finite extensive games with perfect recall, at least one SPE always exists, and it can be computed by iteratively solving subgames. The significance of SPE lies in its foundational role in , earning Selten a share of the 1994 Nobel Prize in Economic Sciences alongside and for advancing equilibrium analysis. It has profound applications in economics, including credibility in —where central banks' threats of high interest rates must be believable—and dynamics, where firms' entry deterrence strategies are scrutinized for subgame consistency. Beyond economics, SPE informs bargaining models like the , , and even analyses of sequential decision-making under uncertainty.

Fundamentals

Definition

In game theory, a subgame perfect equilibrium (SPE) is a refinement of the tailored to extensive-form games, ensuring that players' strategies remain optimal not only in the overall game but also in every possible subgame that may arise during play. The concept was first introduced by in 1965 in the context of dynamic oligopoly models with , and formally developed for general finite extensive-form games in 1975 to eliminate such non-credible strategies. This concept addresses limitations in equilibria, where off-equilibrium-path behaviors might involve non-credible threats or promises, by imposing sequential rationality throughout the game tree. Formally, consider an represented by a game tree \Gamma with I, strategy sets S_i for each i \in I (encompassing both pure strategies, which specify actions at each decision , and mixed strategies, which assign probabilities to actions), payoff functions u_i: S \to \mathbb{R} for each i, and sets that group indistinguishable to in cases of . A profile \sigma^* = (\sigma_i^*, \sigma_{-i}^*) is a subgame perfect equilibrium of \Gamma if, for every \Gamma' of \Gamma starting at some or h, the restriction \sigma^*|_{\Gamma'} constitutes a Nash equilibrium of the subgame \Gamma'. That is, \forall i \in I, \ \forall \sigma_i \in S_i(\Gamma'), \quad u_i(\sigma^*|_{\Gamma'}) \geq u_i(\sigma_i, \sigma_{-i}^*|_{\Gamma'}) where S_i(\Gamma') denotes player i's feasible strategies in \Gamma', and payoffs are evaluated within that subgame. The core requirement of SPE is sequential rationality, meaning that at every decision node (or information set) in any subgame, each player's strategy maximizes their expected payoff given the strategies of others, conditional on reaching that point. This ensures that strategies prescribe credible actions everywhere in the game tree, preventing equilibria supported solely by empty threats, and applies to both perfect and imperfect information settings as long as subgames are well-defined.

Subgames and Extensive Form Games

Extensive form games provide a detailed representation of sequential strategic interactions, capturing the order of moves and information available to players. Introduced by and , these games are modeled using a , where nodes represent decision points or terminal outcomes, and branches denote possible actions taken by players or chance events. Each nonterminal node is assigned to a specific player who chooses among the available actions leading to child nodes, while terminal nodes specify payoff vectors for all players based on the path taken to reach them. To handle imperfect information, information sets group nodes where a player cannot distinguish between them, ensuring that strategies are consistent across indistinguishable situations. A key structural element in extensive form games is the , which allows for the of the overall into smaller, self-contained components for . Formally, a is defined as follows: consider a nonterminal h in the such that the information set containing the corresponding to h is a \{h\}; the rooted at this consists of all successor nodes of h, along with the associated actions, information sets, and payoffs, provided that every information set intersecting this subtree is either entirely contained within it or entirely outside. This ensures that the replicates the original 's structure from that point onward, independent of prior . Subgames rooted at the initial of the full are trivial in the sense that they encompass the entire ; proper subgames, by contrast, begin at some intermediate information set and exclude the preceding , enabling recursive nested of strategic incentives. Trivial subgames, often consisting of single terminal nodes, offer no further decisions and thus trivial payoffs. Games of form a special class of extensive form games where every set is a , meaning that at each decision , the knows exactly the full history of all previous actions. This structure eliminates uncertainty about prior moves and facilitates straightforward sequential reasoning, as formalized by Kuhn in his analysis of such games. perfect equilibrium refines the standard by imposing consistency requirements specifically within these subgame structures.

Solution Concepts

Backward Induction

Backward induction is the foundational technique for computing subgame perfect equilibria (SPE) in finite extensive-form games with , where players move sequentially and observe all prior actions. Introduced as part of the SPE concept, it systematically eliminates non-credible strategies by reasoning from the game's end toward the beginning, ensuring that strategies are optimal in every . The process begins at the terminal nodes of the game tree, where payoffs are fixed and known. At each such node, the payoffs determine the outcomes. Moving backward, for every decision node reachable after the terminal payoffs are assigned, the player to move selects the action that maximizes their payoff, given the already determined optimal continuations from subsequent nodes. Suboptimal actions—those yielding lower payoffs—are pruned from consideration, effectively simplifying the tree by removing dominated branches. This iterative pruning continues up the tree until the initial node, yielding a strategy profile where no player has an to deviate at any point. Formally, at any non-terminal history h where player i moves, the optimal action a^* is chosen as a^* = \arg\max_a u_i(\sigma_{-i} \mid h, a), where u_i is player i's payoff and \sigma_{-i} \mid h denotes the continuation strategies of the other players following history h and action a. This choice incorporates the backward-inducted optima from all downstream subgames. This method applies specifically to finite-horizon games with , where the game tree is acyclic and has a finite depth, allowing complete traversal from the end. It does not extend directly to infinite-horizon games, as there are no nodes to initiate the , nor to games with imperfect information without modifications, such as incorporating beliefs over hidden actions. The outcome of in these games is a pure-strategy SPE, as each decision prescribes a unique best response under the assumption of strict preferences over payoffs. However, if ties exist at any —where multiple actions yield the same maximum payoff—multiple pure-strategy SPE may arise, corresponding to different resolutions of the indifference.

Computing Subgame Perfect Equilibria

The computation of perfect equilibria (SPE) in extensive-form relies on verifying sequential —ensuring that no player has a profitable deviation in any —across the game's structure. A general approach uses dynamic programming to recursively evaluate , starting from terminal nodes and propagating optimal strategies backward through the game tree, which enforces the condition in each . For specific classes of , such as two-player constant-sum extensive-form , linear programming formulations in the sequence form allow efficient solving of SPE by optimizing realization probabilities subject to best-response constraints in . In extensive-form games with imperfect information, where players cannot distinguish certain histories, computing SPE necessitates incorporating consistent beliefs over information sets to maintain sequential rationality. This often involves refinements like sequential equilibrium, which pairs behavioral strategies with belief systems that are derived via Bayes' rule where possible and ensure no player regrets past actions given updated beliefs. Software tools such as can model imperfect-information games and compute equilibria using sequence-form methods (for two-player constant-sum cases), which coincide with SPE in perfect-information settings. For imperfect information, refinements like sequential equilibrium are used to ensure subgame perfection. Graphical tools like the aid in visualizing and solving game trees for equilibria. The of finding SPE is PPAD-complete for approximating equilibria in general n-player extensive-form games of perfect recall, reflecting the challenge of locating fixed points in spaces with subgame constraints. However, for games with , SPE can be computed in polynomial time using as a special case. One approach is to find a \sigma that is sequentially rational in every , meaning that for every j and subgame, \sigma_j is a best response to \sigma_{-j} in that subgame; this is often operationalized through best-response dynamics, iteratively updating strategies until convergence in subgames.

Examples

Simple Ultimatum Game

The is a two-player, sequential bargaining game in which one player, the proposer, offers to split a fixed amount of —typically $1—with the other player, the responder, who can either accept the offer or reject it. If the responder accepts, the division occurs as proposed, with the proposer receiving the remainder; if rejected, both players receive nothing. This setup highlights strategic decision-making under , where the responder moves last after observing the offer. In standard analysis without subgame perfection, multiple outcomes qualify as equilibria: the proposer can offer any share x where $0 < x \leq 1, and the responder accepts any positive offer while being indifferent to zero, as long as the strategy profile is mutually best responding overall. These equilibria include "fair" splits like 50-50, since a responder who rejects low offers could deter them, but such rejections are not sequentially rational in every . However, Nash equilibria fail to eliminate non-credible threats by the responder to reject small offers, potentially sustaining higher shares for the responder. Subgame perfect equilibrium refines this by applying backward induction, starting from the responder's subgames. In each subgame following an offer x > 0, the responder rationally accepts, as x > 0 strictly dominates rejection (yielding 0). For x = 0, the responder is indifferent and can accept or reject in equilibrium. Anticipating acceptance of any positive offer, the proposer optimally offers the smallest feasible positive amount, \epsilon (approaching zero, such as one in discrete versions), maximizing their payoff at nearly $1 while giving the responder nearly nothing. The unique subgame perfect equilibrium thus consists of the proposer offering \epsilon and the responder accepting all offers x \geq 0. This equilibrium reveals the responder's rational indifference to minimal offers, eliminating fairness-based rejections as non-credible strategies that unravel under backward induction. By requiring optimality in every subgame, subgame perfect equilibrium underscores how sequential rationality prevents empty threats, leading to a lopsided outcome that contrasts with experimental behavior where responders often reject low offers.

Entry Deterrence Game

The entry deterrence game, commonly referred to as the chain-store paradox, serves as a canonical example of how subgame perfect equilibrium addresses credibility problems in sequential strategic interactions. Developed by in 1978, the game models an monopolist (the chain store) confronting a of potential entrants across a finite number of independent s, typically denoted as k markets. Each entrant decides whether to stay out or enter their respective market, with subsequent entrants observing all prior outcomes. If an entrant stays out, the incumbent secures a monopoly . Upon entry, the incumbent chooses between accommodating the entrant (sharing the market) or fighting (engaging in a costly ). The structure captures the tension between short-term gains and long-term deterrence strategies. The payoffs in the stage game are designed to reflect realistic economic incentives, where the incumbent strictly prefers no entry to accommodation, and accommodation to fighting. A standard representation, consistent with Selten's formulation, is as follows:
OutcomeEntrant PayoffIncumbent Payoff
Stay out09
Enter and accommodate52
Enter and fight-20
These values ensure that entrants prefer accommodation over staying out but avoid fighting, while the incumbent values monopoly rents highest but suffers from aggression. Intuitively, or under naive "folk theorem" reasoning, the incumbent could credibly deter all entries by fighting the first one, establishing a reputation for toughness that leads subsequent entrants to stay out, thereby securing profits across all markets. However, , computed via , reveals this strategy as non-credible. In the final market, with no future periods to influence, the incumbent rationally accommodates any entry, as fighting yields a lower payoff (0 versus 2). Anticipating this, the last entrant enters. This logic unravels inductively: each prior entrant expects , leading to entry in every market and by the incumbent throughout. This unique SPE outcome eliminates empty threats, compelling myopic play in each and resolving the by prioritizing sequential over global reputation-building in finite horizons.

Applications and Extensions

In Repeated Games

In finitely repeated games with perfect information, subgame perfect equilibria (SPE) unravel through backward induction, leading players to replicate the stage game's Nash equilibrium in every period. This occurs because, in the final period, players have no future incentives and thus play the stage Nash equilibrium; anticipating this, the penultimate period similarly reduces to the stage game, and the process iterates backward until the first period. However, with imperfect information—such as private or noisy monitoring of actions—cooperation can be sustained as SPE outcomes for games of sufficiently long but finite length, approximating the richer set of equilibria seen in infinite horizons. In infinitely repeated games, where players maximize the discounted sum of stage payoffs with discount factor \delta \in (0,1), SPE permit a broader class of sustainable behaviors compared to finite repetition. The folk theorem establishes that any feasible strictly above the players' individually rational levels (typically the payoffs) can be supported as an SPE payoff profile, provided \delta is sufficiently high. This result relies on the ability to construct strategies with credible punishments that deter deviations, enabling outcomes like mutual that are inefficient in the one-shot stage game. Trigger strategies exemplify how SPE enforce such cooperation in infinitely repeated settings. In the , the strategy—cooperate as long as no deviation has occurred, then defect forever thereafter—forms an SPE when paired with itself, as the threat of permanent is credible and perfect. For this to hold, the discount factor must ensure the one-period gain does not outweigh the discounted future losses from , yielding the condition \frac{\delta}{1-\delta} \geq \frac{t - c}{c - p}, where t is the payoff (deviate against cooperation), c the mutual cooperation payoff, and p the payoff under mutual . In the standard with payoffs of 3 for mutual cooperation, 5 for , 1 for mutual , and 0 for being exploited, this simplifies to \delta \geq \frac{1}{2}. Thus, sufficiently patient players (high \delta) can sustain efficient SPE outcomes via these mechanisms, markedly contrasting the defective of the stage game.

Relation to Other Equilibria

Subgame perfect equilibrium (SPE) represents a refinement of the tailored to extensive-form games, requiring that the strategy profile constitutes a equilibrium not only for the entire game but also for every subgame. This condition eliminates Nash equilibria that depend on non-credible off-equilibrium-path behaviors, such as empty threats that players would not rationally carry out if reached, thereby ensuring sequential rationality throughout the game tree. In contrast, standard Nash equilibria in extensive-form games can include spurious outcomes where players commit to actions in unreached subgames that are suboptimal if those subgames were to occur, a problem SPE resolves by imposing across all possible . While SPE assumes perfect information where players observe all prior actions, the perfect Bayesian equilibrium (PBE) extends this refinement to games with incomplete by incorporating players' beliefs over unobserved histories or types and requiring Bayes-consistent updating and sequential rationality conditional on those beliefs. In perfect-information settings, every SPE is a PBE, but PBE is necessary for imperfect-information games because subgames may not be well-defined when information sets span multiple nodes, preventing direct application of subgame perfection. This extension, formalized by Kreps and Wilson, allows analysis of signaling and screening scenarios where hidden affects strategic choices. Trembling-hand perfect equilibrium provides another refinement of , demanding that strategies remain optimal even under small perturbations representing accidental mistakes by players, thus ensuring robustness to slight errors in execution. Although SPE enforces perfection to rule out non-credible threats, it does not inherently guarantee trembling-hand perfection, as some SPE may unravel under such perturbations if off-path behaviors lack stability against minor deviations. Selten's 1975 concept of trembling-hand thus goes beyond SPE by addressing potential fragility in dynamic games, particularly in normal-form representations of extensive games. SPE exhibits key limitations in its scope and selectivity. It is primarily suited to perfect-information environments and does not fully accommodate incomplete information, necessitating extensions like for broader applicability in . Moreover, even in finite perfect-information games, multiple SPE can exist, leading to a lack of uniqueness and requiring additional criteria, such as payoff dominance or risk dominance, to select among them.

References

  1. [1]
    The Prize in Economics 1994 - Press release - NobelPrize.org
    Selten's subgame perfection has direct significance in discussions of credibility in economic policy, the analysis of oligopoly, the economics of information, ...
  2. [2]
    [PDF] Reexamination of the Perfectness Concept for Equilibrium Points in ...
    Aug 23, 1974 · The new definition of perfectness has the property that a perfect equilibrium point is always subgame perfect but a subgame perfect equi-.
  3. [3]
    [PDF] Chapter 11 Subgame-Perfect Nash Equilibrium
    Definition 11.1 A Nash equilibrium is said to be subgame perfect if an only if it is a. Nash equilibrium in every subgame of the game. A subgame must be a well- ...
  4. [4]
    Reexamination of the perfectness concept for equilibrium points in ...
    Selten, R. Reexamination of the perfectness concept for equilibrium points in extensive games. Int J Game Theory 4, 25–55 (1975).
  5. [5]
    Subgame Perfection - Game Theory
    A subgame perfect Nash equilibrium is a Nash equilibrium in which the strategy profiles specify Nash equilibria for every subgame of the game. Note. This ...
  6. [6]
    [PDF] A Course in Game Theory - Mathematics Department
    ... Osborne and Ariel Rubinstein (ISBN 0-262-65040-1). Copyright c 1994 ... Subgame Perfect Equilibrium 97. 6.3 Two Extensions of the Definition of a Game ...
  7. [7]
    [PDF] Reinhard Selten - Prize Lecture
    A subgame perfect equilibrium set is a set of subgame perfect equilibria all of which yield the same payoffs, not only in the game as a whole, but also in each ...
  8. [8]
    gambit-lp: Compute equilibria in a two-player constant-sum game ...
    This switch instructs the program to find only equilibria which are subgame perfect. (This has no effect for strategic games, since there are no proper subgames ...
  9. [9]
    Sequential Equilibria - jstor
    We propose a new criterion for equilibria of extensive games, in the spirit of Selten's perfectness criteria. This criterion requires that players' ...
  10. [10]
    Gambit: User documentation — Gambit 16.4.0 documentation
    ### Summary: How Gambit Computes Subgame Perfect Equilibria in Extensive Form Games
  11. [11]
    The complexity of computing a (quasi-)perfect equilibrium for an n ...
    We study the complexity of computing or approximating refinements of Nash equilibrium for finite n-player extensive form games of perfect recall (EFGPR).
  12. [12]
    [PDF] Subgame-Perfect Equilibrium in Repeated Games Played by ...
    The one typically used in dynamic games of perfect information is subgame-perfect equilibrium, suggested by Selten [14]. A strategy profile is a subgame-.
  13. [13]
    An experimental analysis of ultimatum bargaining - ScienceDirect.com
    The special property of ultimatum bargaining games is that on every stage of the bargaining process only one player has to decide.
  14. [14]
    [PDF] 6 Extensive Games with Perfect Information: Illustrations
    Further, a variant of the ultimatum game in which the players are equity conscious has subgame perfect equilibria in which offers are significant (as you will ...<|control11|><|separator|>
  15. [15]
    [PDF] An Experimental Analysis of Ultimatum Bargaining
    According to the procedure used by Güth and Schwarze (1983) the position is sold to the highest bidder at the price of the second highest bid. Then the. Winners ...
  16. [16]
    [PDF] Advanced Microeconomics II Handout on Subgame Perfect ...
    Therefore, according to the subgame perfect equilibrium prediction in the ultimatum bargaining game, the proposer should make a tiny offer (one cent or, if ...
  17. [17]
    The chain store paradox - Theory and Decision
    ### Chain Store Game Setup Summary
  18. [18]
    [PDF] The chain-store paradox revisited - CORE
    Selten (1978) considered a fictitious chain-store confronted with potential entrances of local competitors. In his formal analysis via an extensive game with.Missing: citation | Show results with:citation
  19. [19]
    [PDF] Chapter 12 Repeated Games - MIT OpenCourseWare
    he Folk Theorem states that any strictly individually rational and feasible payoff vector can be supported in subgame perfect Nash equilibrium when the players ...
  20. [20]
    Finitely Repeated Games - jstor
    These examples are disturbing because intuition and anecdotal evidence suggest that some degree of cooperation is plausible, even likely, in finitely repeated ...<|control11|><|separator|>
  21. [21]
    [PDF] The Folk Theorem in Repeated Games with Discounting or with ...
    Jul 24, 2007 · The Folk Theorem asserts that any individually rational outcome can arise as a. Nash equilibrium in infinitely repeated games with sufficiently ...Missing: seminal | Show results with:seminal
  22. [22]
    Non-cooperative Equilibrium for Supergames12 - Oxford Academic
    James W. Friedman; A Non-cooperative Equilibrium for Supergames12, The Review of Economic Studies, Volume 38, Issue 1, 1 January 1971, Pages 1–12, https://
  23. [23]
    [PDF] imperfect information (Perfect Bayesian Equilibrium) - UGA SPIA
    Imperfect information means a player may not know all previous actions, and is uncertain about at least part of the history of play.
  24. [24]
    [PDF] Game Theory, Lecture 2: Equilibrium Refinements
    A strategy profile in an extensive-form game is a subgame perfect equilibrium (SPE) if its restriction to each subgame is a Nash equilibrium. Backward induction ...<|control11|><|separator|>