Sion's_minimax_theorem

Sion's minimax theorem is a cornerstone result in mathematical optimization and game theory, stating that if X is a nonempty compact convex subset of a linear topological space and Y is a nonempty convex subset of a linear topological space, and if f: X \times Y \to \mathbb{R} is a real-valued function such that f(x, \cdot) is upper semicontinuous and quasi-concave on Y for every fixed x \in X while f(\cdot, y) is lower semicontinuous and quasi-convex on X for every fixed y \in Y, then

\min_{x \in X} \sup_{y \in Y} f(x, y) = \sup_{y \in Y} \min_{x \in X} f(x, y).

^[1] Proved by Maurice Sion in 1958, the theorem generalizes John von Neumann's earlier minimax theorem from 1928, which applies specifically to continuous bilinear functions (hence concave-convex) over compact convex sets in finite-dimensional spaces.^[1] Sion's version relaxes these restrictions by requiring compactness only on one set, allowing quasi-concavity/convexity instead of strict concavity/convexity, incorporating semicontinuity for topological control, and extending to infinite-dimensional linear topological spaces such as Banach spaces.^[1] This broader applicability makes it a powerful tool for handling noncooperative games and variational problems where full compactness or linearity assumptions fail. The theorem's proof relies on fixed-point techniques, including the Knaster–Kuratowski–Mazurkiewicz (KKM) lemma, to establish the equality of the minimax and maximin values, ensuring the existence of saddle points under the given conditions.^[2] It has profound implications across disciplines: in game theory, it guarantees equilibrium values in zero-sum games with quasi-concave payoff functions, extending beyond finite strategies;^[1] in optimization, it underpins strong duality for convex-concave problems, facilitating Lagrangian relaxation and saddle-point methods for constrained programs;^[3] and in machine learning and signal processing, it supports robust algorithms like Fisher discriminant analysis and beamforming by enabling minimax formulations for adversarial settings.^[4] Subsequent extensions have further adapted it to metric spaces and non-Euclidean geometries, broadening its use in modern computational challenges.^[5]

Historical context

Von Neumann's minimax theorem

John von Neumann established the foundational minimax theorem in 1928, addressing the strategic equilibrium in two-player zero-sum games with finite action sets.^[6] In this setting, each player selects a mixed strategy—a probability distribution over their finite pure strategies—from compact convex sets X and Y, typically simplices representing the strategy spaces.^[6] The payoff is given by a bilinear function f(x, y) = x^T A y, where A is the payoff matrix with entries denoting the row player's gain (and thus the column player's loss) for each pair of pure strategies.^[6] Von Neumann's theorem asserts that \max_{x \in X} \min_{y \in Y} f(x, y) = \min_{y \in Y} \max_{x \in X} f(x, y), guaranteeing the existence of a game value that both players can secure regardless of the opponent's play.^[6] This result formalizes mixed strategies as a tool for rational play in games where pure strategies may lead to exploitable outcomes.^[7] In a zero-sum game, the row player (maximizer) aims to choose x to maximize the minimum payoff against the column player's (minimizer) best response, while the column player seeks to minimize the maximum payoff.^[7] The equality ensures that optimal mixed strategies exist, yielding the game's value as this common quantity, which represents a fair division of outcomes under perfect information and rationality.^[7] Von Neumann's 1928 paper, "Zur Theorie der Gesellschaftsspiele," published in German in Mathematische Annalen, introduced these concepts but received limited initial attention outside mathematical circles.^[6] Its broader impact emerged with the 1944 English publication of Theory of Games and Economic Behavior, co-authored with economist Oskar Morgenstern, which translated and expanded the work, popularizing game theory across economics, mathematics, and social sciences.^[7] A classic illustration is the rock-paper-scissors game, a symmetric zero-sum game with no pure strategy equilibrium.^[8] The payoff matrix A for the row player (positive for win, negative for loss, zero for tie) is:

	Rock	Paper	Scissors
Rock	0	-1	1
Paper	1	0	-1
Scissors	-1	1	0

Any pure strategy choice allows the opponent to counter profitably, but the unique mixed strategy equilibrium has each player randomizing uniformly (probability $1/3 for each action), yielding a game value of 0.^[8] This demonstrates how von Neumann's theorem resolves apparent instability through probabilistic play.^[8]

Extensions to concave-convex functions

In the mid-20th century, mathematicians extended von Neumann's minimax theorem from finite-dimensional bilinear payoffs in zero-sum games to more general continuous functions over infinite strategy spaces, particularly focusing on concave-convex structures in convex analysis. A pivotal advancement came in 1952 with Ky Fan's work, which generalized the theorem to compact convex sets within locally convex topological linear spaces, allowing for quasi-concave-quasi-convex functions without requiring bilinearity. This bridged the discrete finite case to broader continuous settings, leveraging tools from functional analysis.^[9] Fan’s theorem states that if L_1 and L_2 are locally convex topological linear spaces, and K_1 \subset L_1, K_2 \subset L_2 are nonempty compact convex sets, then for a real-valued continuous function f: K_1 \times K_2 \to \mathbb{R} that is quasi-concave in the first variable (for fixed y \in K_2) and quasi-convex in the second variable (for fixed x \in K_1), the minimax equality holds:

\max_{x \in K_1} \min_{y \in K_2} f(x, y) = \min_{y \in K_2} \max_{x \in K_1} f(x, y).

This formulation ensures the existence of the optima through the continuity and compactness, with quasi-concavity guaranteeing convex upper-level sets for maximization and quasi-convexity for minimization. In the special case where f is concave in x and convex in y, the conditions are satisfied since strict concavity/convexity implies the quasi-variants.^[9] Unlike von Neumann's 1928 theorem, which applies to bilinear functions over finite simplices and relies on finite matrix properties, Fan's version accommodates infinite-dimensional spaces by invoking separation theorems for convex sets in locally convex topologies, enabling the handling of non-finite strategy sets without explicit summation. This key difference facilitated the study of equilibrium in continuous games and variational problems.^[9] These extensions played an early role in optimization by providing existence results for saddle points in concave-convex functionals, such as in duality theory and equilibrium problems, where the minimax equality identifies optimal dual solutions without enumerating strategies. For instance, in linear programming reformulations, the theorem underpins the equivalence of primal and dual optima over polyhedral sets.^[9]

Sion's contribution in 1958

In 1958, Maurice Sion published his seminal paper "On general minimax theorems" in the Pacific Journal of Mathematics, volume 8, number 1, pages 171–176.^[1] This work addressed key limitations in earlier minimax results, such as those requiring both strategy sets to be compact or the payoff function to be fully concave-convex, by extending applicability to scenarios involving unbounded domains.^[1] Sion's primary innovation relaxed these constraints while preserving the core equality between the minimum maximum and maximum minimum values. Specifically, he required one set, X, to be compact and convex in a linear topological space, while the other set, Y, needed only to be convex in a linear topological space; the function was required to be upper semicontinuous and quasi-concave in y on Y for each fixed x ∈ X, and lower semicontinuous and quasi-convex in x on X for each fixed y ∈ Y.^[1] These conditions generalized prior extensions, such as Ky Fan's theorem, which emerges as a special case when both sets are compact.^[1] The proof offered by Sion was notably compact, leveraging fixed-point theory alongside separation arguments for convex sets to establish the result without invoking more restrictive assumptions.^[1] This contribution had lasting historical impact, serving as a foundational tool in nonlinear analysis and inspiring subsequent developments in game theory and optimization by enabling minimax principles in broader topological settings.^[10]^[11]

Formal statement

Assumptions on sets and function

Sion's minimax theorem requires specific topological and convexity conditions on the underlying sets and function to guarantee the equality of minimax values. Let X be a nonempty compact convex subset of a linear topological space and let Y be a nonempty convex subset of a linear topological space.^[12] These assumptions ensure that X supports convex combinations and compactness facilitates the attainment of minima under appropriate continuity conditions, while Y supports the necessary convexity for superma. Consider a function f: X \times Y \to \mathbb{R} that satisfies the following properties: for each fixed x \in X, f(x, \cdot) is upper semicontinuous and quasi-concave on Y; and for each fixed y \in Y, f(\cdot, y) is lower semicontinuous and quasi-convex on X.^[12] Upper semicontinuity of f(x, \cdot) means that for every y_0 \in Y and sequence \{y_n\} \subset Y with y_n \to y_0, \limsup_{n \to \infty} f(x, y_n) \leq f(x, y_0).^[12] Similarly, lower semicontinuity of f(\cdot, y) implies that for every x_0 \in X and sequence \{x_n\} \subset X with x_n \to x_0, \liminf_{n \to \infty} f(x_n, y) \geq f(x_0, y).^[12] Quasi-concavity of f(x, \cdot) on the convex set Y is defined such that for all y_1, y_2 \in Y, t \in [0,1], and x \in X,

f(x, t y_1 + (1-t) y_2) \geq \min\{f(x, y_1), f(x, y_2)\},

or equivalently, the superlevel sets \{y \in Y : f(x, y) \geq c\} are convex for any c \in \mathbb{R}.^[12] Dually, quasi-convexity of f(\cdot, y) on the compact convex set X means that for all x_1, x_2 \in X, t \in [0,1], and y \in Y,

f(t x_1 + (1-t) x_2, y) \leq \max\{f(x_1, y), f(x_2, y)\},

or equivalently, the sublevel sets \{x \in X : f(x, y) \leq c\} are convex for any c \in \mathbb{R}.^[12] These quasi-convexity and quasi-concavity conditions generalize strict convexity and concavity, respectively, while the semicontinuity properties provide the necessary continuity to control limits in topological spaces.^[12] These assumptions are essential because the compactness of X combined with lower semicontinuity and quasi-convexity ensures the minimum over X is attained, while the quasi-concavity and upper semicontinuity on Y leverage arguments to establish the existence of suprema without requiring full compactness of Y.^[12] Such conditions relax the stricter requirements of earlier theorems, like von Neumann's, which demand compactness and linearity on both sets.

The theorem equality

Sion's minimax theorem asserts that, under the stated assumptions on the convex sets X and Y and the real-valued function f: X \times Y \to \mathbb{R}, the following equality holds:

\min_{x \in X} \sup_{y \in Y} f(x, y) = \sup_{y \in Y} \min_{x \in X} f(x, y).

^[1] This common value, often denoted as the game's value or the optimization value, is attained, meaning the suprema and infima are achieved as maxima and minima due to the compactness of one set and the continuity properties of f.^[1] The theorem further guarantees the existence of a saddle point (x^*, y^*) \in X \times Y, where x^* minimizes the maximum payoff and y^* maximizes the minimum payoff, satisfying

f(x^*, y) \leq f(x^*, y^*) \leq f(x, y^*)

for all x \in X and y \in Y. This saddle-point condition implies that f(x^*, y^*) equals the common value of the minimax equality.^[1] In notation, \sup and \inf are used generally, but the attainment ensures they coincide with \max and \min under the theorem's hypotheses, reflecting the finite achievement of optimal values in the abstract setting.^[1] Intuitively, the theorem extends the existence of a well-defined value in zero-sum games from finite or simplicial strategies to broader convex domains with quasi-convex-concave objectives, ensuring equilibrium attainment without requiring full concavity-convexity.^[1]

Proof

Key lemmas

The proof of Sion's minimax theorem relies on several auxiliary results from combinatorial topology, convex geometry, and functional analysis, which enable handling of quasi-concave-quasi-convex functions over convex sets without full continuity or bilateral compactness. These lemmas provide the tools for finite approximations and contradiction arguments central to the main proof.^[12] The Knaster–Kuratowski–Mazurkiewicz (KKM) lemma is a combinatorial fixed-point theorem: for an n-simplex S with vertices a_0, \dots, a_n, if open sets A_i cover S such that S \setminus A_i is convex and a_i \notin A_j for i \neq j, then \bigcap A_i \neq \emptyset. In Sion's proof, this is applied to finite simplicial approximations of the strategy sets, ensuring the intersection of level sets derived from the payoff function f is nonempty, which yields near-optimal responses and facilitates the contradiction.^[12] Helly's theorem, in its form for convex sets in \mathbb{R}^k, states that if a family of convex sets has the property that every k+1 sets intersect, then the whole family intersects. Sion uses an adaptation (Theorem 3.2) for points in linear spaces: for n+1 points in a space of dimension less than n, the intersection of convex hulls excluding each point is nonempty. This supports the reduction of infinite covers to finite minimal subsets in the proof, leveraging the compactness of X to select finite collections of points where level sets behave well.^[12] The separation theorem for convex sets, derived from the Hahn-Banach theorem, states that in a locally convex topological vector space, two nonempty disjoint convex sets, with one compact, can be strictly separated by a continuous linear functional. Although Sion's method unifies separation approaches (like Kneser-Fan), this theorem underpins the convex nature of sublevel sets \{y \in Y \mid f(x, y) \geq \alpha\} (from quasi-concavity in y) and enables arguments distinguishing optimal values when assuming inequality between minimax and maximin.^[12] A structural lemma (Lemma 3.3) asserts that for a convex set M, finite set Y, and upper semicontinuous quasi-concave function f: M \times Y \to \mathbb{R}, there exists \mu_0 \in M such that f(\mu_0, y) < c for all y \in Y, for a suitable c. Its symmetric counterpart (Lemma 3.3') for lower semicontinuous quasi-convex functions ensures points where f > c holds uniformly. These guarantee the existence of points in convex hulls avoiding or achieving level thresholds, crucial for constructing the contradiction in the main argument. Quasiconcavity/convexity ensures the relevant level sets are convex, while semicontinuity controls closure properties.^[12]

Main argument

The core proof of Sion's minimax theorem establishes \min_{x \in X} \sup_{y \in Y} f(x,y) = \sup_{y \in Y} \min_{x \in X} f(x,y) via a contradiction argument, exploiting the compactness of X and convexity properties without requiring compactness of Y. The general minimax inequality holds: \sup_{y \in Y} \min_{x \in X} f(x,y) \leq \min_{x \in X} \sup_{y \in Y} f(x,y). To show equality, assume \sup_{y} \min_{x} f < c < \inf_{x} \sup_{y} f for some c.^[12] Define sets A_\mu = \{v \in Y \mid f(\mu, v) > c\} for \mu \in X (open and convex by upper semicontinuity and quasi-concavity in v) and B_v = \{\mu \in X \mid f(\mu, v) < c\} for v \in Y (closed and convex by lower semicontinuity and quasi-convexity in \mu). By the assumption, the A_\mu cover Y, and compactness of X allows selection of a finite subcover Y_0 \subset Y with corresponding finite X_0 \subset X. Iteratively minimize X_0 and Y_0 such that no proper subset covers, using convexity and the assumptions.^[12] Apply Lemma 3.3 to find \mu_0 \in \operatorname{conv}(X_0) with f(\mu_0, y) < c for all y \in Y_0, and Lemma 3.3' to find v_0 \in \operatorname{conv}(Y_0) with f(x, v_0) > c for all x \in X_0. Helly's theorem (via Theorem 3.2) and KKM (Theorem 3.1) ensure these points exist within the simplicial structure of the finite sets, leading to f(\mu_0, v_0) > c and f(\mu_0, v_0) < c, a contradiction. Thus, the inequality assumption fails, proving equality.^[12] The argument implies the existence of a saddle point (x^*, y^*) where f(x, y^*) \leq f(x^*, y^*) \leq f(x^*, y) for all x, y, attained via limits or finite approximations. Sion's original proof spans about five pages, emphasizing this topological-combinatorial method over purely analytic or fixed-point approaches.^[12]

Applications

In zero-sum games beyond simplices

Sion's minimax theorem generalizes the existence of equilibria in zero-sum games to settings where players' strategy sets are infinite, specifically compact convex subsets of topological vector spaces, such as the unit interval [0,1] representing continuous effort levels in competitive economic interactions. In these models, the payoff function f(x, y) must be quasiconcave in the maximizer's strategy x and quasiconvex in the minimizer's strategy y, ensuring the equality \max_{x \in X} \min_{y \in Y} f(x, y) = \min_{y \in Y} \max_{x \in X} f(x, y), which implies the existence of a value and optimal mixed strategies.^[12] Unlike von Neumann's original minimax theorem, which is limited to finite strategy spaces (simplices), Sion's version accommodates infinite compact convex sets, and subsequent extensions further relax compactness assumptions to handle non-compact action spaces, such as those involving unbounded utilities in economic models of competition.^[13]

In optimization and variational inequalities

Sion's minimax theorem plays a central role in the analysis of saddle-point problems in optimization, where objectives are reformulated as minimax problems of the form \min_x \max_y L(x,y) with L(x,y) = f(x) + \langle y, Ax \rangle, ensuring the existence of a saddle point under quasiconvexity in x and quasiconcavity in y over appropriate convex sets.^[12]^[5] This formulation allows the primal minimization over x to be coupled with maximization over y, guaranteeing that \min_x \max_y L(x,y) = \max_y \min_x L(x,y) when the sets are compact and convex, thereby establishing strong duality without requiring full convexity-concavity.^[12]^[14] A key application arises in Lagrange duality for convex programming, where the Lagrangian L(x,y) = f(x) + \langle y, Ax - b \rangle is minimized over primal variables x and maximized over dual multipliers y \geq 0, with the dual function g(y) = \inf_x L(x,y) being concave (hence quasiconcave) in y. Sion's theorem applies here to affirm the existence of a saddle point, validating zero duality gap under compactness of the dual feasible set and quasiconvexity assumptions on the primal, which extends beyond strictly convex cases.^[14] In variational inequalities, Sion's theorem connects to the problem of finding x^* \in K such that \langle F(x^*), x - x^* \rangle \geq 0 for all x \in K, by reformulating it as a minimax problem over a suitable function involving F, ensuring solution existence when F satisfies quasimonotonicity conditions compatible with quasiconvex-quasiconcave structures.^[15]^[12] This link facilitates proofs of equilibrium existence in monotone operator settings, where the minimax equality implies the variational condition holds at the saddle point.^[15] Post-2010, the theorem has informed modern machine learning applications in adversarial training, such as generative adversarial networks (GANs), where the objective \min_G \max_D V(D,G) leverages Sion's guarantees for saddle-point existence under weak quasiconvexity assumptions, aiding convergence analyses in nonconvex settings.^[16]^[17] These applications highlight the theorem's utility in ensuring stable training dynamics for minimax formulations beyond traditional convex domains.^[16] In signal processing, Sion's theorem supports minimax formulations for robust beamforming and Fisher discriminant analysis, enabling optimal solutions in adversarial noise environments over convex constraint sets.^[4]

Fan's minimax theorem

Fan's minimax theorem, introduced by Ky Fan in 1953, provides a foundational result in convex analysis and game theory. It states that if X and Y are compact convex subsets of topological linear spaces and f: X \times Y \to \mathbb{R} is a continuous function that is quasi-convex in the first variable for each fixed element of the second variable and quasi-concave in the second variable for each fixed element of the first variable, then

\min_{x \in X} \max_{y \in Y} f(x, y) = \max_{y \in Y} \min_{x \in X} f(x, y).

^[18] This result imposes stricter assumptions than Sion's minimax theorem, requiring compactness of both sets and full continuity of f, whereas Sion's allows one set to be merely convex and replaces continuity with appropriate semicontinuity conditions (upper semicontinuity and quasi-concavity in one variable, lower semicontinuity and quasi-convexity in the other). Sion's theorem thus generalizes Fan's by weakening these requirements while preserving the minimax equality. The proof of Fan's theorem relies on Tychonoff's fixed-point theorem applied to a continuous self-map on the product space X \times Y, ensuring the existence of a saddle point that equates the min-max and max-min values.^[19] This approach leverages the compactness and convexity to construct the fixed-point argument. Fan's theorem applies particularly well to finite-dimensional settings where compactness arises naturally, such as zero-sum games with bounded polyhedral strategy sets, facilitating equilibrium analysis without needing the broader topological flexibility of later generalizations.^[18]

Border's refinement

In 1985, Kim C. Border introduced a refinement of Sion's minimax theorem in his monograph Fixed Point Theorems with Applications to Economics and Game Theory. This version extends the theorem by considering correspondences (set-valued functions) with convex sections, where for a correspondence G: X \rightrightarrows Y, the sections G(x) = \{y \in Y \mid (x,y) \in G\} are convex for each x, and similarly for the inverse sections. Under suitable continuity and compactness assumptions, Border derives a minimax equality for the associated scalarized function, relaxing the strict requirement for linear topological structure on the entire space by focusing on local convexity properties of the sections.^[20] This approach is particularly useful in economic models with infinite-dimensional strategy spaces or non-Hausdorff topologies, where traditional vector space assumptions may not hold, but convex section conditions ensure the existence of equilibria via fixed-point arguments. Border's result builds on Sion's by adapting the proof to these generalized settings, maintaining the core minimax equality \min_{x \in X} \sup_{y \in Y} f(x,y) = \sup_{y \in Y} \min_{x \in X} f(x,y) under weakened structural requirements on the sets. Despite its generality, Border's extension requires the correspondences to satisfy hemicontinuity and convexity of sections, which may not cover all partially ordered or lattice-based structures but provides a robust tool for variational problems in economics and game theory.