Second-order arithmetic
Second-order arithmetic is a formal system in mathematical logic that extends first-order Peano arithmetic by incorporating second-order quantification over subsets of the natural numbers, allowing the expression and proof of theorems involving sets and higher-level properties within the domain of arithmetic.[1][2] Its language is two-sorted, featuring first-order variables for individual natural numbers (e.g., x, y) and second-order variables for sets of natural numbers (e.g., X, Y), along with function symbols for addition (+) and multiplication (\times), relations for equality (= ) and order (< ), and membership (\in).[3][2] The axioms of full second-order arithmetic, often denoted Z_2, include the basic axioms of Robinson arithmetic Q (such as successor non-zero and no cycles in successors), a comprehension scheme asserting the existence of sets defined by any formula in the language (i.e., for any formula \phi(n) free of the set variable X, there exists X such that \forall n (n \in X \leftrightarrow \phi(n))), and an induction axiom schema that applies to arbitrary sets: if a set contains 0 and is closed under the successor function, it contains all natural numbers.[2][3] This system is semantically interpreted over the standard model of the natural numbers \mathbb{N} with the full power set \mathcal{P}(\mathbb{N}), enabling it to categorically characterize the structure (\mathbb{N}, +, \times) up to isomorphism—a capability beyond first-order arithmetic due to limitations like the Löwenheim-Skolem theorem.[1] Second-order arithmetic plays a central role in the foundations of mathematics, particularly through its subsystems (e.g., \mathrm{[RCA](/page/RCA)}_0 with recursive comprehension and \Sigma^0_1-induction, \mathrm{[ACA](/page/CA)}_0 with arithmetical comprehension, and stronger variants like \Pi^1_1-\mathrm{[CA](/page/CA)}_0), which are studied in reverse mathematics to determine the precise set-existence axioms required to prove core theorems in analysis, algebra, and topology.[3] For instance, many classical results, such as the Bolzano-Weierstrass theorem, are equivalent to \mathrm{[ACA](/page/CA)}_0 over \mathrm{[RCA](/page/RCA)}_0, highlighting the system's utility in calibrating the strength of mathematical principles without full set theory.[3] Philosophically, it bridges first-order and set-theoretic logics, raising questions about impredicativity and the reliance on a set-theoretic metatheory for its semantics, while formalizing significant portions of countable mathematics, including aspects of real analysis via arithmetical comprehension.[1]Overview and Historical Context
Core Concepts
Second-order arithmetic is a formal theory in mathematical logic that extends the foundations of arithmetic by incorporating quantification over sets of natural numbers, providing a framework for much of classical analysis and beyond. It is formulated as a two-sorted first-order theory, with one sort for individual natural numbers (typically denoted by variables like n, m \in \mathbb{N}) and another sort for sets of natural numbers (denoted by variables like X, Y \subseteq \mathbb{N}). This structure allows the theory to reason about both numerical computations and higher-level properties of collections of numbers, such as sequences, functions, and relations definable over \mathbb{N}.[3] In contrast to first-order Peano arithmetic (PA), which is limited to quantification solely over individual natural numbers and thus cannot directly express concepts involving infinite subsets or full induction over all properties, second-order arithmetic introduces variables ranging over the power set \mathcal{P}(\mathbb{N}). This enables the full second-order induction axiom, which asserts that any property definable by a formula involving set quantifiers holds for all natural numbers if it holds for zero and is preserved under the successor function. Additionally, the comprehension scheme allows the existence of sets defined by arbitrary formulas, ensuring that the theory can construct subsets of \mathbb{N} corresponding to predicates in its language, thereby capturing the expressive power needed for mathematical analysis without invoking a full set-theoretic universe.[3] A basic illustration of this capability is how second-order arithmetic models the power set of the naturals: set variables directly represent elements of \mathcal{P}(\mathbb{N}), and the comprehension axiom guarantees that for any formula \phi(n), there exists a unique set X such that n \in X if and only if \phi(n) holds, all within the theory's arithmetic base. This avoids the need for a separate axiomatic set theory like ZFC, as the sets are inherently tied to arithmetic predicates, providing a lightweight yet potent system for encoding countable mathematics.[3] Informally, second-order arithmetic motivates much of reverse mathematics, a program that classifies theorems of ordinary mathematics by determining the minimal subsystems of the theory required to prove them, revealing the logical strength underlying concepts from analysis, algebra, and geometry.[3]Development and Significance
The formalization of second-order arithmetic traces back to Richard Dedekind's 1888 work Was sind und was sollen die Zahlen?, where he employed second-order quantification to axiomatize the natural numbers and prove the uniqueness of their structure up to isomorphism, with significant developments in the early 20th century as part of efforts to formalize the foundations of analysis and arithmetic within a predicative framework. Hermann Weyl introduced a foundational system in his 1918 work Das Kontinuum, which restricted comprehension to arithmetical definitions, resembling the modern subsystem ACA₀ and emphasizing predicative methods to avoid impredicative set formations. This approach was motivated by Weyl's critique of classical analysis, aiming to construct the real numbers using only hereditarily finite sets and arithmetical predicates. Subsequently, David Hilbert and Paul Bernays developed a more comprehensive second-order arithmetic in their Grundlagen der Mathematik (Volumes I and II, 1934 and 1939), integrating it into Hilbert's program for finitistic proofs of consistency. Their system formalized substantial fragments of real analysis, using second-order variables to represent sets of naturals, and served as a bridge between finitary arithmetic and higher mathematics.[3][4] Key advancements in the mid-20th century built on these foundations through model-theoretic and proof-theoretic analyses. In the 1950s, Georg Kreisel contributed significantly to the study of models of second-order arithmetic, particularly in his 1950 paper on arithmetic models for predicate calculus formulae, where he explored consistency and definability using hyperarithmetical sets. Kreisel's work in the 1950s and 1960s further examined predicative subsystems, showing limitations like the failure of the perfect set theorem in such systems and advancing ordinal analyses that informed later hierarchies. The field gained renewed momentum in the late 20th century with Stephen G. Simpson's development of reverse mathematics during the 1970s and 1980s, culminating in his 2009 book Subsystems of Second Order Arithmetic. Simpson systematized the "Big Five" subsystems (RCA₀, WKL₀, ACA₀, ATR₀, Π¹₁-CA₀), demonstrating how they calibrate the set existence axioms needed for core mathematical theorems.[3] The significance of second-order arithmetic lies in its role as a foundational framework that bridges first-order arithmetic with analysis, allowing the formalization of much of classical mathematics—such as countable algebra, graph theory, and parts of analysis—without the full power of Zermelo-Fraenkel set theory with choice (ZFC). It provides the basis for reverse mathematics, where theorems are "reversed" to identify minimal axioms for their proofs, revealing the logical structure underlying ordinary mathematics and partial realizations of Hilbert's program. This bridge enables precise studies of computability and definability, as sets of naturals correspond to reals, facilitating encodings of continuous structures within discrete arithmetic. As of 2025, second-order arithmetic remains central to ongoing research in proof theory, where subsystems inform ordinal notations and consistency strengths; in computability theory, supporting analyses of hyperarithmetic sets and Turing degrees; and in descriptive set theory, underpinning determinacy results for projective sets. Recent works, such as those on constructive variants and their proof-theoretic ordinals, continue to extend its applications, highlighting its enduring relevance in foundational mathematics.[5]Formal Foundations
Syntax
Second-order arithmetic is formalized in a two-sorted first-order language L_2, consisting of a sort for natural numbers and a sort for subsets of natural numbers.[3] The individual variables, often denoted by lowercase letters such as n, m, k \in \mathbb{N}, range over the natural numbers \omega = \{0, 1, 2, \dots \}. The set variables, denoted by uppercase letters such as X, Y, Z \subseteq \mathbb{N}, range over subsets of \omega.[3] The language includes the constant symbols $0 and $1 for the numerical terms, as well as the binary function symbols + and \cdot (multiplication, often denoted \times) for addition and multiplication on natural numbers.[3] Numerical terms are formed recursively: they include the number variables, the constants $0 and $1, and expressions built from these using + and \cdot, such as n + m or (n \cdot k) + 1.[3] The predicate symbols consist of equality = (between numerical terms), the strict order < (between numerical terms), and membership \in (relating a numerical term to a set variable, as in n \in X).[3] Atomic formulas are the equality statements t_1 = t_2, the order statements t_1 < t_2, and the membership statements t \in X, where t_1, t_2, t are numerical terms and X is a set variable.[3] Well-formed formulas are constructed inductively from atomic formulas using the propositional connectives \neg (negation), \land (conjunction), \lor (disjunction), \to (implication), and \leftrightarrow (biconditional), as well as the quantifiers: universal and existential quantifiers over numbers (\forall n, \exists n) and over sets (\forall X, \exists X).[3] A formula with no free variables is called a sentence.[3] Formulas in L_2 are classified based on their quantifier structure, distinguishing the first-order and second-order aspects. Arithmetical formulas (or \Delta_0^1 and higher levels in the arithmetical hierarchy) are those that use only number quantifiers and no set quantifiers, effectively forming the first-order part of the language equivalent to Peano arithmetic.[3] Second-order formulas incorporate set quantifiers, enabling the expression of properties involving subsets of natural numbers, such as comprehension principles.[3]Semantics
The semantics of second-order arithmetic interprets its language in mathematical structures that capture both individual natural numbers and collections of such numbers, providing a precise meaning to formulas involving quantification over sets. A structure for second-order arithmetic consists of a pair (\mathbb{N}, \mathcal{P}(\mathbb{N})), where \mathbb{N} is the domain of standard natural numbers equipped with the usual operations of addition (+), multiplication (·), and the order relation (<), along with constants 0 and 1, and \mathcal{P}(\mathbb{N}) denotes the full power set of \mathbb{N}, serving as the domain for second-order variables that range over all possible subsets of natural numbers.[3][1] The satisfaction relation for formulas in second-order arithmetic follows a Tarskian definition, recursively specifying when a structure satisfies a given formula under a variable assignment. For the first-order fragment, satisfaction is defined in the standard way over \mathbb{N}, evaluating atomic formulas using the interpretations of predicates and functions. Second-order quantifiers, such as \forall X \phi or \exists X \phi where X is a second-order variable, are satisfied if the formula \phi holds for all (or some) elements of \mathcal{P}(\mathbb{N}), respectively, with the assignment interpreting X as an arbitrary subset of \mathbb{N}.[1][6] Two primary semantic frameworks distinguish interpretations of second-order arithmetic: full semantics and Henkin semantics. In full semantics, second-order quantifiers range over the complete power set \mathcal{P}(\mathbb{N}), including all subsets, which ensures a robust interpretation but leads to incompleteness with respect to provability since not all subsets are definable. Henkin semantics, in contrast, permits a partial interpretation where second-order quantifiers range over a restricted collection of subsets (e.g., those definable by formulas in the language), allowing for a completeness theorem but yielding non-categorical models that may not capture the full expressive power of the theory.[1][3] In the standard model of second-order arithmetic, denoted (\mathbb{N}, \mathcal{P}(\mathbb{N}), +, \cdot, 0, 1, <), truth is determined by evaluating formulas against all subsets of \mathbb{N}, encompassing both recursive (computable) sets and non-recursive sets that cannot be effectively described by any algorithm. This model uniquely characterizes the intended structure up to isomorphism under full semantics, as the second-order induction axiom and comprehension principles enforce the standard naturals and their full power set.[3][1]Axioms
Second-order arithmetic, often denoted as \mathrm{Z_2} or SOA, is formalized in a two-sorted first-order language with individual variables for natural numbers and set variables for subsets of the natural numbers, including symbols for zero (0), one (1), addition (+), multiplication (×), ordering (<), and membership (∈). The successor function is expressed by the term n + 1.[3] The basic axioms consist of the first-order Peano axioms adapted to this language, governing the structure of the natural numbers and the arithmetic operations. These include:- ∀n (n + 1 ≠ 0)
- ∀m ∀n (m + 1 = n + 1 → m = n)
- ∀m (m + 0 = m)
- ∀m ∀n (m + (n + 1) = (m + n) + 1)
- ∀m (m × 0 = 0)
- ∀m ∀n (m × (n + 1) = (m × n) + m)
- ¬∃m (m < 0)
- ∀m ∀n (m < n + 1 ↔ (m < n ∨ m = n))