Sure-thing principle
The Sure-thing principle is a foundational axiom in decision theory, introduced by Leonard J. Savage in his 1954 book The Foundations of Statistics, stating that if an individual prefers act f to act g upon learning that event E has occurred, and also prefers f to g upon learning that E has not occurred, then they should prefer f to g unconditionally, without knowledge of E.[1] This principle captures the intuition that irrelevant uncertainties should not influence rational choice, using Savage's classic example of a businessman deciding whether to buy property: if the purchase seems worthwhile regardless of whether the Republican or Democrat wins the presidential election, the election outcome is irrelevant to the decision.[2] Savage presented the sure-thing principle informally as an overarching desideratum for rational behavior under uncertainty, motivating two key postulates in his axiomatization of subjective expected utility (SEU) theory: P2 (separability) and P3 (state-independence).[3] These postulates ensure that preferences are separable across states of the world and independent of irrelevant events, leading to the representation of preferences by a utility function and subjective probabilities.[3] The principle is universally regarded as a core extralogical requirement for decision-making, distinguishing it from purely logical necessities, and it underpins Bayesian inference by formalizing how agents update beliefs and choices without being swayed by causally inert information.[1] Despite its foundational status, the sure-thing principle has faced scrutiny in contexts involving causal dependencies or dynamic inconsistencies. For instance, a causal variant (Causal Sure-Thing Principle) requires that acts do not affect the event E, as formalized in later work distinguishing evidential from causal independence.[1] Recent analyses have shown that the principle, when fully formalized, equates to dominance conditions that simplify SEU axiomatizations, such as those by Savage and Anscombe-Aumann, and can even weaken P3 to a strengthened form of obvious dominance while preserving rationality.[3] However, violations arise in non-causal settings, like the Allais paradox, prompting alternative theories such as prospect theory or ambiguity aversion models that relax the principle to better fit empirical behavior.[4] Overall, the sure-thing principle remains a benchmark for normative decision theory, influencing fields from economics to artificial intelligence in modeling uncertainty.[4]Background
Decision Theory Context
Decision theory addresses how rational agents make choices under uncertainty, evolving significantly in the mid-20th century to incorporate both objective probabilities and subjective beliefs.[5] In 1944, John von Neumann and Oskar Morgenstern introduced an axiomatic foundation for expected utility theory, applicable to situations of risk where probabilities of outcomes are objectively known, such as in games of chance or lotteries with specified odds.[5] Their framework posits that rational preferences over lotteries can be represented by the maximization of expected utility, where utility reflects the agent's valuation of outcomes weighted by their known probabilities.[5] However, this approach was limited to scenarios with quantifiable probabilities, leaving unresolved the broader class of decisions under uncertainty where such probabilities are unknown or undefined, such as personal judgments about future events.[5] The 1950s marked a pivotal historical shift toward axiomatic systems that derived both utilities and probabilities from observable preferences alone, without presupposing objective chances. This development, influenced by earlier ideas from Frank Ramsey and Bruno de Finetti on subjective probability, aimed to provide a normative foundation for rational choice in all uncertain environments by treating probabilities as degrees of belief elicited from behavior.[5] Central to these subjective approaches are key concepts: acts, defined as functions mapping states of the world to outcomes; states, which partition possibilities into exhaustive and mutually exclusive events; and consequences, the tangible results of acts in given states.[5] Subjective probabilities emerge from the agent's preference orderings over acts, allowing uncertainty to be quantified through comparative choices rather than external data.[5] This contrasts sharply with objective uncertainty, or risk, where probabilities are provided by the decision context, versus subjective uncertainty, where they stem from the agent's internal beliefs and are not verifiable independently.[6] Leonard Savage's 1954 work exemplified this paradigm, extending von Neumann-Morgenstern utility to a fully subjective setting.[5]Savage's Framework
Leonard J. Savage developed his axiomatic framework for decision making under uncertainty in his 1954 book The Foundations of Statistics, where he models choices in situations without objective probabilities.[7] The setup assumes a finite set of states of the world S, representing all possible mutually exclusive and exhaustive scenarios that might occur, and a set of consequences C, denoting the possible outcomes or payoffs relevant to the decision-maker.[7] Acts are defined as functions from S to C, mapping each state to a specific consequence, and the decision-maker expresses preferences over these acts using relations such as strict preference \succ, indifference \sim, and weak preference \succeq.[7] Savage's system relies on a set of postulates, or axioms, that constrain these preferences to ensure rational consistency. The postulates are as follows:- P1 (Ordering): The preference relation \preceq is a weak order, meaning it is complete (for any two acts f and g, either f \preceq g or g \preceq f) and transitive (if f \preceq g and g \preceq h, then f \preceq h). This ensures preferences are consistent and comparable.[7]
- P2 (Sure-thing principle): If acts f and g agree outside event B and f' and g' agree outside B, with f and f' agreeing inside B and similarly for g and g', and f \prec g, then f' \prec g'. This ensures consistency of preferences under modifications outside an event.[7]
- P3 (Conditional preferences): If f = g and f' = g', and B is non-null, then f \prec f' conditional on B if and only if g \prec g' conditional on B. This ensures that preferences among identical acts are consistent conditional on non-null events.[7]
- P4 (Independence of prize size): Preferences between acts that offer constant consequences except on disjoint events depend only on the qualitative probabilities of those events, independent of the specific consequences involved. This helps define subjective probabilities.[7]
- P5 (Non-degeneracy): There exist at least two consequences such that one is strictly preferred to the other, ensuring preferences are not universally indifferent.[7]
- P6 (Qualitative probability): If g \prec h, there exists a finite partition of S such that modifying g or h on one part to a fixed consequence f preserves the preference ordering. This enables the construction of probability scales.[7]
- P7 (Dominance): If f \prec g(s) conditional on B for every state s \in B, then f \prec g conditional on B. This formalizes a dominance condition akin to the sure-thing principle.[7]