Fact-checked by Grok 2 weeks ago

Wason selection task

The Wason selection task, commonly referred to as the four-card problem, is a in designed to evaluate individuals' ability to apply and falsification principles to test conditional rules. Developed by British psychologist , the task presents participants with a conditional statement, such as "If there is a D on one side of the card, then there is a 3 on the other side," and four cards laid out face-up, each showing either a letter (D or k) or a number (3 or 7) from one side, with the opposite side unknown. The objective is to select only the cards that must be turned over to definitively determine whether the rule holds true or false for the entire set, with the logically correct choices being the D card (to check for a non-3) and the 7 card (to check for a D, as its presence would violate the contrapositive). Despite its apparent simplicity, performance on the abstract version of the task reveals systematic biases in human reasoning, with meta-analytic evidence indicating that only approximately 19% of participants across numerous studies select the correct cards (p and not-q), while a majority erroneously include the not-p (k) or q (3) cards, often driven by a tendency toward rather than falsification. This low success rate, first documented in Wason's original experiments where fewer than 10% succeeded, underscores the challenge of applying formal logic to abstract scenarios and has positioned the task as a cornerstone for studying . Subsequent research has highlighted a content effect, where success rates dramatically improve—to around 64% in deontic (social contract or permission) versions, such as "If a person is drinking , then they must be over 19"—suggesting that contextual , evolutionary adaptations for cheater detection, or pragmatic inferences facilitate reasoning when rules involve obligations or prohibitions. The task's enduring influence spans theories of rational analysis, Bayesian modeling of data selection, and dual-process accounts of intuitive versus deliberative , with over 200 experiments confirming its robustness in revealing discrepancies between normative and everyday .

Background

Origin and development

The Wason selection task was devised in 1966 by , an English cognitive psychologist based at . Wason developed the task as an experimental tool to investigate human processes, building on his prior research into cognitive biases. It was first described in Wason's chapter titled "Reasoning," published in the edited volume New Horizons in Psychology by B. M. Foss. In this work, Wason introduced the task to empirically demonstrate —the tendency to favor information that supports a while neglecting that could refute it—in the context of conditional statements. The design drew inspiration from Karl Popper's , particularly the principle of falsification, which emphasizes testing hypotheses through potential disconfirmation rather than mere . The task evolved from Wason's earlier experiments on , such as the 2-4-6 problem introduced in 1960, which similarly highlighted confirmation-seeking behaviors but focused on rule discovery. Initial studies using abstract versions of the selection task, involving , yielded low success rates, with approximately 10% of participants identifying the cards necessary to falsify the rule. These findings underscored the challenges in applying logical falsification in non-concrete scenarios and laid the groundwork for subsequent research in the of reasoning.

Significance in cognitive science

The Wason selection task exemplifies , where individuals preferentially seek confirming evidence for a while neglecting potentially falsifying information, a phenomenon first systematically demonstrated through Peter Wason's experimental paradigm. This bias highlights fundamental limitations in human hypothesis testing, as people often fail to select cards that could disprove a conditional rule, instead focusing on affirmative instances. Wason's work established this as a core cognitive error, influencing subsequent research on how such tendencies persist across diverse populations and contexts. As a cornerstone in cognitive psychology, the task probes the distinction between deductive reasoning—requiring logical falsification—and inductive reasoning, which favors pattern confirmation, revealing systematic errors in abstract logical inference. It has become a benchmark for studying reasoning biases, with early experiments showing success rates as low as 10% in abstract forms, underscoring the challenge of applying formal logic in non-contextual scenarios. This has positioned the task as essential for exploring how cognitive processes deviate from normative models of rationality. The task's influence extends to , notably through ' application, which posited that enhanced performance on social contract versions reflects evolved cognitive modules for detecting cheating in cooperative exchanges rather than general logical ability. In the , it aligns with Karl Popper's falsification principle, illustrating why scientific progress demands rigorous disconfirmation over mere verification. By 2025, the task has been cited in over 2,000 studies, including meta-analyses that confirm persistent low performance rates of 10-25% on abstract rules across decades of replication. Beyond psychology, the task informs applications in , where models are evaluated for human-like reasoning errors to improve decision-support systems—recently including tests of large models that exhibit similar confirmation biases on abstract versions—and in training programs that teach falsification strategies to mitigate biases in professional decision-making, such as in and .

Task Description

Standard presentation

In the standard presentation of the Wason selection task, participants are individually presented with four cards laid out on a table or screen, each showing only one side. These cards are drawn from a larger set where every card has a letter on one side and a number on the other. The experimenter provides a conditional rule, such as: "If there is a on one side of a card, then there is an even number on the other side." The visible sides of the cards display an 'A' (), a 'D' (), a '4' (even number), and a '7' (odd number). Participants are instructed to select and turn over only those cards necessary to determine whether the rule holds true or false for the entire set, emphasizing the need to test for potential violations without unnecessary turns. This abstract version relies on arbitrary letters and numbers to isolate pure . Concrete versions adapt the format to everyday scenarios for contextual relevance, such as the rule: "If a is drinking , then they must be at least 19 years old," with cards showing "drinking ," "drinking ," "16 years old," and "25 years old." Participants receive the same instructions to select cards that could confirm or disconfirm the rule. The task is typically conducted without time constraints in controlled settings, using paper-based materials in early studies or computerized interfaces in contemporary research, where selections are made by circling options, pointing, or clicking. Cards are arranged in random order to avoid positional biases, and the setup ensures participants understand that turning a card reveals the hidden side completely.

Logical structure of the conditional

The Wason selection task employs a conditional rule of the form "if P, then Q," where P represents the presence of a vowel on one side of a card and Q represents an even number on the other side. This rule is formally interpreted as a material conditional in propositional logic, denoted as P \to Q, which asserts that whenever the antecedent P is true, the consequent Q must also be true. The of the P \to Q is determined by a standard , which specifies its behavior across all possible combinations of truth values for P and Q:
PQP \to Q
TrueTrueTrue
TrueFalseFalse
FalseTrueTrue
FalseFalseTrue
As shown, the conditional is false only in the case where P is true and Q is false; all other combinations yield true. In the task, four cards are presented, each showing either an instance of P (e.g., "A" for ), not-P (e.g., "D" for consonant), Q (e.g., "4" for even number), or not-Q (e.g., "7" for odd number). To verify the rule logically, participants must identify cards that could potentially falsify it, which requires examining instances where P is true (to check if Q holds) and where not-Q is true (to check if P is absent, per the contrapositive \neg Q \to \neg P). Thus, the cards showing P and not-Q are the critical ones for testing, as they alone can reveal the falsifying combination from the .

Logical Solution

Falsification principle

The falsification principle, as applied to the Wason selection task, stems from Karl Popper's demarcation criterion for scientific theories, which requires that hypotheses be empirically testable through potential disproof rather than mere confirmation. In his seminal 1934 work Logik der Forschung—later translated and expanded as in 1959—Popper argued that genuine scientific knowledge advances by conjectures that are bold and refutable, emphasizing rigorous attempts to falsify theories via contradictory evidence. This approach contrasts with inductivist methods that accumulate verifying instances, which Popper deemed insufficient for establishing truth. Within the Wason selection task, the principle directs reasoners to identify only those cases that could potentially disprove the conditional rule P → Q, focusing on that might reveal an instance of P paired with not-Q. Instances consistent with the rule, such as not-P (regardless of the consequent) or Q (which may follow from not-P), cannot falsify it and thus hold no diagnostic value for refutation. By prioritizing disconfirmatory , the strategy aligns with Popper's view that scientific progress depends on eliminating false hypotheses rather than seeking perpetual affirmation. Peter Wason, whose work was influenced by , designed the task to illustrate how everyday reasoning often deviates from this scientific ideal, probing hypothesis elimination in cognitive processes. The selection task, in particular, illustrates the tension between confirmation-seeking tendencies and the demands of falsification, highlighting cognitive biases that impede logical rigor in non-scientific contexts.

Correct card selections

In the standard Wason selection task, the rule to be tested is a conditional statement: "If there is a vowel on one side of a card, then there is an even number on the other side," denoted logically as P → Q, where P represents a vowel and Q an even number. The four cards displayed show A (a vowel, instance of P), D (a consonant, instance of not-P), 4 (an even number, instance of Q), and 7 (an odd number, instance of not-Q). To determine whether the rule holds for all cards, only those that could potentially falsify it need to be turned over, based on the falsification principle that a conditional is violated solely when P is true but Q is false. The logically correct selections are the A and 7 cards. The A card must be turned because it shows P (vowel); if the other side is odd (not-Q), the rule is falsified. Similarly, the 7 card must be turned because it shows not-Q (odd); if the other side is a (P), this again falsifies the rule by presenting a case of P without Q. In contrast, the D card (, not-P) is irrelevant, as the rule makes no claim about cards without a ; whether the other side is even or odd cannot violate P → Q. The 4 card (even, Q) also need not be turned, since an even number on the other side—whether paired with a or —confirms the rule if P is present but cannot disprove it. A common error pattern involves selecting the A and 4 cards, reflecting a tendency toward by seeking instances that support the rule rather than those that could refute it. This approach fails logically because only cards representing P and not-Q can provide evidence against the conditional.

Empirical Performance

Abstract rule results

In Peter Wason's seminal 1968 experiment employing an abstract conditional rule ("If there is a on one side of the , then there is an even number on the other side") with cards showing A, D, 4, and 7, only about 10% of the 62 undergraduate participants correctly selected the A and 7 cards to test the rule. A comprehensive of 104 experiments on abstract versions of the task, spanning decades of research, reported an average success rate of 19% for selecting the logically appropriate cards (antecedent and negated consequent). This low performance level shows substantial consistency across studies ( = 0.34, p < 0.001) and holds steady regardless of cultural context, indicating a robust tendency to underperform on decontextualized logical . Participants commonly err by favoring verification over falsification, with approximately 39% selecting the A and 4 cards—those that could confirm the —while neglecting the 7 card that could disprove it; selections of the irrelevant D card occur in a smaller proportion of cases. Although formal training has limited effects on abstract performance, specialized instruction can modestly improve accuracy, though success remains below 50% without contextual aids, underscoring persistent challenges in applying pure deductive principles.

Concrete rule facilitation

Performance on the Wason selection task improves when conditional are embedded in , familiar contexts rather than abstract ones, though the degree varies by type: everyday content averages around 29%, while deontic (obligation-based) rules reach 64%. In deontic scenarios, participants more readily identify cards that could falsify the rule. A seminal demonstration of this concrete rule facilitation came from Cheng and Holyoak (), who introduced the concept of pragmatic reasoning , abstract knowledge structures activated by thematic content. In their Experiment 1, they presented a permission schema rule—"If one is to drive on campus, then one must have a parking permit"—to American and participants. American subjects selected the logically correct cards (antecedent and consequent-negation) 74% of the time, while Hong Kong subjects did so 92% of the time, far exceeding abstract task baselines. Representative examples illustrate this effect across domains. For instance, the rule "If a person is drinking , then they must be over 19 years old," with cards showing a beer glass, age 25, soda, and age 16, elicits correct selections from about 74-80% of participants in various studies, as the social regulation context cues violation-checking. Similarly, a rule like "If a letter is sent to , then it has a Boston postmark," with cards for Boston address, 5-cent stamp, address, and 3-cent stamp, yielded a ~75% success rate in early studies, highlighting facilitation from routine . Medical prescription scenarios, such as "If a is prescribed a , then they must have a certain symptom," produce comparable boosts in deontic contexts, with participants focusing on potential non-compliance. Meta-analytic evidence from 228 experiments (as of ) underscores the robustness of this facilitation, particularly for deontic content (64%) and to a lesser extent everyday content (29%), compared to 19% in abstract versions; the effect's magnitude varies by rule type and population.

Theoretical Explanations

Social contract theory

The social contract theory posits that human reasoning abilities have been shaped by to facilitate the detection of cheaters in social exchanges, providing a domain-specific explanation for performance on the Wason selection task.90023-1) Proposed by in her doctoral dissertation and elaborated in her 1989 paper, the theory argues that the mind includes specialized cognitive modules for deontic reasoning—evaluations of permissions, obligations, and prohibitions—particularly within the context of s, which are conditional rules of the form "If you take the , then you must pay the ."90023-1) According to this view, when the task's conditional rule is interpreted as a social contract, participants are triggered to search for violations, such as individuals who accept a without incurring the required , leading to selection of the cards representing the antecedent () and the negated consequent (no paid).90023-1) Empirical support for the comes from Cosmides' experiments, where participants achieved approximately 70% accuracy in selecting the logically correct cards (P and not-Q) when the rule involved cheater detection in a social exchange scenario, such as a drinking-age framed as a permission to drink only if over 19, compared to only about 20% accuracy on abstract versions of the task with neutral content like letters and numbers.90023-1) This facilitation is attributed to an evolved "look-for-cheaters" that prioritizes falsifying evidence of rule violations in interactions, enhancing survival in ancestral environments where detecting non-reciprocators was adaptive.90023-1) The emphasizes that this mechanism operates automatically when social contract cues are present, bypassing the difficulties of general logical seen in non-social contexts. Criticisms of theory highlight challenges to its claims of domain-specific modularity and evolutionary specialization. Jerry Fodor, in his 1983 framework for , argued that the mind's modular components are largely confined to peripheral sensory processes, questioning the feasibility of numerous content-specific modules like a dedicated cheater-detection system as proposed by Cosmides. Fodor later reinforced this in 2000, suggesting that high performance on cheater-detection tasks could stem from general interpretive strategies rather than an innate module.00074-2) Additionally, subsequent studies have indicated an overemphasis on uniquely social elements, as pragmatic reasoning schemas—such as permissions without explicit evolutionary framing—also elicit similar facilitation, suggesting broader deontic or contextual cues suffice beyond strict social contracts.90016-4)

Mental models and pragmatic approaches

The mental models theory, proposed by Philip N. Johnson-Laird, posits that individuals reason by constructing and manipulating mental representations of possible situations consistent with the premises, rather than applying formal logical rules. In the context of the Wason selection task, reasoners build initial models of the conditional rule, such as "if P then Q," focusing on the most salient or explicit possibilities while overlooking implicit ones that could falsify the rule. This partial enumeration leads to incomplete card selections, as individuals prioritize models that confirm the rule over those that explore alternatives. Pragmatic reasoning schemas, developed by Patricia W. Cheng and Keith J. Holyoak, suggest that performance improves when the task evokes domain-specific knowledge structures tied to goals like permissions or obligations, which guide evidence-seeking strategies. For instance, a permission schema prompts selection of cards that check for violations, such as antecedent-present and consequent-absent cases, by framing the conditional as a regulatory rule rather than an abstract problem. These schemas are acquired through experience and activated linguistically, explaining why concrete scenarios facilitate correct responses without relying on evolutionary adaptations. Relevance theory, advanced by Dan Sperber, Deirdre Wilson, and colleagues, argues that card selections arise from inferential processes aimed at maximizing relevance in communication, where is chosen based on its potential to yield contextual effects rather than strict logical . In the selection task, participants interpret the rule's as conveying implicatures about what counts as useful , leading to selections that align with everyday pragmatic inferences over formal semantics. This approach emphasizes cognitive efficiency, predicting that abstract rules elicit fewer relevant inferences, resulting in poorer performance. These theories collectively account for the task's abstract difficulties—due to limited model construction, absent schemas, or low relevance—while concrete rules succeed through enriched representations and goal-directed inferences, all without positing domain-specific innate mechanisms. Recent computational models, including those using large language models, replicate these content effects, demonstrating human-like biases in reasoning without hardcoded , thus supporting the flexibility of mental, pragmatic, and relevance-based processes.

Extensions and Applications

Deontic variants

Deontic variants of the Wason selection task replace indicative conditional rules with deontic ones, involving permissions, obligations, or prohibitions derived from deontic logic, which specifies normative relations of what is permitted, required, or forbidden. These rules emphasize detecting violations rather than verifying truth, such as in a permission schema: "If an envelope has a Hamburg postmark, then it may have a German stamp." To test this, participants must select the card showing a Hamburg postmark (to check for a non-German stamp, which would be impermissible) and the card showing a non-German stamp (to identify impermissible combinations), thereby focusing on potential breaches of the permission. This shift aligns reasoning with the falsification principle by framing card selection as a search for rights violations or duty infringements, rather than abstract hypothesis testing. A seminal study by Griggs and Cox demonstrated the facilitative effect of deontic content using a U.S.-relevant rule: "If a is drinking , then they must be over 19." American undergraduates achieved a 74% correct selection rate on this task—choosing the beer-drinking and the under-19 —compared to approximately 20% success on the standard abstract rule ("If a shows a on one side, then it shows an even number on the other"). This improvement highlights how familiarity with deontic norms, such as regulations, cues participants to prioritize violation detection. Deontic variants extend to threat rules (e.g., "If you take the benefit, you cannot avoid the cost") and obligation rules (e.g., "If you want to enter, you must show "), both eliciting high performance by evoking similar normative concerns. A of over 80 deontic experiments confirms a robust facilitation effect, with an average () of 0.54 indicating better-than-chance performance independent of social or contractual content, underscoring the task's sensitivity to deontic framing over mere thematic familiarity.

Implications for reasoning research

In , the task serves as a for evaluating in large language models (LLMs), revealing both strengths and limitations in their deductive capabilities. Recent 2024 evaluations demonstrate that advanced LLMs, such as , achieve accuracy similar to human averages (around 10-20%) on abstract versions of the task, often selecting the correct falsifying cards through in data. However, these models struggle with deontic variants relative to humans, achieving around 30% accuracy compared to human rates of about 70% in scenarios, which underscores gaps in their understanding of contextual norms. This has led to targeted approaches using the task to enhance LLMs' resistance to biases in real-world applications like decision support systems. Criticisms of the task highlight potential overestimation of reasoning biases due to instructional framing, where abstract presentations may not reflect natural hypothesis testing environments. Recent analyses, including a 2019 critical , argue that some theoretical models fail to fully account for data patterns in the task. A 2025 state-of-the-art review calls for further on factors modulating deontic reasoning to better understand everyday cognitive processes. Future research directions include examining cross-cultural variations in task performance to test universality claims, with studies on indigenous groups like the Shiwiar showing higher deontic success rates potentially tied to ecological demands. Additionally, the task's emphasis on falsification has implications for misinformation detection, where training individuals to seek disconfirming evidence—mirroring correct selections—could mitigate belief in false claims, informing interventions in digital literacy programs.

References

  1. [1]
    [PDF] The Quarterly Journal of Experimental Psychology - MIT
    In the experimental group the subjects first carried out a selection task. They decided which of four cards (P, P, 9, G) would enable them to determine ...
  2. [2]
    [PDF] The Wason Selection Task: A Meta-Analysis - eScholarship
    Abstract. In Wason's selection task, participants select whichever of four cards could provide evidence about the truth or falsity of a conditional rule.
  3. [3]
    Peter Wason | The Independent
    Apr 22, 2003 · In 1950, he decided to start again by reading for a degree in Psychology at University College London. He was to stay at UCL for the rest of his ...
  4. [4]
    Wason selection task. - APA PsycNet
    Wason selection task. · Citation. Evans, J. S. B. T. (2017). · Abstract. Peter Wason is regarded by many as the founder of the modern psychology of reasoning.
  5. [5]
    Peter C. Wason, Reasoning - PhilPapers
    Peter C. Wason. In New Horizons in Psychology. Penguin 135-151 (1966). Abstract, This article has no associated abstract. (fix it). Keywords, No keywords ...Missing: British | Show results with:British
  6. [6]
    Wason selection task - Oxford Reference
    The Wason selection task is a logical puzzle where you determine which of four cards (E, K, 4, 7) to turn over to test a rule about vowels and even numbers. ...Missing: postmark Hamburg German<|control11|><|separator|>
  7. [7]
    Leda Cosmides and the Wason Selection Task
    The Wason selection test therefore evaluates subjects' ability to find facts that violate a hypothesis, specifically a conditional hypothesis of the form If P ...
  8. [8]
    On the failure to eliminate hypotheses in a conceptual task
    Apr 7, 2008 · This investigation examines the extent to which intelligent young adults seek (i) confirming evidence alone (enumerative induction) or (ii) confirming and ...
  9. [9]
    Reasoning about a Rule - P. C. Wason, 1968 - Sage Journals
    Two experiments were carried out to investigate the difficulty of making the contra-positive inference from conditional sentences of the form, “if P then Q.”Missing: original paper
  10. [10]
    The logic of social exchange: Has natural selection shaped how ...
    This article presents a series of experiments designed to test these hypotheses, using the Wason selection task, a test of logical reasoning.
  11. [11]
    The Wason Selection Task: A Meta-Analysis - eScholarship
    In Wason's selection task, participants select whichever offour cards could provide evidence about the truth or falsity ofa conditional rule.
  12. [12]
    [PDF] The elusive thematic-materials effect in Wason's selection task.
    drinking-age problem while no one did so for the abstract problem. On Trial ... Wason's selection task: Verification, falsification and matching.
  13. [13]
    Karl Popper - Stanford Encyclopedia of Philosophy
    Nov 13, 1997 · Every test of a theory, whether resulting in its corroboration or falsification, must stop at some basic statement or other which we decide to ...Missing: Wason | Show results with:Wason
  14. [14]
    Karl Popper: Philosophy of Science
    In the book, Popper offered his first detailed account of scientific methodology and of the importance of falsification.Background · Falsification and the Criterion... · Methodology in the Social...Missing: original | Show results with:original
  15. [15]
    Pragmatic versus syntactic approaches to training deductive reasoning
    In two experiments, syntactically equivalent forms of conditional rules produced different patterns of performance in Wason's selection task, depending on the ...
  16. [16]
    Pragmatic reasoning schemas - ScienceDirect.com
    Experiment 1 demonstrated that by evoking the permission schema it is possible to facilitate performance in Wason's selection paradigm for subjects who have had ...
  17. [17]
    Psychology Classics: Wason Selection Task (Part II)
    Oct 7, 2013 · This post is the second in a series on the Wason selection task (Part I), and part of our ongoing series exploring classic experiments and theories.<|control11|><|separator|>
  18. [18]
    Deduction or Darwinian Algorithms? An explanation of the “elusive ...
    This thesis develops the idea that natural selection has shaped how humans reason about evolutionarily important domains of human activity.
  19. [19]
    Mental models and human reasoning - PNAS
    Mental models and human reasoning. Philip N. Johnson-Laird ... RS Nickerson, Hempel's paradox and Wason's selection task: Logical and psychological problems of ...
  20. [20]
    Modal logics, possible worlds, and mental models
    Jul 16, 2024 · Everyone reasons about possibilities. This article explains how they could do so using mental models. The theory makes four major claims.
  21. [21]
    (PDF) Pragmatic Reasoning Schemas - ResearchGate
    Aug 6, 2025 · (1985). Facilitation in Wason's selection task: Content and instruction. Unpublished manuscript, Lake Erie College. Yachanin, S. A ...
  22. [22]
    Relevance theory explains the selection task - ScienceDirect.com
    We propose a general and predictive explanation of the Wason Selection Task (where subjects are asked to select evidence for testing a conditional “rule”).
  23. [23]
    [PDF] Relevance theory explains the selection task | Dan Sperber
    We propose a general and predictive explanation of the Wason Selection Task. (where subjects are asked to select evidence for testing a conditional "rule").
  24. [24]
  25. [25]
    The Brain Structure and Intrinsic Characters of Falsification Thinking ...
    Wason's selection task (WST) as a representative of the field of conditional proposition testing has been explored by multiple disciplines for more than 50 ...
  26. [26]
    Language models, like humans, show content effects on reasoning ...
    Jul 16, 2024 · However, in some cases the humans and models behave differently—particularly on the Wason task, where humans perform much worse than large ...Missing: benchmarks | Show results with:benchmarks
  27. [27]
    [PDF] Evaluating the Deductive Competence of Large Language Models
    Jun 16, 2024 · The Wason selection task is a reasoning task from the cognitive science literature that evaluates de- ductive competence (Wason, 1968).
  28. [28]
    Theories of the Wason Selection Task: a Critical Assessment of ...
    ... Wason's (1968, 1969) 'selection task'. This suggestion was tested by giving subjects the selection task with conditional rules in which the presence and ...Missing: original | Show results with:original
  29. [29]
    (PDF) DEONTIC REASONING AND WASON´S SELECTION TASK
    Oct 22, 2025 · Deontic selection task is a metainference problem in which subjects reason from a conditional rule involving deontic terms (must, should).
  30. [30]
    Cognitive adaptations for social exchange: Shiwiar evidence
    The Wason selection task (17–19) is a test of conditional reasoning in which subjects are asked to identify possible violations of a conditional rule of the ...
  31. [31]
    When Does an Individual Accept Misinformation? An Extended ...
    Misinformation and disinformation, such as “fake news”, are phenomena reaching far back in the history of the media. It typically refers to intentionally or ...