Apps

🧪 This platform is in early beta. Features may change and you might encounter bugs. We appreciate your patience!

Simpson's Paradox

Also Known As: reversal paradox amalgamation paradox Yule-Simpson effect
Statistical Error ID: simpsons_paradox

Definition

Simpson's Paradox occurs when a trend that appears in several different groups of data reverses or disappears when these groups are combined. This happens because of a lurking variable that changes the composition of the groups. The paradox reveals that aggregated data can tell a fundamentally different story than disaggregated data, making the choice of how to partition data a critical analytical decision.

Examples

University admission data shows that overall, women are admitted at a lower rate than men (apparent gender bias). But when broken down by department, women are admitted at equal or higher rates in every single department. The reversal occurs because women disproportionately applied to highly competitive departments with low admission rates for everyone.

An online platform reports that its new recommendation algorithm increases average time spent per user overall. But when broken down by user type, both casual users and power users actually spend less time — the overall increase is driven entirely by a surge in new user sign-ups, who naturally spend more time exploring the platform.

A school district reports that its overall average test scores improved after a new curriculum was introduced. However, when scores are broken down by school, every individual school shows a decline. The district-level improvement is an artifact of higher-performing schools growing in enrollment while lower-performing schools shrank.

Verification Steps
Verification Steps
Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.
Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

Binary (yes/no) questions an LLM must answer to identify this aspect:

  1. 1

    Is aggregated data being used to draw a conclusion?

    Type: binary
  2. 2

    Does the trend reverse or disappear when the data is broken into subgroups?

    Type: binary
  3. 3

    Is a confounding variable (lurking variable) driving the reversal?

    Type: binary
Deep Dive
The expandable detail section on each aspect page with examples, psychology, and counter-strategies.
The Deep Dive section provides in-depth information about each aspect: a real-world example showing the pattern in action, an explanation of why it works psychologically, practical advice on how to counter it, alternative names, and links to related aspects.

Hierarchical Context