Simpson's Paradox — When Logic Wears a Disguise
Simpson's Paradox occurs when a trend that appears in several different groups of data reverses or disappears when these groups are combined. This happens because of a lurking variable that changes the composition of the groups. The paradox reveals that aggregated data can tell a fundamentally different story than disaggregated data, making the choice of how to partition data a critical analytical decision.
Also known as: reversal paradox, amalgamation paradox, Yule-Simpson effect
How It Works
People trust aggregate statistics as objective summaries. The paradox exploits the assumption that what holds for the whole must hold for the parts, and vice versa.
A Classic Example
University admission data shows that overall, women are admitted at a lower rate than men (apparent gender bias). But when broken down by department, women are admitted at equal or higher rates in every single department. The reversal occurs because women disproportionately applied to highly competitive departments with low admission rates for everyone.
More Examples
An online platform reports that its new recommendation algorithm increases average time spent per user overall. But when broken down by user type, both casual users and power users actually spend less time — the overall increase is driven entirely by a surge in new user sign-ups, who naturally spend more time exploring the platform.
A school district reports that its overall average test scores improved after a new curriculum was introduced. However, when scores are broken down by school, every individual school shows a decline. The district-level improvement is an artifact of higher-performing schools growing in enrollment while lower-performing schools shrank.
Where You See This in the Wild
Simpson's Paradox famously appeared in the UC Berkeley gender bias case (1973) and regularly surfaces in medical treatment comparisons where patient severity varies between treatment groups.
How to Spot and Counter It
Always examine data at multiple levels of aggregation. Ask whether the composition of subgroups differs significantly and whether a confounding variable might reverse the observed trend.
The Takeaway
The Simpson's Paradox is one of those reasoning errors that sounds perfectly logical at first glance. That's what makes it dangerous — it wears the costume of valid reasoning while smuggling in a broken conclusion. The best defense? Slow down and ask: does this conclusion actually follow from these premises, or am I just connecting dots that happen to be near each other?
Next time someone presents you with an argument that "just makes sense," check the structure. The feeling of logic is not the same as logic itself.