Base Rate Fallacy

Also Known As: base rate neglect prosecutor's fallacy prior probability neglect

Statistical Error ID: base_rate_fallacy

Definition

The base rate fallacy occurs when people ignore or underweight the prior probability (base rate) of an event when evaluating conditional probabilities. Instead, they focus on specific, often vivid information about the individual case. This is a violation of Bayesian reasoning, where the posterior probability must account for both the likelihood of the evidence given the hypothesis and the prior probability of the hypothesis itself.

Examples

A medical test for a rare disease (prevalence 1 in 10,000) has a 99% accuracy rate. A patient tests positive and believes they almost certainly have the disease. In reality, with 10,000 tests, roughly 100 false positives occur versus just 1 true positive, giving only about a 1% chance of actually having the disease.

An airport security algorithm flags a passenger as a potential threat with '95% accuracy.' The security officer treats this as near-certain danger. But if only 1 in 50,000 passengers is actually a threat, the vast majority of flagged passengers are false positives — innocent travelers caught by the algorithm.

A hiring manager uses a personality test the vendor claims is '90% accurate' at identifying high performers. She assumes a high-scoring candidate will almost certainly excel. But if only 10% of applicants are genuinely high performers, most high scorers are still average employees who happened to test well.

Verification Steps

Verification Steps

Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.

Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

View in glossary →

Binary (yes/no) questions an LLM must answer to identify this aspect:

1

Is a probability or likelihood estimate being made?
Type: binary
2

Is specific individual-case information being given more weight than general base rates?
Type: binary
3

Would incorporating the base rate significantly change the conclusion?
Type: binary

Description

Why It Works

Specific, concrete information (the test result) feels more relevant than abstract statistical information (the disease prevalence). Humans are intuitively poor at integrating base rates into probabilistic judgments.

How to Counter

Always ask 'How common is this condition or event in the first place?' Use natural frequencies rather than percentages (e.g., '1 out of 101 positive tests is a true positive') to make base rates intuitive.

Also Known As

base rate neglect prosecutor's fallacy prior probability neglect

Real-World Context

This fallacy is critical in medical screening programs, criminal forensics (DNA match probabilities), and airport security (flagging false positives among millions of travelers).

Related Aspects

Base Rate Neglect Conjunction Fallacy Ratio Bias (Denominator Neglect) Neglect of Probability

Related Aspects

→ correlates with

Availability Heuristic

Forming worldview based on examples that come most easily to mind.

→ correlates with

Hasty Generalization

Drawing broad conclusions from limited, unrepresentative, or anecdotal evidence.

← correlates with

Bayesian Reasoning

Probability-based belief revision using Bayes' theorem.

← correlates with

Spectrum Bias

Diagnostic test accuracy varies when evaluated across different disease severity levels.

← correlates with

Insensitivity to Sample Size

The tendency to draw strong conclusions from small samples, failing to recognize that small samples are more variable and less reliable than large ones.

← correlates with

Law of Small Numbers

Believing that small samples accurately represent the underlying population distribution.

← correlates with

Illusion of Validity

The tendency to overestimate the accuracy of one's judgments, especially when available information is internally consistent, even if the information is limited or unreliable.

← correlates with

Friendship Paradox

On average, people's friends have more friends than they do, due to sampling bias toward popular nodes.

← correlates with

Inspection Paradox

Random observation of a process is more likely to catch long-duration events than short ones.

← correlates with

Accuracy Paradox

A model with higher accuracy can have worse predictive power than a less accurate one on imbalanced data.

← correlates with

Lindley's Paradox

Bayesian and frequentist approaches yield contradictory conclusions with large sample sizes.