False Positive Paradox

Also Known As: Base rate neglect in testing Screening paradox

Aspect ID: false_positive_paradox

Definition

The false positive paradox occurs when a highly accurate test applied to a rare condition produces more false positives than true positives in absolute terms. Even a test with 99% sensitivity and 99% specificity will produce one false positive for every true positive when testing a population with 1% prevalence, and ten false positives for every true positive at 0.1% prevalence.

Examples

A disease affects 1 in 1,000 people. A test has 99% sensitivity and 99% specificity. Testing 100,000 people: 100 true cases, of which 99 test positive. But 99,900 healthy people test, of which 999 test positive (false positives). There are 10 false positives for every true positive.

An airport security algorithm flags potential threats with 99% accuracy and a false positive rate of just 1%. On a day with 10,000 travelers, if only 10 are genuine threats, the system correctly catches 9 of them — but also wrongly detains 100 innocent passengers. For every real threat identified, roughly 11 innocent people are flagged alongside them.

A social media platform deploys an AI to detect bot accounts, claiming 98% accuracy. If only 0.5% of its 10 million users are bots — that's 50,000 bots — the system correctly identifies 49,000 of them but also falsely flags 199,000 real users. The overwhelming majority of accounts banned are actually legitimate human users.

Verification Steps

Verification Steps

Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.

Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

View in glossary →

Binary (yes/no) questions an LLM must answer to identify this aspect:

1

Is the test being applied to a low-prevalence condition?
Type: binary
2

Is the specificity of the test high enough to prevent false positives from dominating true positives?
Type: binary
3

Is the positive predictive value (PPV) calculated using the actual population prevalence?
Type: binary
4

Are absolute counts of true positives versus false positives reported, not just sensitivity and specificity?
Type: binary

Description

Why It Works

Sensitivity and specificity are conditional probabilities that seem impressive in isolation. The base rate transforms them into the positive predictive value (PPV), which is what matters for clinical and policy decisions.

How to Counter

Always calculate the positive predictive value: PPV = (sensitivity x prevalence) / [(sensitivity x prevalence) + (1 minus specificity) x (1 minus prevalence)]. Report absolute numbers, not just rates.

Also Known As

Base rate neglect in testing Screening paradox

Real-World Context

Airport security screening, mass COVID testing, and drug testing programs all face the false positive paradox; with rare conditions or infractions, most positives are false positives.

Related Aspects

Prosecutor's Fallacy Base Rate Neglect Overdiagnosis

Try it in action

Use these tools to detect, analyze, or train this aspect.

🔍 Text Analyzer

Scan a text for this pattern

⚗️ Argument Lab

Analyze an argument step by step

🎓 Fallacy Trainer

Quiz yourself on this aspect