Accuracy Paradox

Also Known As: Accuracy trap

Statistical Error ID: accuracy_paradox

Definition

The Accuracy Paradox occurs when a predictive model with higher overall accuracy performs worse at the task it was designed for than a model with lower accuracy. It typically arises when classes are imbalanced: a model that always predicts the majority class can score very high accuracy while never detecting a single minority-class case.

Examples

A fraud detection system classifies 99.5% of transactions correctly by labeling everything as legitimate, which means it catches no fraud at all. A competing model has only 95% accuracy but catches 80% of fraudulent transactions. The second model is far more useful despite its lower accuracy score.

A hospital deploys an AI model to screen chest X-rays for a rare lung condition affecting 1% of patients. The model achieves 99% accuracy simply by flagging nobody as sick. A second, 'less accurate' model at 96% overall accuracy correctly identifies 70% of true cases and is far more clinically useful, yet the first model looks superior on the headline metric.

A content moderation team evaluates two spam filters for their platform, where only 0.5% of posts are spam. Filter A scores 99.5% accuracy by approving every post. Filter B scores 97% accuracy but catches 85% of actual spam. Management almost deploys Filter A after seeing the numbers, not noticing it would let every single piece of spam through.
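The spam filter figures can be unpacked into confusion-matrix counts. The sketch below assumes a hypothetical batch of 100,000 posts; the batch size and the helper name `counts_from_rates` are illustrative and not part of the original example. It derives the counts implied by the stated spam rate, accuracy, and catch rate (recall).

```python
def counts_from_rates(total, positive_rate, accuracy, recall):
    """Derive confusion-matrix counts implied by prevalence, accuracy, and recall.

    Inputs are treated as exact rates; results are rounded to whole posts.
    """
    positives = round(total * positive_rate)   # posts that really are spam
    negatives = total - positives              # posts that are legitimate
    tp = round(positives * recall)             # spam caught
    fn = positives - tp                        # spam missed
    tn = round(total * accuracy) - tp          # correct predictions that are not TP
    fp = negatives - tn                        # legitimate posts wrongly flagged
    return {"tp": tp, "fn": fn, "tn": tn, "fp": fp}


# Hypothetical batch of 100,000 posts with the 0.5% spam rate from the example.
filter_a = counts_from_rates(100_000, 0.005, accuracy=0.995, recall=0.0)
filter_b = counts_from_rates(100_000, 0.005, accuracy=0.97, recall=0.85)

print("Filter A:", filter_a)
print("Filter B:", filter_b)
```

Under these assumptions, Filter A misses all 500 spam posts, while Filter B catches 425 of them and wrongly flags 2,925 legitimate posts. The headline accuracy figure hides both facts.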

Verification Steps

Binary (yes/no) questions an LLM must answer to identify this aspect:

1. Is the dataset highly imbalanced, with one class vastly outnumbering the other? (Type: binary)
2. Could a naive model achieve high accuracy simply by predicting the majority class? (Type: binary)
3. Does the model with higher accuracy fail to detect the minority class effectively? (Type: binary)
4. Are metrics like precision, recall, or F1-score being ignored in favor of overall accuracy? (Type: binary)

Description

Why It Works

Overall accuracy treats all correct predictions equally, regardless of class. When 99% of cases belong to one class, a trivial model that ignores the rare class achieves 99% accuracy. This masks its complete failure at the task that matters — identifying the rare but important events.
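As a minimal sketch of this point, the accuracy of a model that always predicts the most common label equals that label's share of the data. The function name `majority_baseline_accuracy` and the label counts are hypothetical, chosen to match the 99% case described above.

```python
from collections import Counter


def majority_baseline_accuracy(labels):
    """Accuracy of a trivial model that always predicts the most common label."""
    if not labels:
        raise ValueError("labels must not be empty")
    most_common_count = Counter(labels).most_common(1)[0][1]
    return most_common_count / len(labels)


# Hypothetical labels: 990 negatives and 10 positives (a 99% majority class).
labels = [0] * 990 + [1] * 10
baseline = majority_baseline_accuracy(labels)
print(f"{baseline:.2%}")  # prints 99.00%
```

Any candidate model's accuracy is more informative when read against this baseline: a score at or below it shows no evidence that the model has learned anything about the rare class.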

How to Counter

Evaluate models using metrics that reflect minority-class performance, such as precision, recall, F1-score, or area under the ROC curve. Use confusion matrices to inspect performance on each class separately. Never rely on accuracy alone when dealing with imbalanced datasets.
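One way to apply this advice is sketched below in plain Python, with no external libraries. The data is hypothetical: 1,000 cases with 10 positives, loosely mirroring the X-ray example, and the 33 false positives are an assumption chosen so that the second model's accuracy lands near 96%. Returning 0.0 when a metric's denominator is zero is a convention adopted here, not a universal rule.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for the chosen positive label."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, fn, tn


def evaluate(y_true, y_pred, positive=1):
    """Return accuracy alongside precision, recall, and F1 for the positive class."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    # Assumption: undefined ratios (zero denominator) are reported as 0.0.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Hypothetical data: 10 positives followed by 990 negatives.
y_true = [1] * 10 + [0] * 990
always_negative = [0] * 1000                             # flags nobody
screening = [1] * 7 + [0] * 3 + [1] * 33 + [0] * 957     # 7 hits, 33 false alarms

trivial_report = evaluate(y_true, always_negative)
screening_report = evaluate(y_true, screening)
print(trivial_report)
print(screening_report)
```

In this sketch the always-negative model wins on accuracy (0.99 versus 0.964) but has zero recall, while the screening model reaches a recall of 0.7. Its low precision of 0.175 is also visible, which is the kind of trade-off accuracy alone cannot surface.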

Real-World Context

This paradox is pervasive in medical diagnostics (rare diseases), cybersecurity (intrusion detection), manufacturing (defect detection), and any domain where the event of interest is rare but consequential.