Regression Artifact

Also Known As: Regression to the mean Galton's paradox

Aspect ID: regression_artifact

Definition

A regression artifact occurs when individuals are selected for a study or intervention because of extreme scores on a variable that contains measurement error, and subsequent measurements appear to improve simply because extreme scores tend to regress toward the population mean on remeasurement. This regression is a mathematical property of imperfect reliability, not a treatment effect.

Examples

Students who score in the bottom 10% on a reading test are enrolled in a remedial reading program. On follow-up testing, their scores improve substantially. However, a control group of equally low-scoring students who received no intervention also improves almost as much, due to regression to the mean.

A corporate wellness program enrolls the 15% of employees who scored highest on a stress screening questionnaire. Three months later, their average stress scores have dropped noticeably, and HR declares the program a success. However, extreme scores on any self-report measure naturally drift toward the mean on retesting, regardless of any intervention.

Athletes who have their worst-ever performance in a qualifying round are selected for an experimental sports psychology coaching program. Most of them perform better in the next competition. Coaches attribute the improvement to the program, not recognizing that an unusually bad performance is statistically likely to be followed by a more typical — and better — one.

Verification Steps

Verification Steps

Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.

Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

View in glossary →

Binary (yes/no) questions an LLM must answer to identify this aspect:

1

Were participants selected or identified based on extreme scores on the outcome variable?
Type: binary
2

Does performance improve in a follow-up measurement compared to selection?
Type: binary
3

Is a control group that was not selected based on extreme scores available for comparison?
Type: binary
4

Would the improvement be expected by regression to the mean alone, independent of any intervention?
Type: binary

Description

Why It Works

Any measurement with less than perfect reliability will show regression to the mean when extreme scorers are remeasured. Selection of extreme cases guarantees that the remeasured scores will be less extreme on average.

How to Counter

Include a control group selected by the same extreme-score criterion. Use repeated baseline measurements before treatment. Apply analysis of covariance (ANCOVA) correctly to adjust for regression to the mean.

Also Known As

Regression to the mean Galton's paradox

Real-World Context

Sports coaches who punish poor performance are surprised when performance improves afterward — regression to the mean, not punishment, is the likely explanation.

Related Aspects

Winner's Curse Type 1 Error (False Positive) Variance Neglect

Try it in action

Use these tools to detect, analyze, or train this aspect.

🔍 Text Analyzer

Scan a text for this pattern

⚗️ Argument Lab

Analyze an argument step by step

🎓 Fallacy Trainer

Quiz yourself on this aspect