Winner's Curse

Also Known As: Effect size inflation Regression to the mean in meta-analysis

Aspect ID: winners_curse

Definition

The winner's curse states that the first statistically significant finding of an effect almost certainly overestimates the true effect size, due to the mathematical properties of significance testing combined with publication bias. To reach significance, an underpowered study must by chance observe an effect substantially larger than the true effect.

Examples

The first study reporting an association between a genetic variant and a trait finds an odds ratio of 3.2. Subsequent genome-wide association studies find the true odds ratio is 1.12. The original finding was the winner's curse — only an unusually large estimate happened to be statistically significant given the small sample.

The first published trial of a new antidepressant reports a dramatic effect size of d = 0.85, landing on the cover of a psychiatry journal. As subsequent larger trials accumulate, the meta-analytic effect size converges to d = 0.28 — a modest benefit. The original trial was published precisely because its result was striking, not because it was typical.

A startup's internal A/B test of a new checkout button color shows a 25% lift in conversions, prompting a company-wide redesign. When the experiment is repeated at scale over a longer period, the lift shrinks to 3%. The initial result was an upward fluctuation that crossed the significance threshold — and was therefore the one that got acted upon.

Verification Steps

Verification Steps

Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.

Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

View in glossary →

Binary (yes/no) questions an LLM must answer to identify this aspect:

1

Is this the first study to report a large, statistically significant effect in a new area?
Type: binary
2

Does the effect size appear implausibly large given the study's sample size?
Type: binary
3

Have subsequent larger studies replicated the effect with a similar magnitude?
Type: binary
4

Is publication bias plausible in this research area?
Type: binary

Description

Why It Works

Underpowered studies are common. With power of 50%, only the largest estimates from the sampling distribution will clear the significance threshold. Published results are therefore systematically biased upward regardless of researcher behavior.

How to Counter

Apply winner's curse correction methods (shrinkage, Empirical Bayes). Treat first-in-field effect sizes with skepticism. Look for large pre-registered studies or meta-analyses. Correct for publication bias.

Also Known As

Effect size inflation Regression to the mean in meta-analysis

Real-World Context

Candidate gene studies in psychiatry showed massive effect sizes in the 1990s and 2000s, most of which were eliminated by large GWAS studies.

Related Aspects

Single Study Generalization Publication Bias (File Drawer Problem) Regression Artifact

Try it in action

Use these tools to detect, analyze, or train this aspect.

🔍 Text Analyzer

Scan a text for this pattern

⚗️ Argument Lab

Analyze an argument step by step

🎓 Fallacy Trainer

Quiz yourself on this aspect