Confounding Variable Neglect — When Numbers Lie
Has this ever happened to you? A study finds that coffee drinkers have higher rates of lung cancer and concludes coffee causes cancer.
Also known as: omitted variable bias, third variable problem, uncontrolled confounding
What's Actually Happening
Confounding variable neglect occurs when a study fails to account for a variable that is associated with both the treatment/exposure and the outcome, leading to biased estimates of the causal relationship. Unlike ghost variables which are unknown, confounding variables are often identifiable but are simply not controlled for in the analysis. This neglect can make a harmful treatment appear beneficial, or an effective treatment appear useless.
Observational data can only show associations. Without controlling for confounders, the observed association is a mixture of the true causal effect and the spurious effect of the confounder, and audiences rarely distinguish between the two.
Real Talk: You See This Every Day
A study finds that coffee drinkers have higher rates of lung cancer and concludes coffee causes cancer. The confounding variable is smoking: coffee drinkers in the study population are much more likely to smoke, and smoking is the actual cause of the elevated cancer rates.
Confounding is the central challenge in observational epidemiology, health policy research, and social science studies where randomized experiments are often impractical or unethical.
Your BS Detector
Use randomization to eliminate confounding, or apply statistical controls (regression, matching, stratification). Draw causal diagrams (DAGs) to identify potential confounders before analyzing data.
- ✓ Who collected this data, and why?
- ✓ Is the sample big enough and fair?
- ✓ Could there be another explanation?
The Challenge
Next time someone throws a statistic at you — in class, online, in the news — don't just accept it. Ask: what's missing from this picture?
Part of the TellDear Teen Book — criticalthinking.guide