Endogeneity Bias — When Logic Wears a Disguise
Endogeneity bias arises when an independent variable in a regression model is correlated with the error term, violating a core assumption of ordinary least squares estimation. This can occur through omitted variables, measurement error, or simultaneous causation. The result is biased and inconsistent coefficient estimates that do not reflect true causal relationships.
Also known as: Simultaneity bias, Endogeneity problem
How It Works
Many real-world relationships involve feedback loops or shared unobserved causes. Standard regression assumes the independent variable is determined outside the system, but this assumption often fails in observational data.
A Classic Example
A study examines whether police presence reduces crime by regressing crime rates on the number of officers. However, cities with more crime hire more police, so police presence is endogenous — it is both a potential cause and a consequence of the crime rate.
More Examples
A marketing analyst regresses a brand's sales on its advertising spend and finds a weak positive effect, concluding advertising barely works. In reality, the company increases advertising precisely when sales are already declining, making ad spend negatively correlated with underlying demand — the reverse causality attenuates the true effect.
Researchers studying whether higher wages reduce employee absenteeism find almost no relationship in their regression. However, firms that already experience high absenteeism tend to raise wages to retain staff, creating reverse causality that obscures the genuine negative effect of wages on absenteeism.
Where You See This in the Wild
Pervasive in economics and policy evaluation, such as estimating the effect of education on earnings (ability confounds both), or the impact of institutions on economic growth.
How to Spot and Counter It
Use instrumental variable estimation, natural experiments, or regression discontinuity designs. Clearly articulate why a variable is believed to be exogenous. Test for endogeneity using Hausman tests or similar diagnostics.
The Takeaway
The Endogeneity Bias is one of those reasoning errors that sounds perfectly logical at first glance. That's what makes it dangerous — it wears the costume of valid reasoning while smuggling in a broken conclusion. The best defense? Slow down and ask: does this conclusion actually follow from these premises, or am I just connecting dots that happen to be near each other?
Next time someone presents you with an argument that "just makes sense," check the structure. The feeling of logic is not the same as logic itself.