Apps

🧪 This platform is in early beta. Features may change and you might encounter bugs. We appreciate your patience!

Endogeneity Bias

Also Known As: Simultaneity bias Endogeneity problem
Statistical Error ID: endogeneity_bias

Definition

Endogeneity bias arises when an independent variable in a regression model is correlated with the error term, violating a core assumption of ordinary least squares estimation. This can occur through omitted variables, measurement error, or simultaneous causation. The result is biased and inconsistent coefficient estimates that do not reflect true causal relationships.

Examples

A study examines whether police presence reduces crime by regressing crime rates on the number of officers. However, cities with more crime hire more police, so police presence is endogenous — it is both a potential cause and a consequence of the crime rate.

A marketing analyst regresses a brand's sales on its advertising spend and finds a weak positive effect, concluding advertising barely works. In reality, the company increases advertising precisely when sales are already declining, making ad spend negatively correlated with underlying demand — the reverse causality attenuates the true effect.

Researchers studying whether higher wages reduce employee absenteeism find almost no relationship in their regression. However, firms that already experience high absenteeism tend to raise wages to retain staff, creating reverse causality that obscures the genuine negative effect of wages on absenteeism.

Verification Steps
Verification Steps
Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.
Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

Binary (yes/no) questions an LLM must answer to identify this aspect:

  1. 1

    Is there reason to suspect the independent variable is correlated with the error term in the model?

    Type: binary
  2. 2

    Could there be simultaneous causation between the independent and dependent variables?

    Type: binary
  3. 3

    Is the analysis treating an endogenous variable as if it were exogenous?

    Type: binary
  4. 4

    Has the study failed to use instrumental variables or other techniques to address endogeneity?

    Type: binary
Deep Dive
The expandable detail section on each aspect page with examples, psychology, and counter-strategies.
The Deep Dive section provides in-depth information about each aspect: a real-world example showing the pattern in action, an explanation of why it works psychologically, practical advice on how to counter it, alternative names, and links to related aspects.

Hierarchical Context