Apps

🧪 This platform is in early beta. Features may change and you might encounter bugs. We appreciate your patience!

Ghost Variables

Also Known As: lurking variables unmeasured confounders hidden third variables
Statistical Error ID: ghost_variables

Definition

Ghost variables are unmeasured or unacknowledged variables that influence both the independent and dependent variables in a study, creating a spurious apparent relationship. Unlike confounding variables which may at least be discussed, ghost variables are entirely absent from the analysis and often from the researcher's awareness. Their invisibility makes them particularly dangerous because there is nothing in the data itself that reveals their presence.

Examples

A study finds that children who eat breakfast perform better in school and concludes that breakfast improves academic performance. The ghost variable is household income: wealthier families are more likely to provide breakfast AND have children who perform better due to other resource advantages.

A city study finds that neighborhoods with more coffee shops have lower crime rates and concludes that coffee shops reduce crime by fostering community. The ghost variable is gentrification and rising property values, which simultaneously attract coffee shops and displace lower-income populations associated with higher crime statistics.

Researchers find that children who own more books score higher on reading tests and recommend that schools distribute free books. The ghost variable is parental education level, which predicts both the number of books in a home and the emphasis placed on reading and literacy development.

Verification Steps
Verification Steps
Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.
Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

Binary (yes/no) questions an LLM must answer to identify this aspect:

  1. 1

    Were data collected on multiple dependent variables?

    Type: binary
  2. 2

    Are only the significant variables reported in the final analysis?

    Type: binary
  3. 3

    Is the full set of measured variables disclosed?

    Type: binary
Deep Dive
The expandable detail section on each aspect page with examples, psychology, and counter-strategies.
The Deep Dive section provides in-depth information about each aspect: a real-world example showing the pattern in action, an explanation of why it works psychologically, practical advice on how to counter it, alternative names, and links to related aspects.

Hierarchical Context