Pseudo-Replication

Also Known As: Hurlbert's pseudo-replication Clustering error

Aspect ID: pseudo_replication

Definition

Pseudo-replication occurs when non-independent observations are treated as if they were statistically independent, artificially inflating sample size and deflating standard errors. This is common when multiple measurements are taken from the same individual, or multiple individuals come from the same cage or ecological plot. The result is vastly overconfident statistical tests.

Examples

A neuroscience study records spike activity from 50 neurons across 5 mice (10 neurons per mouse). If the analysis treats the 50 neurons as 50 independent observations, it commits pseudo-replication. Neurons from the same mouse are correlated. The true independent sample size is 5, not 50.

An educational researcher tests a new teaching method in one classroom of 30 students and a traditional method in a second classroom of 30 students. Analyzing the 60 students as 60 independent observations ignores that students within the same classroom share a teacher, classroom environment, and group dynamics. The true sample size for comparing methods is two classrooms, not sixty students.

A food scientist tests whether a new preservative extends shelf life by placing 20 samples from the same loaf of bread in treated bags and 20 samples from the same loaf in control bags. Treating these as 40 independent observations commits pseudo-replication — all treated samples share the properties of one loaf, and all controls share the properties of another.

Verification Steps

Verification Steps

Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.

Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

View in glossary →

Binary (yes/no) questions an LLM must answer to identify this aspect:

1

Are observations within groups or clusters truly independent of one another?
Type: binary
2

Is the statistical analysis treating sub-samples within units as independent observations?
Type: binary
3

Does the sample size claimed correspond to the number of independent experimental units, not the number of measurements?
Type: binary
4

Was a multilevel or mixed-effects model used to account for non-independence?
Type: binary

Description

Why It Works

The additional measurements within units feel like additional data and statistically look like additional degrees of freedom. Researchers are often unaware that the independence assumption is violated by their data structure.

How to Counter

Identify the true unit of randomization and replication. Use multilevel models or generalized estimating equations that account for clustering. Confirm that the reported n corresponds to independent experimental units.

Also Known As

Hurlbert's pseudo-replication Clustering error

Real-World Context

Pseudo-replication is endemic in animal research (multiple readings per animal treated as independent), genomics (correlated features within genes), and ecological studies.

Related Aspects

Atomistic Fallacy Model Selection Bias Type 1 Error (False Positive)

Try it in action

Use these tools to detect, analyze, or train this aspect.

🔍 Text Analyzer

Scan a text for this pattern

⚗️ Argument Lab

Analyze an argument step by step

🎓 Fallacy Trainer

Quiz yourself on this aspect