Law of Small Numbers

Also Known As: Hasty generalization from small samples Belief in the law of small numbers

Statistical Error ID: law_of_small_numbers

Definition

The law of small numbers is the erroneous belief that small samples should be representative of the population from which they are drawn, mirroring the statistical properties of the population in miniature. Named as an ironic counterpart to the actual law of large numbers, it reflects the cognitive tendency to expect patterns and regularities even in sequences too short to reliably display them. This leads to premature generalization, overinterpretation of noise, and false confidence in unreliable data.

Examples

A school district observes that three small rural schools (each with 30 students) rank among the top 10 in state test scores and concludes small schools are superior. They fail to notice that three other small schools rank in the bottom 10. Small schools appear at both extremes because their small samples produce volatile averages — not because of school quality.

An investor notices that a particular stock-picking newsletter correctly predicted the market direction three months in a row and immediately moves his savings into the recommended portfolio, convinced the analyst has a genuine edge — ignoring that with hundreds of newsletters, a few will get three in a row purely by chance.

A restaurant owner tries a new social media ad campaign for two weekends and gets unusually high foot traffic both times. She immediately cancels all other marketing and doubles her ad budget, not realizing that two weekends is far too small a sample to distinguish a real effect from normal weekly variation.

Verification Steps

Verification Steps

Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.

Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

View in glossary →

Binary (yes/no) questions an LLM must answer to identify this aspect:

1

Is a small sample being treated as if it accurately represents the population?
Type: binary
2

Are patterns observed in a small sample being assumed to be stable and generalizable?
Type: binary
3

Has the analysis failed to consider that small-sample results may simply reflect random variation?
Type: binary
4

Would the conclusion change substantially if based on a much larger sample?
Type: binary

Description

Why It Works

The human mind is designed to extract patterns quickly, which was adaptive in our evolutionary environment but leads us astray with statistical data. We intuitively apply a mental version of the law of large numbers to samples of any size, expecting even tiny samples to mirror the population faithfully.

How to Counter

Recognize that small samples naturally produce extreme and variable results. Demand larger samples before drawing conclusions. Use formal statistical tests that account for sample size. Be especially suspicious of impressive-looking results from very small datasets.

Also Known As

Hasty generalization from small samples Belief in the law of small numbers

Real-World Context

Affects medical decisions (rare case reports driving treatment choices), business strategy (pivoting based on a few customer interactions), and sports (judging player ability from a handful of games).

Related Aspects

Insensitivity to Sample Size Gambler's Fallacy (Representativeness) Base Rate Fallacy Underpowered Study Regression to the Mean Fallacy

Related Aspects

→ correlates with

Insensitivity to Sample Size

The tendency to draw strong conclusions from small samples, failing to recognize that small samples are more variable and less reliable than large ones.

→ correlates with

Gambler's Fallacy (Representativeness)

The mistaken belief that if an event has occurred more frequently than expected in the past, it is less likely to happen in the future (and vice versa), even when events are independent.

→ correlates with

Base Rate Fallacy

Ignoring general statistical base rates in favor of specific individual-case info.

→ correlates with

Underpowered Study

A study with too few participants or observations to reliably detect the effect being investigated. Low statistical power increases both false negatives and the rate at which significant findings are false positives.

→ correlates with

Regression to the Mean Fallacy

Attributing natural fluctuation to a specific intervention.

← correlates with

Illusion of Validity

The tendency to overestimate the accuracy of one's judgments, especially when available information is internally consistent, even if the information is limited or unreliable.

Hierarchical Context

→ is a Statistical Errors

Try it in action

Use these tools to detect, analyze, or train this aspect.

🔍 Text Analyzer

Scan a text for this pattern

⚗️ Argument Lab

Analyze an argument step by step

🎓 Fallacy Trainer

Quiz yourself on this aspect