Apps

🧪 This platform is in early beta. Features may change and you might encounter bugs. We appreciate your patience!

P-Value Misinterpretation

Also Known As: Significance misinterpretation NHST misuse
Aspect ID: p_value_misinterpretation

Definition

The p-value is the probability of observing data at least as extreme as the data obtained, given that the null hypothesis is true. It is not the probability that the results are due to chance, not the probability that the null hypothesis is true, and not the probability that the findings will replicate. Surveys consistently show that over 60% of scientists hold at least one of these incorrect interpretations.

Examples

A researcher finds p = 0.03 and concludes 'there is a 3% chance this result is a fluke.' But p = 0.03 means: if the null were true, 3% of studies run this way would produce results this extreme. It says nothing about the probability that the null is true in this specific case.

A social media post about a new diet study announces: 'Scientists proved the diet works — only a 1% chance the results are random chance (p = 0.01)!' In reality, p = 0.01 means that if the diet had no effect whatsoever, there would be a 1% chance of seeing data this extreme. It says nothing about the probability that the diet is effective or that the null hypothesis is true.

A product manager reviews an A/B test showing the new website design outperforms the old one at p = 0.04 and tells the team: 'There's a 96% chance our new design is genuinely better.' This conflates the p-value with the probability that the alternative hypothesis is true — p = 0.04 only describes how surprising the data would be under the null, not the posterior probability that the design improvement is real.

Verification Steps
Verification Steps
Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.
Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

Binary (yes/no) questions an LLM must answer to identify this aspect:

  1. 1

    Is p<0.05 being interpreted as the probability that results are due to chance?

    Type: binary
  2. 2

    Is statistical significance being equated with practical or clinical significance?

    Type: binary
  3. 3

    Is p>0.05 being interpreted as evidence of no effect?

    Type: binary
  4. 4

    Is the p-value being used without reporting effect sizes and confidence intervals?

    Type: binary
Deep Dive
The expandable detail section on each aspect page with examples, psychology, and counter-strategies.
The Deep Dive section provides in-depth information about each aspect: a real-world example showing the pattern in action, an explanation of why it works psychologically, practical advice on how to counter it, alternative names, and links to related aspects.