Apps

🧪 This platform is in early beta. Features may change and you might encounter bugs. We appreciate your patience!

Spectrum Bias

Also Known As: Case-Mix Bias Spectrum Effect
Statistical Error ID: spectrum_bias

Definition

Spectrum bias occurs when the accuracy of a diagnostic test is evaluated using a patient population that does not reflect the full range of disease severity encountered in practice. Tests often perform best when distinguishing severe disease from healthy controls, but perform poorly in the clinically relevant middle ground where the diagnosis is uncertain.

Examples

A blood test for a liver disease is validated by comparing patients with advanced liver failure to perfectly healthy volunteers, achieving 98% accuracy. When used in a primary care setting on patients with mild symptoms, accuracy drops to 60% because the test cannot distinguish early-stage disease from other minor conditions.

A rapid strep throat test is evaluated by testing children admitted to hospital with severe, culture-confirmed streptococcal infections against children with no symptoms at all, yielding 97% sensitivity. In a school nurse's office, where most children have mild sore throats from viral infections, the test performs far worse because the signal is much harder to distinguish from noise.

A machine-learning skin cancer screening tool is trained and validated on images of obvious melanomas confirmed by biopsy versus clearly benign moles, achieving near-perfect accuracy. When deployed in a dermatology clinic where most referrals involve ambiguous, borderline lesions, accuracy drops sharply because the tool was never tested on the difficult middle-ground cases it now encounters most often.

Verification Steps
Verification Steps
Binary yes/no questions that an AI must answer to detect a reasoning pattern in a text.
Each of the 452 aspects has verification steps — simple yes/no questions designed to systematically detect whether a pattern appears in a text. For ad hominem: "Does the argument attack a person rather than their claim?" For false dichotomy: "Are only two options presented when more exist?" This ensures consistent, reproducible analysis.

Binary (yes/no) questions an LLM must answer to identify this aspect:

  1. 1

    Was the diagnostic test evaluated on a sample that includes a narrow range of disease severity?

    Type: binary
  2. 2

    Does the study population differ from the population where the test will actually be used?

    Type: binary
  3. 3

    Could the test's sensitivity or specificity change across different patient subgroups?

    Type: binary
  4. 4

    Are the test's performance metrics presented as universal without noting population-specific limitations?

    Type: binary
Deep Dive
The expandable detail section on each aspect page with examples, psychology, and counter-strategies.
The Deep Dive section provides in-depth information about each aspect: a real-world example showing the pattern in action, an explanation of why it works psychologically, practical advice on how to counter it, alternative names, and links to related aspects.

Hierarchical Context