Category

Statistical Biases

Impact level

3 / 5

Last updated

Nov 2025

Category Statistical Biases

Impact 3 / 5

STATISTICAL BIASES

Sampling
Bias

Sampling bias is a specific subtype of selection bias where the method of collecting the sample inherently favors certain types of participants. It means the sample is not a miniature version of the population.

Also known as: Ascertainment bias

01

Overview

Sampling Bias

The Psychology Behind It

Sampling bias is the "lazy researcher" bias. It is easier to ask your friends, your students, or people on the street corner than it is to design a truly random national sample. We gravitate towards "convenience sampling."

Additionally, certain groups are "hard to reach" (the homeless, the very rich, the busy). If a survey requires a 20-minute phone call, you are sampling "people with 20 minutes of free time who like talking on the phone," not the general public.

Real-World Examples

Self-Selection Bias

Internet polls are notorious for this. "Vote on our website!" Only people who visit that specific website and care enough to click will vote. The results tell you nothing about the wider world.

Pre-Screening Bias

In clinical trials, researchers often exclude patients with other conditions (comorbidities) to make the data cleaner. But in the real world, patients often have multiple conditions. The drug might work in the "clean" sample but fail in the "messy" real world.

Survivorship Sampling

Analyzing the financial performance of current companies ignores those that went bankrupt (a form of survivorship bias that is also a sampling error).

Consequences

Sampling bias can lead to:

  • Echo Chambers: We think everyone agrees with us because we only sample our own social circle.
  • Product Failure: A product tests well in a focus group of loyal fans but flops in the mass market.
  • Medical Harm: Treatments are approved based on young, healthy samples and then cause side effects in the elderly.

How to Mitigate It

Define the population, then sample the population.

  1. Stratified Sampling: Divide the population into groups (age, race, income) and sample randomly from each group to ensure representation.
  2. Oversampling: Intentionally sample more small groups (minorities) to ensure you have enough data to analyze them.
  3. Response Rate Analysis: If only 10% of people answered your survey, analyze the 90% who didn't. Are they different?

Conclusion

A cup of water from the ocean tells you about the ocean only if you stirred the ocean first. Sampling bias is what happens when you dip your cup in a stagnant pool and think you know the sea.

Cognitive processing

System 2 (deliberate). Biases often lean on quick judgments (System 1) unless you slow down and analyze (System 2).

Evidence & time

Evidence strength: experimental. Typical read: about 2 min.

02

Mitigation strategies

Convenience Sampling Awareness: Always label convenience samples as such. Never claim they represent the population.

Effectiveness: medium

Difficulty: moderate

03

Potential decision harms

Health studies often exclude women or minorities, leading to medical guidelines that are dangerous for those groups.

critical Severity

A restaurant changes its menu based on feedback from a few loud regulars, alienating the broader customer base.

moderate Severity

04

Key research studies

The WEIRDest people in the world?

Henrich, J., Heine, S. J., & Norenzayan, A. (2010) Behavioral and Brain Sciences

Highlighted the extreme sampling bias in psychology, where 96% of subjects come from Western, Educated, Industrialized, Rich, and Democratic societies.

Read Study →

Tags