Sampling Bias

Also known as: Ascertainment bias

Sampling bias is a specific subtype of selection bias where the method of collecting the sample inherently favors certain types of participants. It means the sample is not a miniature version of the population.

Statistical Biases

2 min read

experimental Evidence


Sampling Bias

The Psychology Behind It

Sampling bias is the "lazy researcher" bias. It is easier to ask your friends, your students, or people on the street corner than it is to design a truly random national sample. We gravitate towards "convenience sampling."

Additionally, certain groups are "hard to reach" (the homeless, the very rich, the busy). If a survey requires a 20-minute phone call, you are sampling "people with 20 minutes of free time who like talking on the phone," not the general public.

Real-World Examples

Self-Selection Bias

Internet polls are notorious for this. "Vote on our website!" Only people who visit that specific website and care enough to click will vote. The results tell you nothing about the wider world.

Pre-Screening Bias

In clinical trials, researchers often exclude patients with other conditions (comorbidities) to make the data cleaner. But in the real world, patients often have multiple conditions. The drug might work in the "clean" sample but fail in the "messy" real world.

Survivorship Sampling

Analyzing the financial performance of current companies ignores those that went bankrupt (a form of survivorship bias that is also a sampling error).

Consequences

Sampling bias can lead to:

  • Echo Chambers: We think everyone agrees with us because we only sample our own social circle.
  • Product Failure: A product tests well in a focus group of loyal fans but flops in the mass market.
  • Medical Harm: Treatments are approved based on young, healthy samples and then cause side effects in the elderly.

How to Mitigate It

Define the population, then sample the population.

  1. Stratified Sampling: Divide the population into groups (age, race, income) and sample randomly from each group to ensure representation.
  2. Oversampling: Intentionally sample more small groups (minorities) to ensure you have enough data to analyze them.
  3. Response Rate Analysis: If only 10% of people answered your survey, analyze the 90% who didn't. Are they different?

Conclusion

A cup of water from the ocean tells you about the ocean only if you stirred the ocean first. Sampling bias is what happens when you dip your cup in a stagnant pool and think you know the sea.

Mitigation Strategies

Convenience Sampling Awareness: Always label convenience samples as such. Never claim they represent the population.

Effectiveness: medium

Difficulty: moderate

Potential Decision Harms

Health studies often exclude women or minorities, leading to medical guidelines that are dangerous for those groups.

critical Severity

A restaurant changes its menu based on feedback from a few loud regulars, alienating the broader customer base.

moderate Severity

Key Research Studies

The WEIRDest people in the world?

Henrich, J., Heine, S. J., & Norenzayan, A. (2010) Behavioral and Brain Sciences

Highlighted the extreme sampling bias in psychology, where 96% of subjects come from Western, Educated, Industrialized, Rich, and Democratic societies.

Read Study →


Related Biases

Explore these related cognitive biases to deepen your understanding

Neglect of Probability

2 min read

Neglect of probability is the tendency to completely disregard probability when making a decision under uncertainty.

Statistical Biases

/ Probability blindness

Ludic Fallacy

2 min read

The ludic fallacy is the misuse of games to model real-life situations.

Statistical Biases

/ Gaming fallacy

Selection Bias

2 min read

Selection bias is the bias introduced by the selection of individuals, groups or data for analysis in such a way that proper randomization is not achieved.

Statistical Biases

/ Sampling bias (related)

Survivorship Bias

2 min read

Survivorship bias is the logical error of concentrating on the people or things that made it past some selection process and overlooking those that did not, typically because of their lack of visibility.

Statistical Biases

/ Survival bias

Texas Sharpshooter Fallacy

2 min read

The Texas sharpshooter fallacy is an informal fallacy which is committed when differences in data are ignored, but similarities are overemphasized. From this reasoning, a false conclusion is inferred.

Statistical Biases

/ Clustering illusion (related)

Pareidolia

2 min read

Pareidolia is a specific form of apophenia involving the perception of images or sounds in random stimuli, such as seeing faces in clouds.

Statistical Biases

/ Face pareidolia