Regression to the Mean

Also known as: Regression Toward the Mean, RTM Effect

Regression to the mean is a statistical phenomenon in which unusually extreme observations on a first measurement tend to be closer to the average on subsequent measurements, and vice versa, when there is random variation. People often misinterpret this natural drift toward the mean as meaningful improvement, deterioration, or the impact of specific actions, leading to erroneous causal stories.

Cognitive Biases

/ Statistical misinterpretation

12 min read

experimental Evidence


Regression to the Mean: Extreme Events Drifting Back Toward Average

In many areas of life—test scores, athletic performance, investment returns—extreme results are often followed by more ordinary ones. A student who does exceptionally well on one exam may do closer to average on the next; a star athlete’s record-breaking season is usually followed by a less spectacular one. This pattern is known as regression to the mean.

Regression to the mean is a statistical tendency, not a psychological bias by itself. The bias arises when people misinterpret this natural tendency as evidence of real change or the effect of specific actions, leading to faulty causal stories.

Core Idea

When a variable is influenced by both stable factors (like skill or underlying condition) and random variation (luck, noise, measurement error), then:

  • Extremely high observations (far above the average) are likely to be followed by lower, more average values.
  • Extremely low observations are likely to be followed by higher, more average values.

This happens even if nothing substantive has changed in the underlying system.

Why It’s Misleading: Psychological Mechanisms

  1. Causal Storytelling
    People are driven to explain changes. When an extreme result is followed by a more average one, we search for reasons (e.g., new training, punishment, reward), even when the shift is largely due to statistical regression.

  2. Neglect of Random Variation
    It is easy to overlook how much chance and noise contribute to performance. We often assume that every fluctuation must have a meaningful cause.

  3. Salience of Interventions After Extremes
    Interventions (like coaching, bonuses, or punishments) commonly occur right after extreme outcomes. When performance later moves closer to average, people attribute the change to the intervention, not to regression.

  4. Limited Intuition for Probabilistic Processes
    Many people lack an intuitive grasp of how variables distributed around a mean behave over repeated measurements.

Everyday Examples

  • Student Performance: A student with an unusually high test score may receive praise or a reward; their next score, closer to their true average ability, is often lower. This can be misread as "the praise made them complacent."

  • Sports Coaching: An athlete performs terribly in a match and is criticized by their coach. Next time, their performance is closer to average and appears improved, leading to the belief that criticism "worked."

  • Business Metrics: A team’s sales spike one quarter due to a mixture of effort and luck. After management introduces a new process, sales drop closer to the long-term average. The drop may be blamed on the new process, even though regression is playing a role.

  • Medical Symptoms: Patients often seek treatment at the worst point of their symptoms. Because extreme symptoms naturally tend to ease toward average, improvement is often attributed entirely to the treatment.

When Regression to the Mean Matters Most

Regression to the mean is especially important when:

  • Measurements are noisy, with substantial random variation.
  • We focus on extreme cases or outliers (top performers, worst failures, peak symptoms).
  • Interventions are timed to coincide with extremes (e.g., punishments after bad outcomes, rewards after good ones).

Mitigation Strategies

  1. Use Control Groups and Baselines
    In experiments, compare changes in a treated group to a similar untreated group. Both will exhibit regression; differences between them can reveal genuine treatment effects.

  2. Examine Longer-Term Trends
    Look at patterns over many time points, not just before-and-after snapshots around extremes. This helps distinguish regression from systematic change.

  3. Model Noise and Variability
    Use statistical models that account for random variation, measurement error, and regression effects, especially in performance evaluation and clinical studies.

  4. Be Cautious with Stories Around Extremes
    When interpreting dramatic rises or falls, explicitly ask: "How much of this might be regression to the mean?"

Relationship to Other Biases

  • Outcome Bias: Judging decisions solely by outcomes can obscure regression effects and encourage spurious causal attributions.
  • Illusion of Control: Overestimating how much interventions change outcomes, when some observed change is just regression.
  • Post Hoc Ergo Propter Hoc: Assuming that because one event followed another, the first caused the second; regression provides many tempting post hoc stories.

Conclusion

Regression to the mean reminds us that extremes are often followed by moderation, even without any deliberate intervention. Mistaking this statistical pattern for meaningful change can mislead coaches, managers, policymakers, clinicians, and individuals.

Recognizing when regression is likely—especially around extreme results—helps us interpret data more cautiously, design better evaluations, and avoid over-crediting or over-blaming specific actions for shifts that are largely statistical inevitabilities.

Common Triggers

Selection of extreme cases

Short-term before-and-after comparisons

Typical Contexts

Performance evaluation and incentives

Clinical outcomes and treatment evaluation

Sports analytics and player assessment

Business metrics and KPI tracking

Mitigation Strategies

Use randomized or matched comparison groups: Compare changes in targeted individuals to similar others who did not receive the intervention, to separate regression from real effects.

Effectiveness: high

Difficulty: moderate

Communicate expected variability: Educate stakeholders about normal fluctuations and regression effects so they do not overinterpret single extreme data points.

Effectiveness: medium

Difficulty: moderate

Potential Decision Harms

Leaders may adopt or abandon policies based on apparent effects that are actually regression to the mean.

moderate Severity

Treatments may be credited with improvements that would have occurred naturally as extreme symptoms regress toward average.

major Severity


Related Biases

Explore these related cognitive biases to deepen your understanding

Loaded Language

Loaded language (also known as loaded terms or emotive language) is rhetoric used to influence an audience by using words and phrases with strong connotations.

Cognitive Biases

/ Emotive language

Euphemism

A euphemism is a mild or indirect word or expression substituted for one considered to be too harsh or blunt when referring to something unpleasant or embarrassing.

Cognitive Biases

/ Doublespeak (related)

Paradox of Choice

10 min read

The paradox of choice is the idea that having too many options can make decisions harder, reduce satisfaction, and even lead to decision paralysis.

Cognitive Biases / Choice and complexity

/ Choice Overload

Choice Overload Effect

10 min read

The choice overload effect occurs when having too many options makes it harder to decide, reduces satisfaction, or leads people to avoid choosing at all.

Cognitive Biases / Choice and complexity

/ Paradox of Choice

Procrastination

2 min read

Procrastination is the action of unnecessarily and voluntarily delaying or postponing something despite knowing that there will be negative consequences for doing so.

Cognitive Biases

/ Akrasia (weakness of will)

Time-Saving Bias

2 min read

The time-saving bias describes the tendency of people to misestimate the time that could be saved (or lost) when increasing (or decreasing) speed.

Cognitive Biases

/ Time-saving illusion