Skip to main content

Simpson’s Paradox

  • Reference work entry
  • First Online:
International Encyclopedia of Statistical Science
  • 238 Accesses

An association measurement between two variables X and Y may be dramatically changed from positive to negative by omitting a third variable Z, which is called Simpson’s paradox or the Yule-Simpson paradox (Yule, 1903; Simpson, 1951). A numerical example is shown in Table 1. The risk difference (RD) is defined as the difference between the recovery proportion in the treated group and that in the placebo group, RD = (80 ∕ 200) − (100 ∕ 200) = − 0. 10. If the population is split into two populations of male and female, a dramatic change can be seen from Table 2. The risk differences for male and female are both changed to 0. 10. Thus we obtain a self-contradictory conclusion that the new drug is effective for both male and female but it is ineffective for the whole population. Should patients in the population take the new drug or not? Should the correct answer depend on whether the doctor know the gender of patients?

Simpson’s Paradox. Table 1 Simpson’s Paradox. Table 1Recovery...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 1,100.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References and Further Reading

  • Chen H, Geng Z, Jia J (2007) Criteria for surrogate endpoints. J R Stat Soc B 69:919–932

    MathSciNet  Google Scholar 

  • Cox DR, Wermuth N (2003) A general condition for avoiding effect reversal after marginalization. J R Stat Soc B 65:937–941

    MathSciNet  MATH  Google Scholar 

  • Geng Z, Guo J, Fung WK (2002) Criteria for confounders in epidemiological studies. J R Stat Soc B 64:3–15

    MathSciNet  MATH  Google Scholar 

  • Ju C, Geng Z (2010) Criteria for surrogate endpoints based on causal distributions. J R Stat Soc B 72:129–142

    MathSciNet  Google Scholar 

  • Ma ZM, Xie XC, Geng Z (2006) Collapsibility of distribution dependence. J R Stat Soc B 68:127–133

    MathSciNet  MATH  Google Scholar 

  • Moore T (1995) Deadly medicine: why tens of thousands of patients died in America’s worst drug disaster. Simon & Shuster, New York

    Google Scholar 

  • Pearl J (2000) Causality: models, reasoning, and inference. University Press, Cambridge

    Google Scholar 

  • Reintjes R, de Boer A, van Pelt W, Mintjes-de Groot J (2000) Simpson’s paradox: an example from hospital epidemiology. Epidemiology 11:81–83

    Google Scholar 

  • Simpson EH (1951) The interpretation of interaction in contingency tables. J R Stat Soc B 13:238–241

    MATH  Google Scholar 

  • Wagner CH (1982) Simpson’s paradox in real life. Am Stat 36:46–48

    Google Scholar 

  • Yule GU (1903) Notes on the theory of association of attributes in statistics. Biometrika 2:121–134

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this entry

Cite this entry

Geng, Z. (2011). Simpson’s Paradox. In: Lovric, M. (eds) International Encyclopedia of Statistical Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04898-2_519

Download citation

Publish with us

Policies and ethics