Simpson’s Paradox

Geng, Zhi

doi:10.1007/978-3-642-04898-2_519

Zhi Geng²

238 Accesses

An association measurement between two variables X and Y may be dramatically changed from positive to negative by omitting a third variable Z, which is called Simpson’s paradox or the Yule-Simpson paradox (Yule, 1903; Simpson, 1951). A numerical example is shown in Table 1. The risk difference (RD) is defined as the difference between the recovery proportion in the treated group and that in the placebo group, RD = (80 ∕ 200) − (100 ∕ 200) = − 0. 10. If the population is split into two populations of male and female, a dramatic change can be seen from Table 2. The risk differences for male and female are both changed to 0. 10. Thus we obtain a self-contradictory conclusion that the new drug is effective for both male and female but it is ineffective for the whole population. Should patients in the population take the new drug or not? Should the correct answer depend on whether the doctor know the gender of patients?

Simpson’s Paradox. Table 1 Simpson’s Paradox. Table 1Recovery...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 1,100.00; Price excludes VAT (USA)

Hardcover Book: USD 549.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References and Further Reading

Chen H, Geng Z, Jia J (2007) Criteria for surrogate endpoints. J R Stat Soc B 69:919–932
MathSciNet Google Scholar
Cox DR, Wermuth N (2003) A general condition for avoiding effect reversal after marginalization. J R Stat Soc B 65:937–941
MathSciNet MATH Google Scholar
Geng Z, Guo J, Fung WK (2002) Criteria for confounders in epidemiological studies. J R Stat Soc B 64:3–15
MathSciNet MATH Google Scholar
Ju C, Geng Z (2010) Criteria for surrogate endpoints based on causal distributions. J R Stat Soc B 72:129–142
MathSciNet Google Scholar
Ma ZM, Xie XC, Geng Z (2006) Collapsibility of distribution dependence. J R Stat Soc B 68:127–133
MathSciNet MATH Google Scholar
Moore T (1995) Deadly medicine: why tens of thousands of patients died in America’s worst drug disaster. Simon & Shuster, New York
Google Scholar
Pearl J (2000) Causality: models, reasoning, and inference. University Press, Cambridge
Google Scholar
Reintjes R, de Boer A, van Pelt W, Mintjes-de Groot J (2000) Simpson’s paradox: an example from hospital epidemiology. Epidemiology 11:81–83
Google Scholar
Simpson EH (1951) The interpretation of interaction in contingency tables. J R Stat Soc B 13:238–241
MATH Google Scholar
Wagner CH (1982) Simpson’s paradox in real life. Am Stat 36:46–48
Google Scholar
Yule GU (1903) Notes on the theory of association of attributes in statistics. Biometrika 2:121–134
Google Scholar

Download references

Author information

Authors and Affiliations

Peking University, Beijing, China
Zhi Geng

Authors

Zhi Geng
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Statistics and Informatics, Faculty of Economics, University of Kragujevac, City of Kragujevac, Serbia
Miodrag Lovric

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Geng, Z. (2011). Simpson’s Paradox. In: Lovric, M. (eds) International Encyclopedia of Statistical Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04898-2_519

Download citation

DOI: https://doi.org/10.1007/978-3-642-04898-2_519
Published: 02 December 2014
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04897-5
Online ISBN: 978-3-642-04898-2
eBook Packages: Mathematics and StatisticsReference Module Computer Science and Engineering

Publish with us

Policies and ethics