An association measurement between two variables X and Y may be dramatically changed from positive to negative by omitting a third variable Z, which is called Simpson’s paradox or the Yule-Simpson paradox (Yule, 1903; Simpson, 1951). A numerical example is shown in Table 1. The risk difference (RD) is defined as the difference between the recovery proportion in the treated group and that in the placebo group, RD = (80 ∕ 200) − (100 ∕ 200) = − 0. 10. If the population is split into two populations of male and female, a dramatic change can be seen from Table 2. The risk differences for male and female are both changed to 0. 10. Thus we obtain a self-contradictory conclusion that the new drug is effective for both male and female but it is ineffective for the whole population. Should patients in the population take the new drug or not? Should the correct answer depend on whether the doctor know the gender of patients?
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References and Further Reading
Chen H, Geng Z, Jia J (2007) Criteria for surrogate endpoints. J R Stat Soc B 69:919–932
Cox DR, Wermuth N (2003) A general condition for avoiding effect reversal after marginalization. J R Stat Soc B 65:937–941
Geng Z, Guo J, Fung WK (2002) Criteria for confounders in epidemiological studies. J R Stat Soc B 64:3–15
Ju C, Geng Z (2010) Criteria for surrogate endpoints based on causal distributions. J R Stat Soc B 72:129–142
Ma ZM, Xie XC, Geng Z (2006) Collapsibility of distribution dependence. J R Stat Soc B 68:127–133
Moore T (1995) Deadly medicine: why tens of thousands of patients died in America’s worst drug disaster. Simon & Shuster, New York
Pearl J (2000) Causality: models, reasoning, and inference. University Press, Cambridge
Reintjes R, de Boer A, van Pelt W, Mintjes-de Groot J (2000) Simpson’s paradox: an example from hospital epidemiology. Epidemiology 11:81–83
Simpson EH (1951) The interpretation of interaction in contingency tables. J R Stat Soc B 13:238–241
Wagner CH (1982) Simpson’s paradox in real life. Am Stat 36:46–48
Yule GU (1903) Notes on the theory of association of attributes in statistics. Biometrika 2:121–134
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this entry
Cite this entry
Geng, Z. (2011). Simpson’s Paradox. In: Lovric, M. (eds) International Encyclopedia of Statistical Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04898-2_519
Download citation
DOI: https://doi.org/10.1007/978-3-642-04898-2_519
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04897-5
Online ISBN: 978-3-642-04898-2
eBook Packages: Mathematics and StatisticsReference Module Computer Science and Engineering