Introduction
A significance test is a statistical procedure for testing a hypothesis based on experimental or observational data. Let, for example, \(\bar{{X}}_{1}\) and \(\bar{{X}}_{2}\) be the average scores obtained in two groups of randomly selected subjects and let μ 1 and μ 2 denote the corresponding population averages. The observed averages can be used to test the null hypothesis μ 1 = μ 2, which expresses the idea that both populations have equal average scores. A significant result occurs if \(\bar{{X}}_{1}\) and \(\bar{{X}}_{2}\) are very different from each other, because this contradicts or falsifies the null hypothesis. If the two group averages are similar to each other, the null hypothesis is not contradicted by the data. What exact values of the difference \(\bar{{X}}_{1} -\bar{ {X}}_{2}\)of the group averages are judged as significant depends on various elements. The variation of the scores between the subjects, for example, must be taken into account. This...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References and Further Reading
Berger JO (2003) Could Fisher, Jeffreys and Neyman have agreed on testing? Stat Sci 18(1):1–32, With discussion
Dollinger MB, Kulinskaya E, Staudte RG (1996) Fuzzy hypothesis tests and confidence intervals. In: Dowe DL, Korb KB, Oliver JJ (eds) Information, statistics and induction in science. World Scientific, Singapore, pp 119–128
Fisher RA (1990) Statistical methods, experimental design and scientific inference. Oxford University Press, Oxford. Reprints of Fisher’s main books, first printed in 1925, 1935 and 1956, respectively. The 14th edition of the first book was printed in 1973
Geyer C, Meeden G (2006) Fuzzy confidence intervals and p-values (with discussion). Stat Sci 20:258–387
Hubbard R, Bayarri MJ (2003) Confusion over measures of evidence (p’s) versus errors (a’s) in classical statistical testing. Am Stat 57(3):171–182, with discussion
Kass RE, Raftery AE (1995) Bayes factors. J Am Stat Assoc 90: 773–795
Krantz D (1999) The null hypothesis testing controversy in psychology. J Am Stat Assoc 94(448):1372–1381
Kulinskaya E, Morgenthaler S, Staudte RG (2008) Meta analysis: a guide to calibrating and combining statistical evidence. Wiley, Chichester, www.wiley.com/go/meta_analysis
Lavine M, Schervish M (1999) Bayes factors: what they are and what they are not. Am Stat 53:119–122
Lehmann EL (1986) Testing statistical hypotheses, 2nd edn. Wiley, New York
Lehmann EL (1993) The Fisher, Neyman-Pearson theories of testing hypotheses: one theory or two? J Am Stat Assoc 88:1242–1249
Neyman J (1935) On the problem of confidence intervals. Ann Math Stat 6:111–116
Neyman J (1950) First course in probability and statistics. Henry Holt, New York
Neyman J, Pearson ES (1928) On the use and interpretation of certain test criteria for purposes of statistical inference. Biometrika 20A:175–240 and 263–294
Neyman J, Pearson ES (1933) On the problem of the most efficient tests of statistical hypotheses. Philos Trans R Soc A 231:289–337
Welch BL (1938) The significance of the difference between two means when the variances are unequal. Biometrika 29:350–361
Wellek S (2003) Testing statistical hypotheses of equivalence. Chapman & Hall/CRC Press, New York
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this entry
Cite this entry
Kulinskaya, E., Morgenthaler, S., Staudte, R.G. (2011). Significance Testing: An Overview. In: Lovric, M. (eds) International Encyclopedia of Statistical Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04898-2_514
Download citation
DOI: https://doi.org/10.1007/978-3-642-04898-2_514
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04897-5
Online ISBN: 978-3-642-04898-2
eBook Packages: Mathematics and StatisticsReference Module Computer Science and Engineering