Skip to main content

Significance Testing: An Overview

  • Reference work entry
  • First Online:
International Encyclopedia of Statistical Science

Introduction

A significance test is a statistical procedure for testing a hypothesis based on experimental or observational data. Let, for example, \(\bar{{X}}_{1}\) and \(\bar{{X}}_{2}\) be the average scores obtained in two groups of randomly selected subjects and let μ 1 and μ 2 denote the corresponding population averages. The observed averages can be used to test the null hypothesis μ 1 = μ 2, which expresses the idea that both populations have equal average scores. A significant result occurs if \(\bar{{X}}_{1}\) and \(\bar{{X}}_{2}\) are very different from each other, because this contradicts or falsifies the null hypothesis. If the two group averages are similar to each other, the null hypothesis is not contradicted by the data. What exact values of the difference \(\bar{{X}}_{1} -\bar{ {X}}_{2}\)of the group averages are judged as significant depends on various elements. The variation of the scores between the subjects, for example, must be taken into account. This...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References and Further Reading

  • Berger JO (2003) Could Fisher, Jeffreys and Neyman have agreed on testing? Stat Sci 18(1):1–32, With discussion

    Google Scholar 

  • Dollinger MB, Kulinskaya E, Staudte RG (1996) Fuzzy hypothesis tests and confidence intervals. In: Dowe DL, Korb KB, Oliver JJ (eds) Information, statistics and induction in science. World Scientific, Singapore, pp 119–128

    Google Scholar 

  • Fisher RA (1990) Statistical methods, experimental design and scientific inference. Oxford University Press, Oxford. Reprints of Fisher’s main books, first printed in 1925, 1935 and 1956, respectively. The 14th edition of the first book was printed in 1973

    Google Scholar 

  • Geyer C, Meeden G (2006) Fuzzy confidence intervals and p-values (with discussion). Stat Sci 20:258–387

    MathSciNet  Google Scholar 

  • Hubbard R, Bayarri MJ (2003) Confusion over measures of evidence (p’s) versus errors (a’s) in classical statistical testing. Am Stat 57(3):171–182, with discussion

    Google Scholar 

  • Kass RE, Raftery AE (1995) Bayes factors. J Am Stat Assoc 90: 773–795

    MATH  Google Scholar 

  • Krantz D (1999) The null hypothesis testing controversy in psychology. J Am Stat Assoc 94(448):1372–1381

    Google Scholar 

  • Kulinskaya E, Morgenthaler S, Staudte RG (2008) Meta analysis: a guide to calibrating and combining statistical evidence. Wiley, Chichester, www.wiley.com/go/meta_analysis

  • Lavine M, Schervish M (1999) Bayes factors: what they are and what they are not. Am Stat 53:119–122

    MathSciNet  Google Scholar 

  • Lehmann EL (1986) Testing statistical hypotheses, 2nd edn. Wiley, New York

    MATH  Google Scholar 

  • Lehmann EL (1993) The Fisher, Neyman-Pearson theories of testing hypotheses: one theory or two? J Am Stat Assoc 88:1242–1249

    MATH  Google Scholar 

  • Neyman J (1935) On the problem of confidence intervals. Ann Math Stat 6:111–116

    Google Scholar 

  • Neyman J (1950) First course in probability and statistics. Henry Holt, New York

    Google Scholar 

  • Neyman J, Pearson ES (1928) On the use and interpretation of certain test criteria for purposes of statistical inference. Biometrika 20A:175–240 and 263–294

    Google Scholar 

  • Neyman J, Pearson ES (1933) On the problem of the most efficient tests of statistical hypotheses. Philos Trans R Soc A 231:289–337

    Google Scholar 

  • Welch BL (1938) The significance of the difference between two means when the variances are unequal. Biometrika 29:350–361

    MATH  Google Scholar 

  • Wellek S (2003) Testing statistical hypotheses of equivalence. Chapman & Hall/CRC Press, New York

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this entry

Cite this entry

Kulinskaya, E., Morgenthaler, S., Staudte, R.G. (2011). Significance Testing: An Overview. In: Lovric, M. (eds) International Encyclopedia of Statistical Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04898-2_514

Download citation

Publish with us

Policies and ethics