Skip to main content

False Discovery Rate

  • Reference work entry
  • First Online:
International Encyclopedia of Statistical Science

Multiple Hypothesis Testing

In hypothesis testing, statistical significance is typically based on calculations involving p-values and Type I error rates. A p-value calculated from a single statistical hypothesis test can be used to determine whether there is statistically significant evidence against the null hypothesis. The upper threshold applied to the p-value in making this determination (often 5% in the scientific literature) determines the Type I error rate; i.e., the probability of making a Type I error when the null hypothesis is true. Multiple hypothesis testing is concerned with testing several statistical hypotheses simultaneously. Defining statistical significance is a more complex problem in this setting.

A longstanding definition of statistical significance for multiple hypothesis tests involves the probability of making one or more Type I errors among the family of hypothesis tests, called the family-wise error rate. However, there exist other well established...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References and Further Reading

  • Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B 85:289–300

    MathSciNet  Google Scholar 

  • Benjamini Y, Liu W (1999) A step-down multiple hypothesis procedure that controls the false discovery rate under independence. J Stat Plann Infer 82:163–170

    MATH  MathSciNet  Google Scholar 

  • Benjamini Y, Yekutieli D (2001) The control of the false discovery rate in multiple testing under dependency. Ann Stat 29: 1165–1188

    MATH  MathSciNet  Google Scholar 

  • Devlin B, Roeder K (1999) Genomic control for association studies. Biometrics 55:997–1004

    MATH  Google Scholar 

  • Efron B (2004) Large-scale simultaneous hypothesis testing: the choice of a null hypothesis. J Am Stat Assoc 99:96–104

    MATH  MathSciNet  Google Scholar 

  • Efron B, Tibshirani R, Storey JD, Tusher V (2001) Empirical Bayes analysis of a microarray experiment. J Am Stat Assoc 96: 1151–1160

    MATH  MathSciNet  Google Scholar 

  • Leek JT, Storey JD (2007) Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS Genet 3:e161

    Google Scholar 

  • Leek JT, Storey JD (2008) A general framework for multiple testing dependence. Proc Natl Acad Sci 105:18718–18723

    Google Scholar 

  • Lindsay BG, Kettenring J, Siegmund DO (2004) A report on the future of statistics. Stat Sci 19:387–407

    MATH  MathSciNet  Google Scholar 

  • Morton NE (1955) Sequential tests for the detection of linkage. Am J Hum Genet 7:277–318

    Google Scholar 

  • Shaffer J (1995) Multiple hypothesis testing. Ann Rev Psychol 46:561–584

    Google Scholar 

  • Simes RJ (1986) An improved Bonferroni procedure for multiple tests of significance. Biometrika 73:751–754

    MATH  MathSciNet  Google Scholar 

  • Soric B (1989) Statistical discoveries and effect-size estimation. J Am Stat Assoc 84:608–610

    Google Scholar 

  • Storey JD (2001) The positive false discovery rate: a Bayesian interpretation and the q-value. Technical Report 2001–2012, Department of Statistics, Stanford University

    Google Scholar 

  • Storey JD (2002) A direct approach to false discovery rates. J R Stat Soc Ser B 64:479–498

    MATH  MathSciNet  Google Scholar 

  • Storey JD (2003) The positive false discovery rate: a Bayesian interpretation and the q-value. Ann Stat 31:2013–2035

    MATH  MathSciNet  Google Scholar 

  • Storey JD (2007) The optimal discovery procedure: a new approach to simultaneous significance testing. J R Stat Soc Ser B 69: 347–368

    MathSciNet  Google Scholar 

  • Storey JD, Dai JY, Leek JT (2007) The optimal discovery procedure for large-scale significance testing, with applications to comparative microarray experiments. Biostatistics 8:414–432

    MATH  Google Scholar 

  • Storey JD, Taylor JE, Siegmund D (2004) Strong control, conservative point estimation, and simultaneous conservative consistency of false discovery rates: a unified approach. J R Stat Soc Ser B 66:187–205

    MATH  MathSciNet  Google Scholar 

  • Storey JD, Tibshirani R (2003) Statistical significance for genome-wide studies. Proc Natl Acad Sci 100:9440–9445

    MATH  MathSciNet  Google Scholar 

  • Zaykin DV, Young SS, Westfall PH (1998) Using the false discovery approach in the genetic dissection of complex traits: a response to weller et al. Genetics 150:1917–1918

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this entry

Cite this entry

Storey, J.D. (2011). False Discovery Rate. In: Lovric, M. (eds) International Encyclopedia of Statistical Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04898-2_248

Download citation

Publish with us

Policies and ethics