Skip to main content

Sensitivity, Specificity and Prioritization of Gene Set Analysis When Applying Different Ranking Metrics

  • Conference paper
  • First Online:
10th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2016)

Abstract

Microarrays were a trigger to develop new methods which can allow to estimate disturbances in signal cascades, characterized by sets of genes, in various biological conditions. Existing approaches of gene set analysis take information if genes are differentially expressed or are based on some gene ranking. The most commonly used method is Gene Set Enrichment Analysis (GSEA), where an assumption of uniform distribution of genes in some gene set is tested by weighted Kolmogorov-Smirnov test. Many studies present different gene set analysis methods and their comparison, however none of them focus on basic but crucial parameters, like the rank metric. In this paper we compare nine ranking metrics in terms of sensitivity, specificity and prioritization of identification of functional gene sets using a collection of 34 annotated microarray datasets. We show that absolute value of default GSEA measure is the best ranking metric, while the Baumgartner-Weiss-Schindler test statistic is the best statistical-based metrics, which can be used in Gene Set Enrichment Analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Marczyk, M., Jaksik, R., Polanski, A., Polanska, J.: Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition. BMC Bioinformatics 14(1), 101 (2013)

    Article  Google Scholar 

  2. Tavazoie, S., Hughes, J.D., Campbell, M.J., Cho, R.J., Church, G.M.: Systematic determination of genetic network architecture. Nature Genetics 22(3), 281–285 (1999)

    Article  Google Scholar 

  3. Subramanian, A., et al.: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. PNAS 102(43), 15545–15550 (2005)

    Article  Google Scholar 

  4. Tarca, A.L., Draghici, S., Bhatti, G., Romero, R.: Down-weighting overlapping genes improves gene set analysis. BMC Bioinformatics 13, 136 (2012)

    Article  Google Scholar 

  5. Wu, D., Smyth, G.K.: Camera: a competitive gene set test accounting for inter-gene correlation. Nucleic Acids Research 40(17), e133 (2012)

    Article  Google Scholar 

  6. Rahnenführer, J., Domingues, F. S., Maydt, J., Lengauer, T.: Calculating the statistical significance of changes in pathway activity from gene expression data. Statistical Applications in Genetics and Molecular Biology 3(1) (2004)

    Google Scholar 

  7. Shojaie, A., Michailidis, G.: Network enrichment analysis in complex experiments. Statistical Applications in Genetics and Molecular Biology 9(1) (2010)

    Google Scholar 

  8. Hung, J.-H., et al.: Gene set enrichment analysis: performance evaluation and usage guidelines. Briefings in Bioinformatics 13(3), 281–291 (2012)

    Article  Google Scholar 

  9. Maciejewski, H.: Gene set analysis methods: statistical models and methodological differences. Briefings in Bioinformatics 15(4), 504–518 (2014)

    Article  MathSciNet  Google Scholar 

  10. Khatri, P., Sirota, M., Butte, A.J.: Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Computational Biology 8(2), e1002375 (2012)

    Article  Google Scholar 

  11. Tarca, A.L., Bhatti, G., Romero, R.: A comparison of gene set analysis methods in terms of sensitivity, prioritization and specificity. PLoS One 8(11), e79217 (2013)

    Article  Google Scholar 

  12. Kanehisa, M., et al.: KEGG as a reference resource for gene and protein annotation. Nucleic Acids Research 44(D1), D457–D462 (2016)

    Article  Google Scholar 

  13. Bayá, A.E., Larese, M.G., Granitto, P.M., Gómez, J.C., Tapia, E.: Gene set enrichment analysis using non-parametric scores. In: Advances in Bioinformatics and Computational Biology, pp. 12–21. Springer, Heidelberg (2007)

    Google Scholar 

  14. Neuhäuser, M.: An exact two-sample test based on the Baumgartner-Weiß-Schindler statistic and a modification of Lepage’s test. Communications in Statistics-Theory and Methods 29(1), 67–78 (2000)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joanna Zyla .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Zyla, J., Marczyk, M., Polanska, J. (2016). Sensitivity, Specificity and Prioritization of Gene Set Analysis When Applying Different Ranking Metrics. In: Saberi Mohamad, M., Rocha, M., Fdez-Riverola, F., Domínguez Mayo, F., De Paz, J. (eds) 10th International Conference on Practical Applications of Computational Biology & Bioinformatics. PACBB 2016. Advances in Intelligent Systems and Computing, vol 477. Springer, Cham. https://doi.org/10.1007/978-3-319-40126-3_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-40126-3_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-40125-6

  • Online ISBN: 978-3-319-40126-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics