Balancing Performance Measures in Classification Using Ensemble Learning Methods

  • Conference paper
Business Information Systems (BIS 2019)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 354))

Abstract

Ensemble learning methods have recently come into wide use across domains and applications, owing to improvements in computational efficiency and advances in distributed computing. However, as machine learning techniques are increasingly applied to class-imbalance problems, more attention is needed to evaluating, improving, and balancing performance measures other than accuracy, such as sensitivity (true positive rate) and specificity (true negative rate). This paper demonstrates an approach to evaluating and balancing these two measures using ensemble learning methods for classification, which can be especially useful on class-imbalanced datasets. Specifically, bagging and boosting are used to balance sensitivity and specificity on a diabetes dataset, where the task is to predict from a patient's features whether the patient will be readmitted to the hospital. The experiments indicate empirically that ensemble learning methods improve accuracy by some margin and, more notably, balance sensitivity and specificity significantly and consistently across different cross-validation approaches.
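
To make the measures named in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It computes sensitivity and specificity from confusion counts and applies a simple majority vote, which is one common way bagging-style ensembles combine base classifiers. The labels, the three base-model prediction lists, and all function names are hypothetical and chosen only for illustration; they are not taken from the paper or its dataset.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fn, fp, tn) for a binary classification task."""
    tp = fn = fp = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    return tp, fn, fp, tn


def sensitivity(tp, fn):
    """True positive rate: share of actual positives predicted positive."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def specificity(tn, fp):
    """True negative rate: share of actual negatives predicted negative."""
    return tn / (tn + fp) if (tn + fp) else 0.0


def accuracy(tp, fn, fp, tn):
    """Share of all samples classified correctly."""
    total = tp + fn + fp + tn
    return (tp + tn) / total if total else 0.0


def majority_vote(predictions):
    """Combine several models' prediction lists by per-sample majority vote."""
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*predictions)]


# Hypothetical imbalanced labels: 2 positives out of 10 samples.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]

# A trivial baseline that always predicts the majority (negative) class.
baseline = [0] * len(y_true)

# Hypothetical predictions from three base classifiers (illustrative only).
model_a = [1, 0, 0, 1, 0, 0, 0, 0, 1, 0]
model_b = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
model_c = [0, 1, 1, 0, 0, 0, 0, 0, 0, 0]
ensemble = majority_vote([model_a, model_b, model_c])

for name, preds in [("baseline", baseline), ("ensemble", ensemble)]:
    tp, fn, fp, tn = confusion_counts(y_true, preds)
    print(
        name,
        "accuracy=%.3f" % accuracy(tp, fn, fp, tn),
        "sensitivity=%.3f" % sensitivity(tp, fn),
        "specificity=%.3f" % specificity(tn, fp),
    )
```

In this toy data the always-negative baseline reaches high accuracy while detecting none of the positive cases, which is why accuracy alone can mislead on imbalanced data. The voted predictions trade a small loss in specificity for a large gain in sensitivity. Real results depend entirely on the base learners and the data.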

Author information

Correspondence to Ajay Bansal.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Bahl, N., Bansal, A. (2019). Balancing Performance Measures in Classification Using Ensemble Learning Methods. In: Abramowicz, W., Corchuelo, R. (eds) Business Information Systems. BIS 2019. Lecture Notes in Business Information Processing, vol 354. Springer, Cham. https://doi.org/10.1007/978-3-030-20482-2_25

  • DOI: https://doi.org/10.1007/978-3-030-20482-2_25

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-20481-5

  • Online ISBN: 978-3-030-20482-2

  • eBook Packages: Computer Science, Computer Science (R0)
