Classification and Prediction Analysis of Diseases and Other Datasets Using Machine Learning

  • Conference paper
Intelligent Technologies and Applications (INTAP 2019)

Abstract

Classification is one of the most widely used machine learning techniques, especially for predicting everyday phenomena. Its first step is grouping, dividing, categorizing, and separating datasets based on feature vectors. Many classification algorithms exist, including Random Forest, Naïve Bayes, Decision Tree, and Support Vector Machine. For each technique, a model is first created and then trained on the dataset. The model generated by the learning algorithm must both fit the input data and predict the class labels of records. Many models are available for predicting the class label of unknown records. In this paper, different classifiers, namely Linear SVM, Ensemble, and Decision Tree, are applied to several datasets, and their accuracy and execution time are analyzed. The Liver Patient, Wine Quality, Breast Cancer, and Bupa Liver Disorder datasets are used to measure performance and accuracy with 10-fold cross-validation. Finally, the results of all applied algorithms are compared in terms of accuracy and execution time.
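The evaluation protocol described in the abstract (10-fold cross-validation, reporting accuracy and execution time) can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the `NearestCentroid` classifier is a simple stand-in for the paper's Linear SVM, Ensemble, and Decision Tree models, the data are synthetic rather than any of the four datasets, and the helper names `k_fold_indices` and `cross_validate` are hypothetical.

```python
import random
import time
from statistics import mean


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle the sample indices once and deal them into k disjoint folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


class NearestCentroid:
    """Stand-in classifier (an assumption, not one of the paper's models)."""

    def fit(self, X, y):
        sums, counts = {}, {}
        for row, label in zip(X, y):
            acc = sums.setdefault(label, [0.0] * len(row))
            for j, value in enumerate(row):
                acc[j] += value
            counts[label] = counts.get(label, 0) + 1
        # One mean vector per class.
        self.centroids = {c: [s / counts[c] for s in sums[c]] for c in sums}
        return self

    def predict(self, X):
        def closest(row):
            return min(
                self.centroids,
                key=lambda c: sum(
                    (a - b) ** 2 for a, b in zip(row, self.centroids[c])
                ),
            )
        return [closest(row) for row in X]


def cross_validate(make_model, X, y, k=10, seed=0):
    """Return (mean accuracy over k folds, total elapsed seconds)."""
    folds = k_fold_indices(len(X), k, seed)
    accuracies = []
    start = time.perf_counter()
    for i, test_idx in enumerate(folds):
        # Train on every fold except fold i, then test on fold i.
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        model = make_model().fit(
            [X[j] for j in train_idx], [y[j] for j in train_idx]
        )
        predictions = model.predict([X[j] for j in test_idx])
        correct = sum(p == y[j] for p, j in zip(predictions, test_idx))
        accuracies.append(correct / len(test_idx))
    elapsed = time.perf_counter() - start
    return mean(accuracies), elapsed


# Synthetic two-class data (assumption): two well-separated Gaussian clusters.
rng = random.Random(42)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(100)]
X += [[rng.gauss(4, 1), rng.gauss(4, 1)] for _ in range(100)]
y = [0] * 100 + [1] * 100

accuracy, seconds = cross_validate(NearestCentroid, X, y, k=10)
print(f"mean accuracy = {accuracy:.3f}, elapsed = {seconds:.4f} s")
```

To compare several classifiers in the way the abstract describes, one would call `cross_validate` once per model on the same folds (same `seed`) and tabulate the resulting accuracy and time pairs.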

Author information

Corresponding author

Correspondence to Nadeem Sarwar.

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Nasir, J. et al. (2020). Classification and Prediction Analysis of Diseases and Other Datasets Using Machine Learning. In: Bajwa, I., Sibalija, T., Jawawi, D. (eds) Intelligent Technologies and Applications. INTAP 2019. Communications in Computer and Information Science, vol 1198. Springer, Singapore. https://doi.org/10.1007/978-981-15-5232-8_37

  • DOI: https://doi.org/10.1007/978-981-15-5232-8_37

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-5231-1

  • Online ISBN: 978-981-15-5232-8

  • eBook Packages: Computer Science, Computer Science (R0)
