Skip to main content

Model Evaluation as Approach to Predict a Diagnosis

  • Conference paper
  • First Online:
Soft Computing Applications (SOFA 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 634))

Included in the following conference series:

Abstract

The paper presents an approach to a relevant issue of the supervised learning: classification. Creating the models that are able to generalize to new data, tuning the model so that the performance can be increased and models evaluation are relevant task of this sub-field of machine learning. In this paper, we have chosen to treat the evaluation of a model. The paper consists in two experiments, each of them with a particular purpose. First experiment outlines different model evaluation methods using specific performance metrics. This paper analyzes two models, logistic regression and the k-nearest neighbor with three methods of evaluation: train-test approach, test-set approach and the k-cross validation approach. On our analyzed data set, we achieved reasonable results using logistic regression as model with k-cross validation approach as evaluating method of the model. The second experiment determines the rank of the observations by predicting the probabilities depending on the response vector. Based on the predicted probability we try to improve the metric performance. For our particular task, one metric that allows us to extract relevant information from data and can be improved is the sensitivity metric.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn. 30(7), 1145–1159 (1997)

    Article  Google Scholar 

  2. Dursun, D., Walker, G., Kadam, A.: Predicting breast cancer survivability: a comparison of three data mining methods. Artif. Intell. Med. 34(2), 113–127 (2005)

    Article  Google Scholar 

  3. Fawcett, T.: An introduction to ROC analysis. Pattern Recogn. Lett. 27(8), 861–874 (2006)

    Article  MathSciNet  Google Scholar 

  4. Gama, J., Rodrigues, P.P., Raquel, S.: Evaluating algorithms that learn from data streams. In: Proceedings of the 2009 ACM symposium on Applied Computing. ACM (2009)

    Google Scholar 

  5. Gunawardana, A., Shani, G.: A survey of accuracy evaluation metrics of recommendation tasks. J. Mach. Learn. Res. 10, 2935–2962 (2009)

    MathSciNet  MATH  Google Scholar 

  6. Harrington, P.: Machine Learning in Action. Manning, Greenwich (2012)

    Google Scholar 

  7. Hunter, J.D.: Matplotlib: a 2D graphics environment. Comput. Sci. Eng. 9, 90–95 (2007)

    Article  Google Scholar 

  8. Lichman, M.: UCI Machine Learning Repository. University of California, School of Information and Computer Science, Irvine (2013)

    Google Scholar 

  9. Lobo, J.M., Valverde, A.J., Real, R.: AUC: a misleading measure of the performance of predictive distribution models. Glob. Ecol. Biogeogr. 17(2), 145–151 (2008)

    Article  Google Scholar 

  10. McKinney, W.: Data structures for statistical computing in python. In: Proceedings of the 9th Python in Science Conference, pp. 51–56 (2010)

    Google Scholar 

  11. Michalski, R.S., Jaime, C.G., Mitchell, T.M. (eds.): Machine Learning: An Artificial Intelligence Approach. Springer Science & Business Media, Heidelberg (2013)

    Google Scholar 

  12. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, É.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  13. Perez, F., Granger, B.E.: IPython: a system for interactive scientific computing. Comput. Sci. Eng. 9, 21–29 (2007)

    Article  Google Scholar 

  14. Seewald, A.K., Johannes, F.: An evaluation of grading classifiers. In: Advances in Intelligent Data Analysis, pp. 115–124. Springer, Heidelberg (2001)

    Google Scholar 

  15. Senthil Kumar, S., Hannah Inbarani, H., Azar, A.T., Own, H.S., Balas, V.E., Olariu, T.: Optimistic multi-granulation rough set based classification for neonatal jaundice. In: Kacprzyk, J. (ed.) Proceedings of the 6th International Workshop on Soft Computing Applications SOFA 2014. Advances in Intelligent Systems and Computing, Timisoara, Romania, July 22–24 (2014)

    Google Scholar 

  16. Weng, C.G., Josiah, P.: A new evaluation measure for imbalanced datasets. In: Proceedings of the 7th Australasian Data Mining Conference, vol. 87. Australian Computer Society, Inc. (2008)

    Google Scholar 

  17. Witten, I.H., Eibe, F.: Data Mining : Practical Machine Learning Tools and Techniques. Morgan Kaufmann, Burlington (2005)

    MATH  Google Scholar 

  18. Wolberg, W.H., Mangasarian, O.L.: Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc. Nat. Acad. Sci. U.S.A. 87, 9193–9196 (1990)

    Article  MATH  Google Scholar 

  19. Zhu, X.: Semi-supervised learning literature survey (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Adriana Mihaela Coroiu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Coroiu, A.M. (2018). Model Evaluation as Approach to Predict a Diagnosis. In: Balas, V., Jain, L., Balas, M. (eds) Soft Computing Applications. SOFA 2016. Advances in Intelligent Systems and Computing, vol 634. Springer, Cham. https://doi.org/10.1007/978-3-319-62524-9_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-62524-9_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-62523-2

  • Online ISBN: 978-3-319-62524-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics