Skip to main content

Hybrid Approach for Diagnosing Thyroid, Hepatitis, and Breast Cancer Based on Correlation Based Feature Selection and Naïve Bayes

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7666))

Abstract

Feature selection techniques have become an obvious need for researchers in computer science and many other fields of science. Whether the target research is in medicine, agriculture, business, or industry; the necessity for analysing large amount of data is needed. In Addition to that, finding the most excellent feature selection technique that best satisfies a certain learning algorithm could bring the benefit for researchers. Therefore, we proposed a new method for diagnosing some diseases based on a combination of learning algorithm tools and feature selection techniques. The idea is to obtain a hybrid approach that combines the best performing learning algorithms and the best performing feature selection techniques in regards to three well-known datasets. Experimental result shows that co-ordination between correlation based feature selection method along with Naive Bayse learning algorithm can produce promising results.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Han, J., Kambler, M.: Data Mining Concepts and Techniques, vol. 3. Morgan Kaufmann, San Franscisco (2011)

    Google Scholar 

  2. Hall, M.A., Holmes, G.: Benchmarking Attribute Selection Techniques for Discrete Class Data Mining. IEEE Transactions on Knowledge and Data Engineering 15(3) (2003)

    Google Scholar 

  3. Ashraf, M., et al.: A New Approach for Constructing Missing Features Values. International Journal of Intelligent Information Processing 3(1), 110–118 (2012)

    Article  Google Scholar 

  4. Blum, A.L., Langley, P.: Selection of relevant features and examples in machine learning. Artificial Intelligence 97(1-2), 245–271 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  5. Cancer, I.A.R. Mammography Screening can Reduce Deaths from Breast Cancer, http://www.iarc.fr/en/media-centre/pr/2002/pr139.html

  6. Lee, S.L.: Thyroid Problems (2012), http://www.emedicinehealth.com/thyroid_problems/article_em.htm

  7. Introducing Hepatitis C, http://www.hep.org.au/.

  8. Kononenko, I.: Estimating Attributes: Analysis and Extensions of RELIEF. In: Bergadano, F., De Raedt, L. (eds.) ECML 1994. LNCS, vol. 784, pp. 172–182. Springer, Heidelberg (1994)

    Google Scholar 

  9. Hall, M.A.: Correlation-based Feature Selection for Machine Learning in Department of Computer Science. The University of Waikato, Hamilton (1999)

    Google Scholar 

  10. Rutkowski, L., et al. (eds.): Artificial Intelligence and Soft Computing, Part I. ed. L.N.i.C.S. 6113, vol. 1, pp. 487–498 Springer, Poland (2010)

    Google Scholar 

  11. Guyon, I., Elisseeff, A.: An Introduction to Variable and Feature Selection. Journal of Machine Learning Research 3, 1157–1182 (2003)

    MATH  Google Scholar 

  12. Jolliffe, I.T.: Principal Component Analysis. Springer, NY (2002)

    MATH  Google Scholar 

  13. Liu, H., Setiono, R.: A probabilistic approach to feature selection: A fiter solution. In: Proc. of the 13th International Conference on Machine Learning (1996)

    Google Scholar 

  14. Liu, H., Setiono, R.: Chi2:Feature selection and discretization of numeric attributes. In: Proc. of the 7thIEEE International Conference on Tools with Articial Intelligence (1995)

    Google Scholar 

  15. Wolberg, W., Mangasarian, L.: Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proceedings of the National Academy of Sciences 87, 9193–9196 (1990)

    Article  MATH  Google Scholar 

  16. Pevsner, J.: Bioinformatics and Functional Genomics, 2nd edn. Wiley-Blackwell (2009)

    Google Scholar 

  17. Zhang, H., Su, J.: Naive Bayes for optimal ranking. Journal of Experimental & Theoretical Artificial Intelligence 20(2), 79–93 (2008)

    Article  MATH  Google Scholar 

  18. Rokach, L., Maimon, O.: Data Mining With Decision Trees. World Scientific Publishing (2008)

    Google Scholar 

  19. Ashraf, M., Le, K., Huang, X.: Information Gain and Adaptive Neuro-Fuzzy Inference System for Breast Cancer Diagnoses. In: Proc. of ICCIT 2010, pp. 911–915. IEEE Press, Seoul (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ashraf, M., Chetty, G., Tran, D., Sharma, D. (2012). Hybrid Approach for Diagnosing Thyroid, Hepatitis, and Breast Cancer Based on Correlation Based Feature Selection and Naïve Bayes. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7666. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34478-7_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-34478-7_34

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34477-0

  • Online ISBN: 978-3-642-34478-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics