Abstract
Feature selection techniques have become essential for researchers in computer science and many other fields of science. Whether the target research is in medicine, agriculture, business, or industry, large amounts of data must be analysed. In addition, finding the feature selection technique that best suits a given learning algorithm can benefit researchers. We therefore propose a new method for diagnosing several diseases based on a combination of learning algorithms and feature selection techniques. The idea is to obtain a hybrid approach that combines the best-performing learning algorithms and the best-performing feature selection techniques with respect to three well-known datasets. Experimental results show that combining the correlation-based feature selection method with the Naive Bayes learning algorithm can produce promising results.
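To make the hybrid idea concrete, the following is a minimal sketch of correlation-based feature selection followed by a Naive Bayes classifier. It is not the authors' implementation: the abstract does not specify the correlation measure, search strategy, or Naive Bayes variant, so the use of absolute Pearson correlation, a greedy forward search, a Gaussian likelihood, and the toy data are all assumptions made for illustration. The merit score follows the usual CFS form, k·r_cf / sqrt(k + k(k−1)·r_ff), where r_cf is the mean feature-class correlation and r_ff the mean feature-feature correlation.

```python
import math
from statistics import mean, pvariance

def pearson(xs, ys):
    """Absolute Pearson correlation (assumed measure; 0.0 if either input is constant)."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return abs(cov / (sx * sy)) if sx and sy else 0.0

def cfs_merit(columns, labels, subset):
    """CFS merit: rewards correlation with the class, penalises redundancy."""
    k = len(subset)
    r_cf = mean([pearson(columns[j], labels) for j in subset])
    pairs = [(a, b) for i, a in enumerate(subset) for b in subset[i + 1:]]
    r_ff = mean([pearson(columns[a], columns[b]) for a, b in pairs]) if pairs else 0.0
    return k * r_cf / math.sqrt(k + k * (k - 1) * r_ff)

def cfs_forward_select(rows, labels, min_gain=1e-9):
    """Greedy forward search (assumed strategy): stop when merit no longer improves."""
    columns = [list(col) for col in zip(*rows)]
    selected, best = [], 0.0
    while len(selected) < len(columns):
        scored = [(cfs_merit(columns, labels, selected + [j]), j)
                  for j in range(len(columns)) if j not in selected]
        merit, j = max(scored)
        if merit <= best + min_gain:
            break
        selected.append(j)
        best = merit
    return selected

def fit_gaussian_nb(rows, labels, features):
    """Per class: prior plus (mean, variance) of each selected feature."""
    model = {}
    for c in set(labels):
        members = [r for r, y in zip(rows, labels) if y == c]
        stats = [(mean([r[j] for r in members]),
                  pvariance([r[j] for r in members]) + 1e-9)  # smoothing term
                 for j in features]
        model[c] = (len(members) / len(rows), stats)
    return model

def predict(model, row, features):
    """Return the class with the highest log-posterior under the Gaussian assumption."""
    def log_posterior(c):
        prior, stats = model[c]
        total = math.log(prior)
        for j, (mu, var) in zip(features, stats):
            total += -0.5 * math.log(2 * math.pi * var) - (row[j] - mu) ** 2 / (2 * var)
        return total
    return max(model, key=log_posterior)

# Hypothetical toy data: feature 1 duplicates feature 0 (scaled), feature 2 is noise.
rows = [[1.0, 2.0, 5.0], [1.2, 2.4, 1.0], [0.9, 1.8, 4.0], [1.1, 2.2, 2.0],
        [3.0, 6.0, 4.5], [3.2, 6.4, 1.5], [2.9, 5.8, 3.5], [3.1, 6.2, 2.5]]
labels = [0, 0, 0, 0, 1, 1, 1, 1]

selected = cfs_forward_select(rows, labels)
model = fit_gaussian_nb(rows, labels, selected)
predictions = [predict(model, r, selected) for r in rows]
```

On this toy data the search keeps only one of the two redundant informative features and ignores the noise feature, which is the behaviour the merit score is designed to encourage. A real evaluation would use held-out data rather than predicting on the training rows.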
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ashraf, M., Chetty, G., Tran, D., Sharma, D. (2012). Hybrid Approach for Diagnosing Thyroid, Hepatitis, and Breast Cancer Based on Correlation Based Feature Selection and Naïve Bayes. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7666. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34478-7_34
DOI: https://doi.org/10.1007/978-3-642-34478-7_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34477-0
Online ISBN: 978-3-642-34478-7
eBook Packages: Computer Science, Computer Science (R0)