skip to main content
10.1145/3056662.3056716acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicscaConference Proceedingsconference-collections
research-article

Classifying breast cancer by using decision tree algorithms

Published:26 February 2017Publication History

ABSTRACT

Nowadays, datasets are growing daily to obtain knowledge from big dataset. Extraction operation of useful information from the dataset is called data mining that is one of the major techniques to get the diagnostic results especially in medical care fields as breast cancer. Breast cancer is one of the widespread cancers among women if matched with all other tumors all over the world. Classification is widely used in most important and necessary tasks in the real life applications in all fields. This technique has ability of detecting very similarities/differences that a human analyst may be not notice and therefore create and introduction more accurate/useful categories. This study presents comparison and analyses breast cancer dataset by using classification decision tree algorithms. Decision tree algorithms are applied to these algorithms which are J48, Function Tree, Random Forest Tree, AD Alternating Decision Tree, Decision stump and Best First. A computationally efficient classifies of these decision tree algorithms by employing Waikato Environment for Knowledge Analysis (WEKA) that is development program which includes a set of machine learning algorithms. These masses included 569 with 357 benign and 212 malignant cases with 32 attributes to test and proof the differences among the classification methods or algorithms. These results found by manner which involves reserving a particular sample of a medical dataset on which do not train the model. The decision tree classification forms forecast breast tumor with lower error average and higher precision of correctly classified cases 97.7%. The predicted accuracy correctly classified instances for decision stump algorithm 88.0% model is the lowest of all.

References

  1. Quinlan, J. R.1986. Induction of Decision Trees. Machine Learning., 81--106. Google ScholarGoogle ScholarCross RefCross Ref
  2. Rajesh K, Anand S., 2012. Analysis of SEER Dataset for Breast Cancer Diagnosis using C4.5 Classification Algorithm. Int. Jour, of Advanced Research in Computer and Communication Engineering. 1(2):72--77.Google ScholarGoogle Scholar
  3. Cobain EF, Hayes DF. 2015. Indications for prognostic gene expression profiling in early breast cancer. Curr. Treat Options Oncol. 16(5):23. Google ScholarGoogle ScholarCross RefCross Ref
  4. Canadian Cancer Society's Steering Committee on Cancer Statistics. Canadian Cancer Statistics 2015. Toronto Canadian Cancer Society.Google ScholarGoogle Scholar
  5. Sharma, P., Ratnoo, S. 2014, A review on Discovery of Classification Rules using Pittsburgh Approach, Int. J. Artif. Intell. Know 1. Discov., 4(3): 6--16.Google ScholarGoogle Scholar
  6. Han, J. and Kamber, M. 2001. Data Mining: Concepts and Techniques, Morgan Kaufmann.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Buck, Carol J. 2016. Step-by-Step Medical Coding, Elsevier Health Sciences (Book).Google ScholarGoogle Scholar
  8. Kim, H.,Loh,W. 2001. Classification trees with unbiased multi-way splits, Jour. of the American Statistical Association, W.-Y.2001, vol. 96, 589--604.Google ScholarGoogle Scholar
  9. UCI Machine Learning Repository: 1995. Center for Machine Learning and Intelligent Systems. Breast cancer Wisconsin (diagnostic) dataset.Google ScholarGoogle Scholar
  10. Han, J., Kamber M. and Pei, J. 2011. Data Mining: Concepts and Techniques. The Morgan Kaufmann Series in Data Management Systems.Google ScholarGoogle Scholar
  11. Rastogi, R. and Shim, K. 1998. Public: A decision tree classifier that integrates building and pruning. In Proc. of 1996 Int. Conf. on Very Large Databases (VLDB96).Google ScholarGoogle Scholar
  12. Kaur, G., Chhabra. A. 2014. Improved J48 classification algorithm for the prediction of diabetes. Int. Jour. of Computer Applications., 98(22)13--17. Google ScholarGoogle ScholarCross RefCross Ref
  13. Bahgat, E.M. and Moawad, I.F. 2016 Semantic-Based Feature Reduction Approach for E-mail Classification. 2016 Proceedings of the Int. Conf. on Advanced Intelligent Systems and Informatics, 533: 53--63. Google ScholarGoogle ScholarCross RefCross Ref
  14. Shi H. 2007. Best-first Decision Tree Learning. Master Thesis. University of WaikatoGoogle ScholarGoogle Scholar
  15. Gama, J. 2004, Functional trees, Machine. Learning, 55 (3) :219--250 Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Torgo, L. 1997. Functional models for regression tree leaves. In D. Fisher, Machine. Learning, Proceedings of the 14th Int. conf.,385--393Google ScholarGoogle Scholar
  17. Freund, Y. and Mason, L.1999. The Alternating Decision Tree Algorithm. Proceedings 16th Int. Conf. on Mach. Learn, 124--133.Google ScholarGoogle Scholar
  18. Vanassche, A., Krzywania, D., Vaneyghen, J., Struyf, J., Blockeel, H. 2003. First order alternating decision trees. 13th Int. Conf. on Inductive Logic Programming, 116--125.Google ScholarGoogle Scholar
  19. Ho, Tin Kam 1995. Random Decision Forests(PDF). Proceedings of the 3rd Int. Conf. on Document Analysis and Recognition, Montreal, 278--282.Google ScholarGoogle Scholar
  20. Breiman, L., 2001. Random forests, Machine Learning 45 (1), 5--32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Iba, W.; and Langley, P. 1992; Induction of One-Level Decision Trees, in ML92: Proceedings of the Ninth Int. Conf. on Mach. Learn, 233--240.Google ScholarGoogle Scholar
  22. Holte, Robert C. 1993. Very Simple Classification Rules Perform Well on Most Commonly Used Datasets, Machine Learning. 11:63--91. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Frank, E., Hal, M. A., and Witten I. 2016. The WEKA Workbench. Online Appendix for Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufmann.Google ScholarGoogle Scholar

Index Terms

  1. Classifying breast cancer by using decision tree algorithms

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICSCA '17: Proceedings of the 6th International Conference on Software and Computer Applications
      February 2017
      339 pages
      ISBN:9781450348577
      DOI:10.1145/3056662

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 February 2017

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader