Skip to main content
Log in

Application of Attribute Weighting Method Based on Clustering Centers to Discrimination of Linearly Non-Separable Medical Datasets

  • Original Paper
  • Published:
Journal of Medical Systems Aims and scope Submit manuscript

Abstract

In this paper, attribute weighting method based on the cluster centers with aim of increasing the discrimination between classes has been proposed and applied to nonlinear separable datasets including two medical datasets (mammographic mass dataset and bupa liver disorders dataset) and 2-D spiral dataset. The goals of this method are to gather the data points near to cluster center all together to transform from nonlinear separable datasets to linear separable dataset. As clustering algorithm, k-means clustering, fuzzy c-means clustering, and subtractive clustering have been used. The proposed attribute weighting methods are k-means clustering based attribute weighting (KMCBAW), fuzzy c-means clustering based attribute weighting (FCMCBAW), and subtractive clustering based attribute weighting (SCBAW) and used prior to classifier algorithms including C4.5 decision tree and adaptive neuro-fuzzy inference system (ANFIS). To evaluate the proposed method, the recall, precision value, true negative rate (TNR), G-mean1, G-mean2, f-measure, and classification accuracy have been used. The results have shown that the best attribute weighting method was the subtractive clustering based attribute weighting with respect to classification performance in the classification of three used datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25

Similar content being viewed by others

References

  1. Özşen, S., and Güneş, S., Attribute weighting via genetic algorithms for attribute weighted artificial immune system (AWAIS) and its application to heart disease and liver disorders problems. Expert Syst. Appl. 36:386–392, 2009.

    Article  Google Scholar 

  2. Gançarski, P., Blansché, A., and Wania, A., Comparison between two coevolutionary feature weighting algorithms in clustering. Pattern Recognit. 41:983–994, 2008.

    Article  MATH  Google Scholar 

  3. Polat, K., Şahan, S., and Güneş, S., A new method to medical diagnosis: Artificial immune recognition system (AIRS) with fuzzy weighted pre-processing and application to ECG arrhythmia. Expert Syst. Appl. 31:264–269, 2006.

    Article  Google Scholar 

  4. Polat, K., Şahan, S., and Güneş, S., Automatic detection of heart disease using an artificial immune recognition system (AIRS) with fuzzy resource allocation mechanism and k-nn (nearest neighbour) based weighting preprocessing. Expert Syst. Appl. 32:625–631, 2007.

    Article  Google Scholar 

  5. Polat, K., and Güneş, S., A hybrid medical decision making system based on principles component analysis, k-NN based weighted pre-processing and adaptive neuro-fuzzy inference system. Digital Signal Process. 16:913–921, 2006.

    Article  Google Scholar 

  6. Polat, K., and Güneş, S., The effect to diagnostic accuracy of decision tree classifier of fuzzy and k-NN based weighted pre-processing methods to diagnosis of erythemato-squamous diseases. Digital Signal Process. 16:922–930, 2006.

    Article  Google Scholar 

  7. Polat, K., Latifoğlu, F., Kara, S., and Güneş, S., Usage of novel similarity based weighting method to diagnose the atherosclerosis from carotid artery Doppler signals. Med. Biol. Eng. Comput. 46:353–362, 2008.

    Article  Google Scholar 

  8. Dua, S., Singh, H., and Thompson, H. W., Associative classification of mammograms using weighted rules. Expert Syst. Appl. 36:9250–9259, 2009.

    Article  Google Scholar 

  9. Polat, K.,Durduran, S.S., Subtractive clustering attribute weighting (SCAW) to discriminate the traffic accidents on Konya–Afyonkarahisar highway in Turkey with the help of GIS: A case study. Adv Eng Software In Press, 2011.

  10. Bai, L., Liang, J., Dang, C., Cao, F., A novel attribute weighting algorithm for clustering high-dimensional categorical data. Pattern Recognition In Press, 2011.

  11. Hathaway, R., and Bezdek, J., Fuzzy c-means clustering of incomplete data. IEEE Trans. Syst. Man Cybern. 31:735–744, 2001.

    Article  Google Scholar 

  12. Everitt, B., Landau, S., and Leese, M., Cluster Analysis. Arnold, London, 2001.

    MATH  Google Scholar 

  13. Hartigan, J., Clustering Algorithms. Wiley, New York, 1975.

    MATH  Google Scholar 

  14. MacQueen, B., Some Methods for classification and Analysis of Multivariate Observations, Proceedings of 5-th Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, University of California Press, 1:281–297, 1967.

  15. Bezdek, J. C., Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York, 1981.

    Book  MATH  Google Scholar 

  16. Yager, R. R., and Filev, D. P., Generation of fuzzy rules by mountain clustering. IEEE Trans. Syst. Man Cybern. 24:209–219, 1994.

    Google Scholar 

  17. Chiu, S. L., Fuzzy model identification based on cluster estimation. J. Intell. Fuzzy Syst. 2, 1994.

  18. Dempster, A. P., Laird, N. M., and Rubin, D. B., Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. 39:1–38, 1977.

    MathSciNet  MATH  Google Scholar 

  19. Marinai, S., Faini, S., Marino, E., Soda, G., Efficient word retrieval by means of SOM clustering and PCA. In: DAS 2006, Springer Verlag- LNCS 3872 (2006) 336–347

  20. Çomak, E., Polat, K., Güneş, S., and Arslan, A., A new medical decision making system: Least square support vector machine (LSSVM) with Fuzzy Weighting Pre-processing. Expert Syst. Appl. 32:409–414, 2007.

    Article  Google Scholar 

  21. Polat, K., Şahan, S., Kodaz, H., and Güneş, S., Breast cancer and liver disorders classification using artificial immune recognition system (AIRS) with performance evaluation by fuzzy resource allocation mechanism. Expert Syst. Appl. 32:172–183, 2007.

    Article  Google Scholar 

  22. Jin, B., Tang, Y. C., and Zhang, Y. Q., Support vector machines with genetic fuzzy feature transformation for biomedical data classification. Inf. Sci. 177:476–489, 2007.

    Article  Google Scholar 

  23. Elter, M., Schulz-Wendtland, R., and Wittenberg, T., The prediction of breast cancer biopsy outcomes using two CAD approaches that both emphasize an intelligible decision process. Med. Physic 34:4164–4172, 2007.

    Article  Google Scholar 

  24. Benaki Lairenjamand Siri Krishan Wasan, Neural Network with Classification Based on Multiple Association Rule for Classifying Mammographic Data Lecture Notes in Computer Science, 2009, Volume 5788, Intelligent Data Engineering and Automated Learning - IDEAL 2009, Pages 465–476

  25. UCI machine learning database, ftp://ftp.ics.uci.edu/pub/machine-learning-databases (last accessed: 2011)

  26. http://www.benmargolis.com/compsci/ai/two_spirals_problem.htm (last accessed: 2011)

  27. Singh, S., 2D spiral pattern recognition with possibilistic measures. Pattern Recognit. Lett. 19:141–147, 1998.

    Article  MATH  Google Scholar 

  28. Güneş, S., Polat, K., and Yosunkaya, Ş., Efficient sleep stage recognition system based on EEG signal using k-means clustering based feature weighting. Expert Syst. Appl. 37:7922–7928, 2010.

    Article  Google Scholar 

  29. Polat, K., Durduran, S.S., Automatic determination of traffic accidents based on KMC-based attribute weighting. Neural Computing & Applications, Article in Press, 2011.

  30. http://www.resample.com/xlminer/help/kMClst/KMClust_intro.htm (last accessed: 2011)

  31. http://en.wikipedia.org/wiki/K-means_clustering (last accessed: 2011)

  32. http://en.wikipedia.org/wiki/Cluster_analysis#Fuzzy_c-means_clustering, (last accessed: 2011)

  33. Polat, K., Classification of Parkinson’s disease using feature weighting method onthe basis of fuzzy C-means clustering. International Journal of Systems Science, Article in Press, 2011.

  34. Quinlan, J. R., C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, 1993.

  35. Mitchell, Tom M., Machine Learning. McGraw-Hill, 1997. pp. 55–58.

  36. Kotsiantis, S. B., Supervised machine learning: A review of classification techniques. Informatica 31:249–268, 2007.

    MathSciNet  MATH  Google Scholar 

  37. Quinlan, J. R., Improved use of continuous attributes in c4.5. J. Artif. Intell. Res. 4:77–90, 1996.

    MATH  Google Scholar 

  38. Jang, J.-S. R., Fuzzy Modeling Using Generalized Neural Networks and Kalman Filter Algorithm, Proc. of the Ninth National Conf. on Artificial Intelligence (AAAI-91), 762–767, 1991.

  39. Polat, K., and Güneş, S., A hybrid medical decision making system based on principles component analysis, k-NN based weighted pre-processing and adaptive neuro-fuzzy inference system. Digital Signal Process. 16:913–921, 2006.

    Article  Google Scholar 

  40. Akdemir, B., Kara, S., Polat, K., Güven, A., and Güneş, A., Ensemble adaptive network-based fuzzy inference system with weighted arithmetical mean and application to diagnosis of optic nerve disease from visual-evoked potential signals. Artif. Intell. Med. 43:141–149, 2008.

    Article  Google Scholar 

  41. Kohavi, R., A study of cross-validation and bootstrap for accuracy estimation and model selection. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence 2 (12):1137–1143, 1995, http://citeseer.ist.psu.edu/kohavi95study.html. (Morgan Kaufmann, San Mateo)

  42. http://www2.cs.uregina.ca/~dbd/cs831/notes/confusion_matrix/confusion_matrix.html (last accessed: 2011)

  43. Güven, A., Polat, K., Kara, S., and Güneş, S., The effect of generalized discriminate analysis (GDA) to the classification of optic nerve disease from VEP signals. Comput. Biol. Med. 38:62–68, 2008.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kemal Polat.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Polat, K. Application of Attribute Weighting Method Based on Clustering Centers to Discrimination of Linearly Non-Separable Medical Datasets. J Med Syst 36, 2657–2673 (2012). https://doi.org/10.1007/s10916-011-9741-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10916-011-9741-y

Keywords

Navigation