Abstract
This article presents a robust noise-resistant fuzzy-based algorithm for cancer class detection. High-throughput microarray technologies facilitate the generation of large-scale expression data; this data captures enough information to build classifiers to understand the molecular basis of a disease. The proposed approach built on the Credibilistic Fuzzy C-Means (CFCM) algorithm partitions data restricted to a p-dimensional unit hypersphere. CFCM was introduced to address the noise sensitiveness of fuzzy-based procedures, but it is unstable and fails to capture local non-linear interactions. The introduced approach addresses these shortcomings. The experimental findings in this article focus on cancer expression datasets. The performance of the proposed approach is assessed with both internal and external measures. The fuzzy-based learning algorithms Fuzzy C-Means (FCM) and Hyperspherical Fuzzy C-Means (HFCM) are used for comparative analysis. The experimental findings indicate that the proposed approach can be used as a plausible tool for clustering cancer expression data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Babichev, S., Skvor, J., Fiser, J., Lytvynenko, V.: Technology of gene expression profiles filtering based on wavelet analysis. Int. J. Intell. Syst. Appl. (IJISA) 10(4), 1–7 (2018). https://doi.org/10.5815/ijisa.2018.04.01
Banerjee, A., Dhillon, I.S., Ghosh, J., Sra, S., Ridgeway, G.: Clustering on the unit hypersphere using von mises-fisher distributions. J. Mach. Learn. Res. 6(9), 1345–1382 (2005)
Banerjee, A., Ghosh, J.: Frequency sensitive competitive learning for clustering on high-dimensional hyperspheres. In: Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN 2002 (Cat. No. 02CH37290), vol. 2, pp. 1590–1595. IEEE (2002). https://doi.org/10.1109/IJCNN.2002.1007755
Banerjee, A., Ghosh, J.: Frequency-sensitive competitive learning for scalable balanced clustering on high-dimensional hyperspheres. IEEE Trans. Neural Netw. 15(3), 702–719 (2004). https://doi.org/10.1109/TNN.2004.824416
Benjamin, J.B., Hussain, I., Yang, M.S.: Possiblistic c-means clustering on directional data. In: 2019 12th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), pp. 1–6. IEEE (2019). https://doi.org/10.1109/CISP-BMEI48845.2019.8965703
Bezdek, J.C.: Fuzzy-mathematics in pattern classification. Cornell University (1973)
Caliński, T., Harabasz, J.: A dendrite method for cluster analysis. Commun. Stat.-Theory Methods 3(1), 1–27 (1974). https://doi.org/10.1080/03610927408827101
Callari, M., et al.: Comparison of microarray platforms for measuring differential microrna expression in paired normal/cancer colon tissues (2012). https://doi.org/10.1371/journal.pone.0045105
Chintalapudi, K.K., Kam, M.: A noise-resistant fuzzy c means algorithm for clustering. In: 1998 IEEE international conference on fuzzy systems proceedings. In: IEEE World Congress on Computational Intelligence (Cat. No. 98CH36228), vol. 2, pp. 1458–1463. IEEE (1998). https://doi.org/10.1109/FUZZY.1998.686334
Chintalapudi, K.K., Kam, M.: The credibilistic fuzzy c means clustering algorithm. In: SMC 1998 Conference Proceedings. 1998 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No. 98CH36218), vol. 2, pp. 2034–2039. IEEE (1998). https://doi.org/10.1109/ICSMC.1998.728197
Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. 2, 224–227 (1979). https://doi.org/10.1109/TPAMI.1979.4766909
Dortet-Bernadet, J.L., Wicker, N.: Model-based clustering on the unit sphere with an illustration using gene expression profiles. Biostatistics 9(1), 66–80 (2008). https://doi.org/10.1093/biostatistics/kxm012
Dunn, J.C.: A fuzzy relative of the isodata process and its use in detecting compact well-separated clusters (1973). https://doi.org/10.1080/01969727308546046
Gao, X., Xie, W.: Advances in theory and applications of fuzzy clustering. Chin. Sci. Bull. 45(11), 961–970 (2000). https://doi.org/10.1007/BF02884971
Griesinger, A.M.: Characterization of distinct immunophenotypes across pediatric brain tumor types. J. Immunol. 191(9), 4880–4888 (2013). https://doi.org/10.4049/jimmunol.1301966
Hornik, K., Feinerer, I., Kober, M., Buchta, C.: Spherical k-means clustering. J. Stat. Softw. 50, 1–22 (2012)
Kaur, P.: Intuitionistic fuzzy sets based credibilistic fuzzy C-means clustering for medical image segmentation. Int. J. Inf. Technol. 9(4), 345–351 (2017). https://doi.org/10.1007/s41870-017-0039-2
Kaur, P., Soni, A., Gosain, A.: Robust kernelized approach to clustering by incorporating new distance measure. Eng. Appl. Artif. Intell 26(2), 833–847 (2013). https://doi.org/10.1016/j.engappai.2012.07.002
Kesemen, O., Tezel, Ö., Özkul, E.: Fuzzy c-means clustering algorithm for directional data (fcm4dd). Expert Syst. Appl. 58, 76–82 (2016). https://doi.org/10.1016/j.eswa.2016.03.034
Laskhmaiah, K., Murali Krishna, S., Eswara Reddy, B.: An optimized k-means with density and distance-based clustering algorithm for multidimensional spatial databases. Int. J. Comput. Netw. Inf. Secur. (IJCNIS) 13(6), 70–82 (2021). https://doi.org/10.5815/ijcnis.2021.06.06
Maitra, R., Ramler, I.P.: A k-mean-directions algorithm for fast clustering of data on the sphere. J. Comput. Graph. Stat. 19(2), 377–396 (2010). https://doi.org/10.1198/jcgs.2009.08155
Mendes, M., Sacks, L.: Evaluating fuzzy clustering for relevance-based information access. In: The 12th IEEE International Conference on Fuzzy Systems, 2003, FUZZ 2003, vol. 1, pp. 648–653. IEEE (2003). https://doi.org/10.1109/FUZZ.2003.1209440
Nguyen, T., Khosravi, A., Creighton, D., Nahavandi, S.: Hierarchical gene selection and genetic fuzzy system for cancer microarray data classification. PloS one 10(3), e0120364 (2015). https://doi.org/10.1371/journal.pone.0120364
Rodriguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science 344(6191), 1492–1496 (2014). https://doi.org/10.1126/science.1242072
Roessler, S., et al.: A unique metastasis gene signature enables prediction of tumor relapse in early-stage hepatocellular carcinoma patients. Cancer Res. 70(24), 10202–10212 (2010). https://doi.org/10.1158/0008-5472.CAN-10-2607
Starczewski, A., Krzyżak, A.: Performance evaluation of the silhouette index. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds.) ICAISC 2015. LNCS (LNAI), vol. 9120, pp. 49–58. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-19369-4_5
Yan, Y., Chen, L.: Hyperspherical possibilistic fuzzy c-means for high-dimensional data clustering. In: 2009 7th International Conference on Information, Communications and Signal Processing (ICICS), pp. 1–5. IEEE (2009). https://doi.org/10.5555/1818318.1818479
Yang, M.S., Chang-Chien, S.J., Hung, W.L.: An unsupervised clustering algorithm for data on the unit hypersphere. Appl. Soft Comput. 42, 290–313 (2016). https://doi.org/10.1016/j.asoc.2015.12.037
Yang, M.S., Pan, J.A.: On fuzzy clustering of directional data. Fuzzy Sets Syst. 91(3), 319–326 (1997). https://doi.org/10.1016/S0165-0114(96)00157-1
Yang, M.S., Wu, K.L.: A similarity-based robust clustering method. IEEE Trans. Pattern Anal. Mach. Intell. 26(4), 434–448 (2004). https://doi.org/10.1109/TPAMI.2004.1265860
Yu, X., Yu, G., Wang, J.: Clustering cancer gene expression data by projective clustering ensemble. PloS one 12(2), e0171429 (2017). https://doi.org/10.1371/journal.pone.0171429
Zeng, Y., Xu, Z., He, Y., Rao, Y.: Fuzzy entropy clustering by searching local border points for the analysis of gene expression data. Knowl.-Based Syst. 190, 105309 (2020). https://doi.org/10.1016/j.knosys.2019.105309
Acknowledgment
Oleksii K. Tyshchenko acknowledges the Helmholtz Information and Data Science Academy (HIDA) for providing financial support enabling a short-term research stay at Helmholtz Centre for Infection Research (Braunschweig, Germany) for fuzzy clustering analysis of biological data.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Hu, Z., Last, M., Hong, TP., Tyshchenko, O.K., Kashyap, E. (2023). A Noise Resistant Credibilistic Fuzzy Clustering Algorithm on a Unit Hypersphere with Illustrations Using Expression Data. In: Babichev, S., Lytvynenko, V. (eds) Lecture Notes in Data Engineering, Computational Intelligence, and Decision Making. ISDMCI 2022. Lecture Notes on Data Engineering and Communications Technologies, vol 149. Springer, Cham. https://doi.org/10.1007/978-3-031-16203-9_32
Download citation
DOI: https://doi.org/10.1007/978-3-031-16203-9_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16202-2
Online ISBN: 978-3-031-16203-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)