skip to main content
research-article

A Generalized Deep Learning Clustering Algorithm Based on Non-Negative Matrix Factorization

Published:04 May 2023Publication History
Skip Abstract Section

Abstract

Clustering is a popular research topic in the field of data mining, in which the clustering method based on non-negative matrix factorization (NMF) has been widely employed. However, in the update process of NMF, there is no learning rate to guide the update as well as the update depends on the data itself, which leads to slow convergence and low clustering accuracy. To solve these problems, a generalized deep learning clustering (GDLC) algorithm based on NMF is proposed in this article. Firstly, a nonlinear constrained NMF (NNMF) algorithm is constructed to achieve sequential updates of the elements in the matrix guided by the learning rate. Then, the gradient values corresponding to the element update are transformed into generalized weights and generalized biases, by inputting the elements as well as their corresponding generalized weights and generalized biases into the nonlinear activation function to construct the GDLC algorithm. In addition, for improving the understanding of the GDLC algorithm, its detailed inference procedure and algorithm design are provided. Finally, the experimental results on eight datasets show that the GDLC algorithm has efficient performance.

REFERENCES

  1. [1] Balakrishnama Suresh and Ganapathiraju Aravind. 1998. Linear discriminant analysis-a brief tutorial. Institute for Signal and Information Processing 18, 1998 (1998), 18.Google ScholarGoogle Scholar
  2. [2] Cai Deng, He Xiaofei, Han Jiawei, and Huang Thomas S.. 2010. Graph regularized nonnegative matrix factorization for data representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 33, 8 (2010), 15481560.Google ScholarGoogle Scholar
  3. [3] Chen Wen-Sheng, Zeng Qianwen, and Pan Binbin. 2022. A survey of deep nonnegative matrix factorization. Neurocomputing 491 (2022), 305320.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. [4] Deng Ping, Li Tianrui, Wang Hongjun, Horng Shi-Jinn, Yu Zeng, and Wang Xiaomin. 2021. Tri-regularized nonnegative matrix tri-factorization for co-clustering. Knowledge-Based Systems 226 (2021), 107101.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Deng Ping, Li Tianrui, Wang Hongjun, Wang Dexian, Horng Shi-Jinn, and Liu Rui. 2022. Graph regularized sparse non-negative matrix factorization for clustering. IEEE Transactions on Computational Social Systems (2022), 1–12.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Deng Ping, Zhang Fan, Li Tianrui, Wang Hongjun, and Horng Shi-Jinn. 2022. Biased unconstrained non-negative matrix factorization for clustering. Knowledge-Based Systems 239 (2022), 108040.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Ding Chris, Li Tao, Peng Wei, and Park Haesun. 2006. Orthogonal nonnegative matrix t-factorizations for clustering. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 126135.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Guan Naiyang, Tao Dacheng, Luo Zhigang, and Yuan Bo. 2012. Online nonnegative matrix factorization with robust stochastic approximation. IEEE Transactions on Neural Networks and Learning Systems 23, 7 (2012), 10871099.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Guo Zhenxing and Zhang Shihua. 2019. Sparse deep nonnegative matrix factorization. Big Data Mining and Analytics 3, 1 (2019), 1328.Google ScholarGoogle ScholarCross RefCross Ref
  10. [10] Hebb Donald Olding. 2005. The Organization of Behavior: A Neuropsychological Theory. Psychology Press.Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Hedjam Rachid, Abdesselam Abdelhamid, and Melgani Farid. 2021. NMF with feature relationship preservation penalty term for clustering problems. Pattern Recognition 112 (2021), 107814.Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] Huang Jin, Nie Feiping, Huang Heng, and Ding Chris. 2014. Robust manifold nonnegative matrix factorization. ACM Transactions on Knowledge Discovery from Data (TKDD) 8, 3 (2014), 121.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Huang Shudong, Xu Zenglin, Kang Zhao, and Ren Yazhou. 2020. Regularized nonnegative matrix factorization with adaptive local structure learning. Neurocomputing 382 (2020), 196209.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Kahan W.. 2013. A tutorial overview of vector and matrix norms. University of California, Berkeley, CA, Lecture Notes (2013), 19.Google ScholarGoogle Scholar
  15. [15] Roux Jonathan Le, Hershey John R., and Weninger Felix. 2015. Deep NMF for speech separation. In Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’15). IEEE, 6670.Google ScholarGoogle Scholar
  16. [16] LeCun Yann, Bengio Yoshua, and Hinton Geoffrey. 2015. Deep learning. Nature 521, 7553 (2015), 436444.Google ScholarGoogle ScholarCross RefCross Ref
  17. [17] Lee Daniel D. and Seung H. Sebastian. 1999. Learning the parts of objects by non-negative matrix factorization. Nature 401, 6755 (1999), 788791.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Lei Xiujuan, Tie Jiaojiao, and Fujita Hamido. 2020. Relational completion based non-negative matrix factorization for predicting metabolite-disease associations. Knowledge-Based Systems 204 (2020), 106238.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Li Bo, Zhou Guoxu, and Cichocki Andrzej. 2015. Two efficient algorithms for approximately orthogonal nonnegative matrix factorization. IEEE Signal Processing Letters 22, 7 (2015), 843846.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Li Heng-Chao, Yang Gang, Yang Wen, Du Qian, and Emery William J.. 2020. Deep nonsmooth nonnegative matrix factorization network with semi-supervised learning for SAR image change detection. ISPRS Journal of Photogrammetry and Remote Sensing 160 (2020), 167179.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Li Xuelong, Cui Guosheng, and Dong Yongsheng. 2017. Graph regularized non-negative low-rank matrix factorization for image clustering. IEEE Transactions on Cybernetics 47, 11 (2017), 38403853.Google ScholarGoogle ScholarCross RefCross Ref
  22. [22] Meng Yang, Shang Ronghua, Jiao Licheng, Zhang Wenya, and Yang Shuyuan. 2018. Dual-graph regularized non-negative matrix factorization with sparse and orthogonal constraints. Engineering Applications of Artificial Intelligence 69 (2018), 2435.Google ScholarGoogle ScholarCross RefCross Ref
  23. [23] Nie Feiping, Wang Cheng-Long, and Li Xuelong. 2019. K-multiple-means: A multiple-means clustering method with specified k clusters. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 959967.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Peng Siyuan, Ser Wee, Chen Badong, and Lin Zhiping. 2021. Robust semi-supervised nonnegative matrix factorization for image clustering. Pattern Recognition 111 (2021), 107683.Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Robbins Herbert and Siegmund David. 1971. A convergence theorem for non negative almost supermartingales and some applications. In Proceedings of the Optimizing Methods in Statistics. Elsevier, 233257.Google ScholarGoogle Scholar
  26. [26] Rosenblatt Frank. 1958. The perceptron: A probabilistic model for information storage and organization in the brain. Psychological Review 65, 6 (1958), 386408.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Rumelhart David E., Hinton Geoffrey E., and Williams Ronald J.. 1986. Learning representations by back-propagating errors. Nature 323, 6088 (1986), 533536.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Saxena Amit, Prasad Mukesh, Gupta Akshansh, Bharill Neha, Patel Om Prakash, Tiwari Aruna, Er Meng Joo, Ding Weiping, and Lin Chin-Teng. 2017. A review of clustering techniques and developments. Neurocomputing 267 (2017), 664681.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Shang Fanhua, Jiao Licheng, and Wang Fei. 2012. Graph dual regularization non-negative matrix factorization for co-clustering. Pattern Recognition 45, 6 (2012), 22372250.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Shang Ronghua, Wang Wenbing, Stolkin Rustam, and Jiao Licheng. 2017. Non-negative spectral learning and sparse regression-based dual-graph regularized feature selection. IEEE Transactions on Cybernetics 48, 2 (2017), 793806.Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Sun Gan, Cong Yang, Zhang Yulun, Zhao Guoshuai, and Fu Yun. 2021. Continual multiview task learning via deep matrix factorization. IEEE Transactions on Neural Networks and Learning Systems 32, 1 (2021), 139150.Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] Sun Jing, Wang Zhihui, Sun Fuming, and Li Haojie. 2018. Sparse dual graph-regularized NMF for image co-clustering. Neurocomputing 316 (2018), 156165.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Vidal Rene, Ma Yi, and Sastry Shankar. 2005. Generalized principal component analysis (GPCA). IEEE Transactions on Pattern Analysis and Machine Intelligence 27, 12 (2005), 19451959.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. [34] Wen Jinhuan, Fowler James E., He Mingyi, Zhao Yong-Qiang, Deng Chengzhi, and Menon Vineetha. 2016. Orthogonal nonnegative matrix factorization combining multiple features for spectral–spatial dimensionality reduction of hyperspectral imagery. IEEE Transactions on Geoscience and Remote Sensing 54, 7 (2016), 42724286.Google ScholarGoogle ScholarCross RefCross Ref
  35. [35] Wold Svante, Esbensen Kim, and Geladi Paul. 1987. Principal component analysis. Chemometrics and Intelligent Laboratory Systems 2, 1-3 (1987), 3752.Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Xu Rui and Wunsch Donald. 2005. Survey of clustering algorithms. IEEE Transactions on Neural Networks 16, 3 (2005), 645678.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. [37] Yang Mingming and Xu Songhua. 2021. Orthogonal nonnegative matrix factorization using a novel deep autoencoder network. Knowledge-Based Systems 227 (2021), 107236.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. [38] Yang Zuyuan, Liang Naiyao, Yan Wei, Li Zhenni, and Xie Shengli. 2021. Uniform distribution non-negative matrix factorization for multiview clustering. IEEE Transactions on Cybernetics 51, 6 (2021), 32493262.Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] Ye Fanghua, Chen Chuan, and Zheng Zibin. 2018. Deep autoencoder-like nonnegative matrix factorization for community detection. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 13931402.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Ye Jun and Jin Zhong. 2014. Dual-graph regularized concept factorization for clustering. Neurocomputing 138 (2014), 120130.Google ScholarGoogle ScholarCross RefCross Ref
  41. [41] Yi Yugen, Chen Yuqi, Wang Jianzhong, Lei Gang, Dai Jiangyan, and Zhang Huihui. 2020. Joint feature representation and classification via adaptive graph semi-supervised nonnegative matrix factorization. Signal Processing: Image Communication 89 (2020), 115984.Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Zhao Renbo and Tan Vincent Y. F.. 2016. Online nonnegative matrix factorization with outliers. IEEE Transactions on Signal Processing 65, 3 (2016), 555570.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] Zhao Yang, Wang Huiyang, and Pei Jihong. 2019. Deep non-negative matrix factorization architecture based on underlying basis images learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 6 (2019), 18971913.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A Generalized Deep Learning Clustering Algorithm Based on Non-Negative Matrix Factorization

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Knowledge Discovery from Data
      ACM Transactions on Knowledge Discovery from Data  Volume 17, Issue 7
      August 2023
      319 pages
      ISSN:1556-4681
      EISSN:1556-472X
      DOI:10.1145/3589018
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 4 May 2023
      • Online AM: 20 February 2023
      • Accepted: 13 February 2023
      • Revised: 1 December 2022
      • Received: 31 May 2022
      Published in tkdd Volume 17, Issue 7

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)360
      • Downloads (Last 6 weeks)38

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text

    HTML Format

    View this article in HTML Format .

    View HTML Format