research-article

A Generalized Deep Learning Clustering Algorithm Based on Non-Negative Matrix Factorization

Authors:
Dexian Wang

Southwest Jiaotong University, Chengdu, China

Southwest Jiaotong University, Chengdu, China
View Profile

,
Tianrui Li

Southwest Jiaotong University, Chengdu, China

Southwest Jiaotong University, Chengdu, China
View Profile

,
Ping Deng

Xihua University, Chengdu, China

Xihua University, Chengdu, China

0000-0001-7208-8855
View Profile

,
Fan Zhang

Southwest Jiaotong University, Chengdu, China

Southwest Jiaotong University, Chengdu, China

0000-0002-8735-2812
View Profile

,
Wei Huang

Southwest Jiaotong University, Chengdu, China

Southwest Jiaotong University, Chengdu, China

0000-0001-9031-107X
View Profile

,
Pengfei Zhang

Southwest Jiaotong University, Chengdu, China

Southwest Jiaotong University, Chengdu, China

0000-0002-7090-0325
View Profile

,
Jia Liu

Southwest Jiaotong University, Chengdu, China

Southwest Jiaotong University, Chengdu, China

0000-0002-2910-3447
View Profile

ACM Transactions on Knowledge Discovery from Data Volume 17 Issue 7Article No.: 99pp 1–20https://doi.org/10.1145/3584862

Published:04 May 2023Publication History

ACM Transactions on Knowledge Discovery from Data

Abstract

Clustering is a popular research topic in the field of data mining, in which the clustering method based on non-negative matrix factorization (NMF) has been widely employed. However, in the update process of NMF, there is no learning rate to guide the update as well as the update depends on the data itself, which leads to slow convergence and low clustering accuracy. To solve these problems, a generalized deep learning clustering (GDLC) algorithm based on NMF is proposed in this article. Firstly, a nonlinear constrained NMF (NNMF) algorithm is constructed to achieve sequential updates of the elements in the matrix guided by the learning rate. Then, the gradient values corresponding to the element update are transformed into generalized weights and generalized biases, by inputting the elements as well as their corresponding generalized weights and generalized biases into the nonlinear activation function to construct the GDLC algorithm. In addition, for improving the understanding of the GDLC algorithm, its detailed inference procedure and algorithm design are provided. Finally, the experimental results on eight datasets show that the GDLC algorithm has efficient performance.

REFERENCES

[1] Balakrishnama Suresh and Ganapathiraju Aravind. 1998. Linear discriminant analysis-a brief tutorial. Institute for Signal and Information Processing 18, 1998 (1998), 1–8.Google Scholar
[2] Cai Deng, He Xiaofei, Han Jiawei, and Huang Thomas S.. 2010. Graph regularized nonnegative matrix factorization for data representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 33, 8 (2010), 1548–1560.Google Scholar
[3] Chen Wen-Sheng, Zeng Qianwen, and Pan Binbin. 2022. A survey of deep nonnegative matrix factorization. Neurocomputing 491 (2022), 305–320.Google ScholarDigital Library
[4] Deng Ping, Li Tianrui, Wang Hongjun, Horng Shi-Jinn, Yu Zeng, and Wang Xiaomin. 2021. Tri-regularized nonnegative matrix tri-factorization for co-clustering. Knowledge-Based Systems 226 (2021), 107101.Google ScholarCross Ref
[5] Deng Ping, Li Tianrui, Wang Hongjun, Wang Dexian, Horng Shi-Jinn, and Liu Rui. 2022. Graph regularized sparse non-negative matrix factorization for clustering. IEEE Transactions on Computational Social Systems (2022), 1–12.Google ScholarCross Ref
[6] Deng Ping, Zhang Fan, Li Tianrui, Wang Hongjun, and Horng Shi-Jinn. 2022. Biased unconstrained non-negative matrix factorization for clustering. Knowledge-Based Systems 239 (2022), 108040.Google ScholarDigital Library
[7] Ding Chris, Li Tao, Peng Wei, and Park Haesun. 2006. Orthogonal nonnegative matrix t-factorizations for clustering. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 126–135.Google ScholarDigital Library
[8] Guan Naiyang, Tao Dacheng, Luo Zhigang, and Yuan Bo. 2012. Online nonnegative matrix factorization with robust stochastic approximation. IEEE Transactions on Neural Networks and Learning Systems 23, 7 (2012), 1087–1099.Google ScholarCross Ref
[9] Guo Zhenxing and Zhang Shihua. 2019. Sparse deep nonnegative matrix factorization. Big Data Mining and Analytics 3, 1 (2019), 13–28.Google ScholarCross Ref
[10] Hebb Donald Olding. 2005. The Organization of Behavior: A Neuropsychological Theory. Psychology Press.Google ScholarCross Ref
[11] Hedjam Rachid, Abdesselam Abdelhamid, and Melgani Farid. 2021. NMF with feature relationship preservation penalty term for clustering problems. Pattern Recognition 112 (2021), 107814.Google ScholarCross Ref
[12] Huang Jin, Nie Feiping, Huang Heng, and Ding Chris. 2014. Robust manifold nonnegative matrix factorization. ACM Transactions on Knowledge Discovery from Data (TKDD) 8, 3 (2014), 1–21.Google ScholarDigital Library
[13] Huang Shudong, Xu Zenglin, Kang Zhao, and Ren Yazhou. 2020. Regularized nonnegative matrix factorization with adaptive local structure learning. Neurocomputing 382 (2020), 196–209.Google ScholarDigital Library
[14] Kahan W.. 2013. A tutorial overview of vector and matrix norms. University of California, Berkeley, CA, Lecture Notes (2013), 19.Google Scholar
[15] Roux Jonathan Le, Hershey John R., and Weninger Felix. 2015. Deep NMF for speech separation. In Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’15). IEEE, 66–70.Google Scholar
[16] LeCun Yann, Bengio Yoshua, and Hinton Geoffrey. 2015. Deep learning. Nature 521, 7553 (2015), 436–444.Google ScholarCross Ref
[17] Lee Daniel D. and Seung H. Sebastian. 1999. Learning the parts of objects by non-negative matrix factorization. Nature 401, 6755 (1999), 788–791.Google ScholarCross Ref
[18] Lei Xiujuan, Tie Jiaojiao, and Fujita Hamido. 2020. Relational completion based non-negative matrix factorization for predicting metabolite-disease associations. Knowledge-Based Systems 204 (2020), 106238.Google ScholarCross Ref
[19] Li Bo, Zhou Guoxu, and Cichocki Andrzej. 2015. Two efficient algorithms for approximately orthogonal nonnegative matrix factorization. IEEE Signal Processing Letters 22, 7 (2015), 843–846.Google ScholarCross Ref
[20] Li Heng-Chao, Yang Gang, Yang Wen, Du Qian, and Emery William J.. 2020. Deep nonsmooth nonnegative matrix factorization network with semi-supervised learning for SAR image change detection. ISPRS Journal of Photogrammetry and Remote Sensing 160 (2020), 167–179.Google ScholarCross Ref
[21] Li Xuelong, Cui Guosheng, and Dong Yongsheng. 2017. Graph regularized non-negative low-rank matrix factorization for image clustering. IEEE Transactions on Cybernetics 47, 11 (2017), 3840–3853.Google ScholarCross Ref
[22] Meng Yang, Shang Ronghua, Jiao Licheng, Zhang Wenya, and Yang Shuyuan. 2018. Dual-graph regularized non-negative matrix factorization with sparse and orthogonal constraints. Engineering Applications of Artificial Intelligence 69 (2018), 24–35.Google ScholarCross Ref
[23] Nie Feiping, Wang Cheng-Long, and Li Xuelong. 2019. K-multiple-means: A multiple-means clustering method with specified k clusters. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 959–967.Google ScholarDigital Library
[24] Peng Siyuan, Ser Wee, Chen Badong, and Lin Zhiping. 2021. Robust semi-supervised nonnegative matrix factorization for image clustering. Pattern Recognition 111 (2021), 107683.Google ScholarCross Ref
[25] Robbins Herbert and Siegmund David. 1971. A convergence theorem for non negative almost supermartingales and some applications. In Proceedings of the Optimizing Methods in Statistics. Elsevier, 233–257.Google Scholar
[26] Rosenblatt Frank. 1958. The perceptron: A probabilistic model for information storage and organization in the brain. Psychological Review 65, 6 (1958), 386–408.Google ScholarCross Ref
[27] Rumelhart David E., Hinton Geoffrey E., and Williams Ronald J.. 1986. Learning representations by back-propagating errors. Nature 323, 6088 (1986), 533–536.Google ScholarCross Ref
[28] Saxena Amit, Prasad Mukesh, Gupta Akshansh, Bharill Neha, Patel Om Prakash, Tiwari Aruna, Er Meng Joo, Ding Weiping, and Lin Chin-Teng. 2017. A review of clustering techniques and developments. Neurocomputing 267 (2017), 664–681.Google ScholarDigital Library
[29] Shang Fanhua, Jiao Licheng, and Wang Fei. 2012. Graph dual regularization non-negative matrix factorization for co-clustering. Pattern Recognition 45, 6 (2012), 2237–2250.Google ScholarDigital Library
[30] Shang Ronghua, Wang Wenbing, Stolkin Rustam, and Jiao Licheng. 2017. Non-negative spectral learning and sparse regression-based dual-graph regularized feature selection. IEEE Transactions on Cybernetics 48, 2 (2017), 793–806.Google ScholarCross Ref
[31] Sun Gan, Cong Yang, Zhang Yulun, Zhao Guoshuai, and Fu Yun. 2021. Continual multiview task learning via deep matrix factorization. IEEE Transactions on Neural Networks and Learning Systems 32, 1 (2021), 139–150.Google ScholarCross Ref
[32] Sun Jing, Wang Zhihui, Sun Fuming, and Li Haojie. 2018. Sparse dual graph-regularized NMF for image co-clustering. Neurocomputing 316 (2018), 156–165.Google ScholarCross Ref
[33] Vidal Rene, Ma Yi, and Sastry Shankar. 2005. Generalized principal component analysis (GPCA). IEEE Transactions on Pattern Analysis and Machine Intelligence 27, 12 (2005), 1945–1959.Google ScholarDigital Library
[34] Wen Jinhuan, Fowler James E., He Mingyi, Zhao Yong-Qiang, Deng Chengzhi, and Menon Vineetha. 2016. Orthogonal nonnegative matrix factorization combining multiple features for spectral–spatial dimensionality reduction of hyperspectral imagery. IEEE Transactions on Geoscience and Remote Sensing 54, 7 (2016), 4272–4286.Google ScholarCross Ref
[35] Wold Svante, Esbensen Kim, and Geladi Paul. 1987. Principal component analysis. Chemometrics and Intelligent Laboratory Systems 2, 1-3 (1987), 37–52.Google ScholarCross Ref
[36] Xu Rui and Wunsch Donald. 2005. Survey of clustering algorithms. IEEE Transactions on Neural Networks 16, 3 (2005), 645–678.Google ScholarDigital Library
[37] Yang Mingming and Xu Songhua. 2021. Orthogonal nonnegative matrix factorization using a novel deep autoencoder network. Knowledge-Based Systems 227 (2021), 107236.Google ScholarDigital Library
[38] Yang Zuyuan, Liang Naiyao, Yan Wei, Li Zhenni, and Xie Shengli. 2021. Uniform distribution non-negative matrix factorization for multiview clustering. IEEE Transactions on Cybernetics 51, 6 (2021), 3249–3262.Google ScholarCross Ref
[39] Ye Fanghua, Chen Chuan, and Zheng Zibin. 2018. Deep autoencoder-like nonnegative matrix factorization for community detection. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 1393–1402.Google ScholarDigital Library
[40] Ye Jun and Jin Zhong. 2014. Dual-graph regularized concept factorization for clustering. Neurocomputing 138 (2014), 120–130.Google ScholarCross Ref
[41] Yi Yugen, Chen Yuqi, Wang Jianzhong, Lei Gang, Dai Jiangyan, and Zhang Huihui. 2020. Joint feature representation and classification via adaptive graph semi-supervised nonnegative matrix factorization. Signal Processing: Image Communication 89 (2020), 115984.Google ScholarCross Ref
[42] Zhao Renbo and Tan Vincent Y. F.. 2016. Online nonnegative matrix factorization with outliers. IEEE Transactions on Signal Processing 65, 3 (2016), 555–570.Google ScholarDigital Library
[43] Zhao Yang, Wang Huiyang, and Pei Jihong. 2019. Deep non-negative matrix factorization architecture based on underlying basis images learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 6 (2019), 1897–1913.Google ScholarCross Ref

Index Terms

A Generalized Deep Learning Clustering Algorithm Based on Non-Negative Matrix Factorization
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Learning latent representations

Recommendations

Document Clustering Based on Spectral Clustering and Non-negative Matrix Factorization
IEA/AIE '08: Proceedings of the 21st international conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: New Frontiers in Applied Artificial Intelligence

In this paper, we propose a novel non-negative matrix factorization (NMF) to the affinity matrix for document clustering, which enforces non-negativity and orthogonality constraints simultaneously. With the help of orthogonality constraints, this NMF ...
Read More
Similarity-based clustering by left-stochastic matrix factorization

For similarity-based clustering, we propose modeling the entries of a given similarity matrix as the inner products of the unknown cluster probabilities. To estimate the cluster probabilities from the given similarity matrix, we introduce a left-...
Read More
New SVD based initialization strategy for non-negative matrix factorization

We give a new method to determine the rank of the factorization for NMF algorithms.We propose a novel method SVD-NMF to enhance initialization for NMF.The compute process is cheap.Numerical results show that SVD-NMF convergent faster than NNDSVD and ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Knowledge Discovery from Data Volume 17, Issue 7
August 2023
319 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3589018
Editor:
Charu Aggarwal
IBM T. J. Watson Research, USA
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 4 May 2023
- Online AM: 20 February 2023
- Accepted: 13 February 2023
- Revised: 1 December 2022
- Received: 31 May 2022
Published in tkdd Volume 17, Issue 7

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Deep learning
clustering
non-negative matrix factorization
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 476
  Total Downloads
- Downloads (Last 12 months)360
- Downloads (Last 6 weeks)38
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

View Full Text

HTML Format

View this article in HTML Format .

View HTML Format

A Generalized Deep Learning Clustering Algorithm Based on Non-Negative Matrix Factorization

ACM Transactions on Knowledge Discovery from Data

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

Document Clustering Based on Spectral Clustering and Non-negative Matrix Factorization

Similarity-based clustering by left-stochastic matrix factorization

New SVD based initialization strategy for non-negative matrix factorization