An algorithm of nonnegative matrix factorization under structure constraints for image clustering

Jia, Mengxue; Li, Xiangli; Zhang, Ying

doi:10.1007/s00521-022-08136-x

An algorithm of nonnegative matrix factorization under structure constraints for image clustering

Original Article
Published: 20 December 2022

Volume 35, pages 7891–7907, (2023)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

299 Accesses
1 Altmetric
Explore all metrics

Abstract

Nonnegative matrix factorization (NMF) is a crucial method for image clustering. However, NMF may obtain low accurate clustering results because the factorization results contain no data structure information. In this paper, we propose an algorithm of nonnegative matrix factorization under structure constraints (SNMF). The factorization results of SNMF could maintain data global and local structure information simultaneously. In SNMF, the global structure information is captured by the cosine measure under the $\ell _2$ norm constraints. Meanwhile, $\ell _2$ norm constraints are utilized to get more discriminant data representations. A graph regularization term is employed to maintain the local structure. Effective updating rules are given in this paper. Moreover, the effects of different normalizations on similarities are investigated through experiments. On real datasets, the numerical results confirm the effectiveness of the SNMF.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Non-negative matrix factorization via adaptive sparse graph regularization

Article 12 January 2021

Weighted non-negative matrix factorization based on adaptive robust local sparse graph

Article 09 May 2023

Non-negative Matrix Factorization with Symmetric Manifold Regularization

Article 30 August 2019

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

Artificial Intelligence

Data availability

All datasets analyzed in this study are available in the homepage of Deng Cai (http://www.cad.zju.edu.cn/home/dengcai/).

Notes

References

LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444. https://doi.org/10.1038/nature14539
Article Google Scholar
Guan Y, Fang J, Wu X (2020) Multi-pose face recognition using cascade alignment network and incremental clustering. Signal Image Video Process 1:1–9
Google Scholar
Ren Y, Kamath U, Domeniconi C, Xu Z (2019) Parallel boosted clustering. Neurocomputing 351:87–100
Article Google Scholar
Xie P, Xing EP (2015) Integrating image clustering and codebook learning. In: AAAI, pp 1903–1909
Chang J, Chen Y, Qi L, Yan H (2020) Hypergraph clustering using a new laplacian tensor with applications in image processing. SIAM J Imag Sci 13(3):1157–1178
Article MathSciNet MATH Google Scholar
Song K, Yao X, Nie F, Li X, Xu M (2021) Weighted bilateral k-means algorithm for fast co-clustering and fast spectral clustering. Pattern Recognit 109:107560
Article Google Scholar
Ren Y, Wang N, Li M, Xu Z (2020) Deep density-based image clustering. Knowl-Based Syst 1:105841
Article Google Scholar
Kumar N, Uppala P, Duddu K, Sreedhar H, Varma V, Guzman G, Walsh M, Sethi A (2018) Hyperspectral tissue image segmentation using semi-supervised NMF and hierarchical clustering. IEEE Trans Med Imaging 38(5):1304–1313
Article Google Scholar
Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720
Article Google Scholar
Ji S, Ye J (2008) Generalized linear discriminant analysis: a unified framework and efficient model selection. IEEE Trans Neural Networks 19(10):1768–1782
Article Google Scholar
Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401(6755):788
Article MATH Google Scholar
Cai D, He X, Han J, Huang TS (2010) Graph regularized nonnegative matrix factorization for data representation. IEEE Trans Pattern Anal Mach Intell 33(8):1548–1560
Google Scholar
Shang F, Jiao L, Wang F (2012) Graph dual regularization non-negative matrix factorization for co-clustering. Pattern Recogn 45(6):2237–2250
Article MATH Google Scholar
Ding CH, Li T, Jordan MI (2008) Convex and semi-nonnegative matrix factorizations. IEEE Trans Pattern Anal Mach Intell 32(1):45–55
Article Google Scholar
Hu W, Choi K-S, Wang P, Jiang Y, Wang S (2015) Convex nonnegative matrix factorization with manifold regularization. Neural Netw 63:94–103
Article MATH Google Scholar
Cui G, Li X, Dong Y (2018) Subspace clustering guided convex nonnegative matrix factorization. Neurocomputing 292:38–48
Article Google Scholar
Kong D, Ding C, Huang H (2011) Robust nonnegative matrix factorization using l21-norm. In: Proceedings of the 20th ACM international conference on information and knowledge management, pp 673–682
Li Z, Tang J, He X (2017) Robust structured nonnegative matrix factorization for image representation. IEEE Trans Neural Netw Learn Syst 29(5):1947–1960
Article MathSciNet Google Scholar
Zhang Z, Liao S, Zhang H, Wang S, Hua C (2018) Improvements in sparse non-negative matrix factorization for hyperspectral unmixing algorithms. J Appl Remote Sens 12(4):045015
Article Google Scholar
Xing L, Dong H, Jiang W, Tang K (2018) Nonnegative matrix factorization by joint locality-constrained and l2, 1-norm regularization. Multimed Tools Appl 77(3):3029–3048
Article Google Scholar
Babaee M, Tsoukalas S, Babaee M, Rigoll G, Datcu M (2016) Discriminative nonnegative matrix factorization for dimensionality reduction. Neurocomputing 173:212–223
Article Google Scholar
Liu H, Wu Z, Li X, Cai D, Huang TS (2011) Constrained nonnegative matrix factorization for image representation. IEEE Trans Pattern Anal Mach Intell 34(7):1299–1311
Article Google Scholar
Fei W, Tao L, Changshui Z (2008) Semi-supervised clustering via matrix factorization. In: Proceedings of 2008 SIAM International Conference on Data Mining (SDM 2008), pp 1–12
Yang Y-J, Hu B-G (2007) Pairwise constraints-guided non-negative matrix factorization for document clustering. In: IEEE/WIC/ACM International Conference on Web Intelligence (WI’07). IEEE, pp 250–256
Yang Z, Hu Y, Liang N, Lv J (2019) Nonnegative matrix factorization with fixed l2-norm constraint. Circuits Syst Signal Process 38(7):3211–3226
Article Google Scholar
Ahmed I, Hu XB, Acharya MP, Ding Y (2021) Neighborhood structure assisted non-negative matrix factorization and its application in unsupervised point-wise anomaly detection. J Mach Learn Res 22(34):1–32
MathSciNet MATH Google Scholar
Kuang D, Ding C, Park H (2012) Symmetric Nonnegative Matrix Factorization for Graph Clustering, pp 106–117. https://doi.org/10.1137/1.9781611972825.10
Samaria FS, Harter AC (1994) Parameterisation of a stochastic model for human face identification. In: Proceedings of 1994 IEEE workshop on applications of computer vision, pp 138–142. https://doi.org/10.1109/ACV.1994.341300
Hedjam R, Abdesselam A, Melgani F (2021) NMF with feature relationship preservation penalty term for clustering problems. Pattern Recogn 112:107814
Article Google Scholar
Cai D, He X, Han J (2005) Document clustering using locality preserving indexing. IEEE Trans Knowl Data Eng 17(12):1624–1637
Article Google Scholar
Wang Y, Chen L, Mei J-P (2014) Stochastic gradient descent based fuzzy clustering for large data. In: 2014 IEEE international conference on fuzzy systems (FUZZ-IEEE). IEEE, pp 2511–2518
Hubert L, Arabie P (1985) Comparing partitions. J Classif 2(1):193–218
Article MATH Google Scholar
Goldfarb D, Wen Z, Yin W (2009) A curvilinear search method for p-harmonic flows on spheres. SIAM J Imag Sci 2(1):84–109. https://doi.org/10.1137/080726926
Article MathSciNet MATH Google Scholar
Vese LA, Osher SJ (2002) Numerical methods for p-harmonic flows and applications to image processing. SIAM J Numer Anal 40(6):2085–2104. https://doi.org/10.1137/S0036142901396715
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (11961010, 61967004).

Author information

Authors and Affiliations

School of Mathematics and Computing Science, Guilin University of Electronic Technology, Guilin, 541004, Guangxi, China
Mengxue Jia, Xiangli Li & Ying Zhang
School of Mathematics and Statistics, Xidian university, Xi’an, 710126, Shaanxi, China
Mengxue Jia & Ying Zhang
Guangxi Colleges and Universities Key Laboratory of Data Analysisand Computation, Guilin University of Electronic Technology, Guilin, 541004, Guangxi, China
Xiangli Li
Center for Applied Mathematics of Guangxi (GUET), Guilin, 541004, Guangxi, China
Xiangli Li

Authors

Mengxue Jia
View author publications
You can also search for this author inPubMed Google Scholar
Xiangli Li
View author publications
You can also search for this author inPubMed Google Scholar
Ying Zhang
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Xiangli Li.

Ethics declarations

Conflict of interest

All authors disclosed no relevant relationships.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Because (18) and (19) are two gradient descent methods, (16) is non-increasing. Here is the proof.

Denote the objective function of (16) as F. The partial derivative of $U_{mk}$ in F is

$$\begin{aligned} \frac{\partial F}{\partial U_{mk}} = (-X^NV+UV^TV)_{mk}. \end{aligned}$$

(25)

The formulation of updating $U_{mk}$ through the gradient descent method is the following:

$$\begin{aligned} U_{mk}=U_{mk}+\tau _u(X^NV-UV^TV)_{mk}. \end{aligned}$$

(26)

When $\tau _u={U_{mk}}/{(UV^TV)_{mk}}$, the nonnegative constraints on U hold, and (26) is (18).

For V, the updating rule (19) is also a gradient descent method. The proof is similar to [25].

Let $x_r$ and h represent the ith row of V and $\frac{\partial F}{\partial x_r}$, respectively. Denote $F_1(x_r)$ as the related part of $x_r$ in (16). When omitting the nonnegative constraints, the Lagrange function of $x_r$ is the following:

$$\begin{aligned} L(x_r,\lambda )=F_1(x_r)+\frac{\lambda }{2}(x_rx_r^T-1). \end{aligned}$$

(27)

Denote

$$\begin{aligned} x=x_r^T, a=h^T,c_1=l_1^T,c_2=l_2^T. \end{aligned}$$

Rewrite (27) as follows:

$$\begin{aligned} L(x,\lambda )=F_1(x^T)+\frac{\lambda }{2}(x^Tx-1). \end{aligned}$$

(28)

When$\nabla _xL(x,\lambda )=0$, we have

$$\begin{aligned} a-x\lambda =0. \end{aligned}$$

Through the constraint $x^Tx=1$, we obtain

$$\begin{aligned} \lambda =x^Ta=a^Tx. \end{aligned}$$

Thus, rewrite $\nabla _xL(x,\lambda )$ as follows:

$$\begin{aligned} \begin{aligned} \nabla _xL(x,\lambda )&=a-x\lambda \\ \quad&=a-x^Tax \\ \quad&=ax^Tx-xa^Tx \\ \quad&=(ax^T-x^Ta)x \end{aligned} \end{aligned}$$

(29)

Let A represents $ax^T-xa^T$. Therefore, A is a skew-symmetric matrix. Since Ax is the gradient of (28) the updating rule in the gradient descent method of x should be the following:

$$\begin{aligned} y=x-\tau _vAx. \end{aligned}$$

However, it is difficult to satisfy the constraint $y^Ty=1$. From [33, 34], a modified method (30) is used.

$$\begin{aligned} y(\tau )=x-\tau A\left( \frac{x+y(\tau )}{2}\right) . \end{aligned}$$

(30)

(30) could satisfy that $y^Ty=x^Tx=1$ for any skew-symmetric matrix A and $\tau$.

From Lemma 1 (2) in [25], (30) could be expressed as:

$$\begin{aligned} y(\tau )=\left( I+\frac{\tau }{2}A\right) ^{-1}\left( I-\frac{\tau }{2}A\right) x. \end{aligned}$$

(31)

Then from Lemma 2 in [25], (31) could be rewritten as

$$\begin{aligned} y(\tau )=x-\beta _1(\tau )a-\beta _2(\tau )x, \end{aligned}$$

(32)

where

$$\begin{aligned} & \beta _1(\tau )=\tau \frac{x^Tx-\frac{\tau }{2}\left( (a^Tx)(x^Tx)-(x^Ta)(x^Tx)\right) }{1-\left( \frac{\tau }{2}\right) ^2(a^Tx)^2 +\left( \frac{\tau }{2}\right) ^2\Vert a\Vert ^2\Vert x\Vert ^2},\\ & \beta _2(\tau )=-\tau \frac{x^Ta-\frac{\tau }{2}\left( (a^Tx)(x^Ta)-(a^Ta)(x^Tx)\right) }{1-\left( \frac{\tau }{2}\right) ^2(a^Tx)^2 +\left( \frac{\tau }{2}\right) ^2\Vert a\Vert ^2\Vert x\Vert ^2}. \end{aligned}$$

The nonnegative constraints on y should be handled next.

Note $q=1-(\frac{\tau }{2})^2(a^Tx)^2+(\frac{\tau }{2})^2\Vert a\Vert ^2\Vert x\Vert ^2$. Rewrite (32) as follows:

$$\begin{aligned} y(\tau )=x-\frac{\tau }{q}(\beta _1^{'}(\tau )(c_1-c_2)+\beta _2^{'}x), \end{aligned}$$

(33)

where

$$\begin{aligned} & \beta _1^{'}(\tau )=x^Tx-\frac{\tau }{2}((a^Tx)(x^Tx)-(x^Ta)(x^Tx)),\\ & \beta _2^{'}(\tau )=x^Ta-\frac{\tau }{2}((a^Tx)(x^Ta)-(a^Ta)(x^Tx)). \end{aligned}$$

Expanding $\beta _1^{'}(\tau )(c_1-c_2)$ and $\beta _2^{'}x$ in (33) through auxiliary variables (6), we get

$$\begin{aligned} \begin{aligned} y(\tau )&=x-\frac{\tau }{q}\left( x^Txc_1+\frac{\tau }{2}T1+x^Tc_2x+\frac{\tau }{2}P1\right) \\ \quad&\quad +\frac{\tau }{q}\left( x^Txc_2+\frac{\tau }{2}T2+x^Tc_1x+\frac{\tau }{2}P2\right) . \end{aligned} \end{aligned}$$

(34)

Because $y(\tau )$ has nonnegative constraints, the $\tau$ should satisfy

$$\begin{aligned} x-\frac{\tau }{q}\left( x^Txc_1+\frac{\tau }{2}T1+x^Tc_2x+\frac{\tau }{2}P1\right) \ge 0. \end{aligned}$$

(35)

Denote $(a^Tx)^2-\Vert a\Vert ^2\Vert x\Vert ^2$ as M. Then $q=1-(\frac{\tau }{2})^2M$. Therefore, we obtain

$$\begin{aligned} & qx-\tau \left( x^Txc_1+\frac{\tau }{2}T1+x^Tc_2x+\frac{\tau }{2}P1\right) \ge 0\\ & \qquad \Rightarrow (2T1+2P1+Mx)\tau ^2+4(x^Txc_1+x^Tc_2x)\tau -4x\le 0. \end{aligned}$$

Utilizing the auxiliary variables (6), a series of equations are obtained as follows:

$$\begin{aligned} B_i\tau ^2+C_i\tau +D_i\le 0,\quad i=1,2,\ldots ,k. \end{aligned}$$

(36)

(6) confirms that $C_i\ge 0$ and $D_i\le 0$. Thus, when $\tau$ satisfies (36) and $\tau >0$, there exist three situations.

1.
when $B_i>0$,
$$\begin{aligned} \tau =\frac{\sqrt{C_i^2-4B_iD_i}-C_i}{2B_i}. \end{aligned}$$
2.
when $B_i=0$,
$$\begin{aligned} \tau =-\frac{D_i}{C_i}. \end{aligned}$$
3.
when $B_i<0$,
$$\begin{aligned} \tau =\frac{\sqrt{C_i^2-4B_iD_i}-C_i}{2B_i}. \end{aligned}$$

Thus, we obtain

$$\begin{aligned} \begin{aligned} \tau&=min \{\frac{\sqrt{C_i^2-4B_iD_i}-C_i}{2B_i},\quad when\; B_i\ne 0; \\ &\quad -\frac{D_i}{C_i},\; when\; B_i=0\},\; i=1,2,\ldots ,k. \\ \end{aligned} \end{aligned}$$

(37)

(37) guarantees (35) hold. Thus, the updating rule (19) guarantees all constraints hold.

Now it has been proved that (18) and (19) are gradient descent methods for (16). Therefore, (16) is non-increasing under (18) and (19).

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Jia, M., Li, X. & Zhang, Y. An algorithm of nonnegative matrix factorization under structure constraints for image clustering. Neural Comput & Applic 35, 7891–7907 (2023). https://doi.org/10.1007/s00521-022-08136-x

Download citation

Received: 06 April 2022
Accepted: 29 November 2022
Published: 20 December 2022
Issue Date: April 2023
DOI: https://doi.org/10.1007/s00521-022-08136-x

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An algorithm of nonnegative matrix factorization under structure constraints for image clustering

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Non-negative matrix factorization via adaptive sparse graph regularization

Weighted non-negative matrix factorization based on adaptive robust local sparse graph

Non-negative Matrix Factorization with Symmetric Manifold Regularization

Explore related subjects

Data availability

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now