Abstract
A novel algorithm of document clustering based on non-negative sparse analysis is proposed. In contrast to the algorithm based on non-negative matrix factorization, our algorithm can obtain documents topics exactly by controlling the sparseness of the topic matrix and the encoding matrix explicitly. Thus, the clustering accuracy has been improved greatly. In the end, simulation results are employed to further illustrate the accuracy and efficiency of this algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baker, L., McCallum, A.: Distributional clustering of words for text classification. In: Proceedings of ACM SIGIR, Melbourne, Australia, pp. 96–103 (1998)
Ding, C., He, X., Zha, H., Gu, M., Simon, H.D.: A min-max cut algorithm for graph partitioning and data clustering. In: Proceedings of IEEE ICDM, pp. 107–114 (2001)
Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)
Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems, vol. 13, pp. 56–562 (2001)
Hoyer, P.O.: Nonnegative matrix factorization with sparseness constraints. Journal of Machine Learning Research 5, 1457–1469 (2004)
Hoyer, P.O.: Modeling receptive fields with non-negative sparse coding. Neurocomputing 52-54, 547–552 (2003)
Hoyer, P.O.: Non-negative sparse coding. In: Proc. IEEE Workshop on Neural Networks for Signal Processing, Martigny, Switzerland, pp. 557–565 (2002)
Willett, P.: Document clustering using an inverted file approach. Journal of Information Science 2, 223–231 (1990)
Wei, X., Xin, L., Yihong, G.: Document Clustering Based On Non-negative Matrix Factorization. In: Proceedings of ACM SIGIR, Toronto, Canada, pp. 267–273 (2003)
Zha, H., Ding, C., Gu, M., He, X., Simon, H.: Spectral relaxation for k-means clustering. In: Advances in Neural Information Processing Systems, vol. 14, 1057–1064 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, C.F., Ye, M., Zhao, J. (2005). Document Clustering Based on Nonnegative Sparse Matrix Factorization. In: Wang, L., Chen, K., Ong, Y.S. (eds) Advances in Natural Computation. ICNC 2005. Lecture Notes in Computer Science, vol 3611. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539117_80
Download citation
DOI: https://doi.org/10.1007/11539117_80
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28325-6
Online ISBN: 978-3-540-31858-3
eBook Packages: Computer ScienceComputer Science (R0)