Skip to main content

Document Clustering Based on Nonnegative Sparse Matrix Factorization

  • Conference paper
Advances in Natural Computation (ICNC 2005)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3611))

Included in the following conference series:

Abstract

A novel algorithm of document clustering based on non-negative sparse analysis is proposed. In contrast to the algorithm based on non-negative matrix factorization, our algorithm can obtain documents topics exactly by controlling the sparseness of the topic matrix and the encoding matrix explicitly. Thus, the clustering accuracy has been improved greatly. In the end, simulation results are employed to further illustrate the accuracy and efficiency of this algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 119.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baker, L., McCallum, A.: Distributional clustering of words for text classification. In: Proceedings of ACM SIGIR, Melbourne, Australia, pp. 96–103 (1998)

    Google Scholar 

  2. Ding, C., He, X., Zha, H., Gu, M., Simon, H.D.: A min-max cut algorithm for graph partitioning and data clustering. In: Proceedings of IEEE ICDM, pp. 107–114 (2001)

    Google Scholar 

  3. Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)

    Article  Google Scholar 

  4. Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems, vol. 13, pp. 56–562 (2001)

    Google Scholar 

  5. Hoyer, P.O.: Nonnegative matrix factorization with sparseness constraints. Journal of Machine Learning Research 5, 1457–1469 (2004)

    MathSciNet  Google Scholar 

  6. Hoyer, P.O.: Modeling receptive fields with non-negative sparse coding. Neurocomputing 52-54, 547–552 (2003)

    Article  Google Scholar 

  7. Hoyer, P.O.: Non-negative sparse coding. In: Proc. IEEE Workshop on Neural Networks for Signal Processing, Martigny, Switzerland, pp. 557–565 (2002)

    Google Scholar 

  8. Willett, P.: Document clustering using an inverted file approach. Journal of Information Science 2, 223–231 (1990)

    Article  Google Scholar 

  9. Wei, X., Xin, L., Yihong, G.: Document Clustering Based On Non-negative Matrix Factorization. In: Proceedings of ACM SIGIR, Toronto, Canada, pp. 267–273 (2003)

    Google Scholar 

  10. Zha, H., Ding, C., Gu, M., He, X., Simon, H.: Spectral relaxation for k-means clustering. In: Advances in Neural Information Processing Systems, vol. 14, 1057–1064 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yang, C.F., Ye, M., Zhao, J. (2005). Document Clustering Based on Nonnegative Sparse Matrix Factorization. In: Wang, L., Chen, K., Ong, Y.S. (eds) Advances in Natural Computation. ICNC 2005. Lecture Notes in Computer Science, vol 3611. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539117_80

Download citation

  • DOI: https://doi.org/10.1007/11539117_80

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28325-6

  • Online ISBN: 978-3-540-31858-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics