Skip to main content

Low-Rank Matrix Factorization and Co-clustering Algorithms for Analyzing Large Data Sets

  • Conference paper
Data Engineering and Management (ICDEM 2010)

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 6411))

Included in the following conference series:

Abstract

With the ever increasing data, there is a greater need for analyzing and extracting useful and meaningful information out of it. The amount of research being conducted in extracting this information is commendable. From clustering to bi and multi clustering, there are a lot of different algorithms proposed to analyze and discover the hidden patterns in data, in every which way possible. On the other hand, the size of the data sets is increasing with each passing day and hence it is becoming increasingly difficult to try and analyze all this data and find clusters in them without the algorithms being computationally prohibitive. In this study, we have tried to study both the domains and understand the development of the algorithms and how they are being used. We have compared the different algorithms to try and get a better idea of which algorithm is more suited for a particular situation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berry, M.W., Stewart, G.W., Pulatova, S.A.: Algorithm 844: Computing sparse reduced-rank approximations to sparse matrices. ACM Transactions on Mathermatical Software 31 (2005)

    Google Scholar 

  2. Dhillon, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Francisco (2001)

    Google Scholar 

  3. Drineas, P., Kannan, R., Mahoney, M.W.: Fast monte carlo algorithms for matrices iii: Computing a compressed approximate matrix decomposition. Society for Industrial and Applied Mathematics (SIAM) 36, 184–206 (2006)

    MATH  Google Scholar 

  4. Gu, Q., Zhou, J.: Co-clustering of manifolds. In: 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ICDM), Paris (2009)

    Google Scholar 

  5. Long, B., Zhang, Z., Yu, P.S.: Co-clustering by block value decomposition. In: 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), Chicago (2005)

    Google Scholar 

  6. Pan, F., Zhang, X., Wang, W.: Crd: Fast co-clustering of large datasets utilizing sampling-based matrix decomposition. In: ACM SIGMOD/PODS Conference, Vancouver (2008)

    Google Scholar 

  7. Rege, M., Dong, M., Fotouhi, F.: Co-clustering documents and words using bipartite isoperimetric graph partitioning. In: 6th IEEE International Conference on Data Mining (ICDM), Hong Kong (2006)

    Google Scholar 

  8. Sun, J., Xie, Y., Zhang, H., Faloutsos, C.: Less is more: Complex matrix decomposition for large sparse graphs. In: 7th SIAM International Conference on Data Mining (ICDM), Minneapolis (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Donavalli, A., Rege, M., Liu, X., Jafari-Khouzani, K. (2012). Low-Rank Matrix Factorization and Co-clustering Algorithms for Analyzing Large Data Sets. In: Kannan, R., Andres, F. (eds) Data Engineering and Management. ICDEM 2010. Lecture Notes in Computer Science, vol 6411. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27872-3_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-27872-3_41

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-27871-6

  • Online ISBN: 978-3-642-27872-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics