Data Clustering by Scaled Adjacency Matrix

Yu, Jian; Jia, Caiyan

doi:10.1007/978-3-642-25975-3_18

Jian Yu²¹ &
Caiyan Jia²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7091))

Included in the following conference series:

International Conference on Knowledge Science, Engineering and Management

1621 Accesses

Abstract

Similarity based clustering, which is to find the extrinsic clusters in data by taking as input a collection of real-valued similarities between data points, has been playing an important role in data analysis and engineering. Lots of work had been done in this field. However, data clustering is an rather challenge problem as there is no labeled data available. We observe that an ideal similarity matrix should be close to an adjacency matrix up to a scale. Based on this idea, we develop a scaled adjacency matrix (SAM) clustering algorithm that could find an optimal adjacency matrix in some sense for a given similarity matrix. Based on the learnt adjacency matrix, clustering could be performed straightforwardly. Upon three assumptions on the similarity matrix, we prove that the performance of SAM is robust. Experimental results also show that SAM is effective.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Shi, J.B., Malik, J.: Normalized cuts and image segmentation. IEEE Transaction on Pattern Analysis and Machine Intelligence 22(8), 888–905 (2000)
Article Google Scholar
Hagen, L., Kahng, A.B.: New spectral methods for ratio cut partitioning and clustering. IEEE Transactions Computer Aided Design 11(9), 1074–1085 (1992)
Article Google Scholar
Ding, C.H.Q., He, H.F., Zha, H.Y., Gu, M., Simon, H.D.: A min-max cut algorithm for graph partitioning and data clustering. In: Proceedings of IEEE International Conference on Data Mining, pp. 107–114 (2001)
Google Scholar
Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: Analysis and algorithm. In: Proceedings of NIPS, vol. 14, pp. 897–856 (2002)
Google Scholar
Zelnik-Manor, L., Perona, P.: Self-Tuning Spectral Clustering. In: Proceedings of NIPS, vol. 17, pp. 1601–1608 (2004)
Google Scholar
Li, X.B., Tian, Z.: Optimum cut-based clustering. Signal Processing 87(11), 2491–2502 (2007)
Article MATH Google Scholar
Chang, H., Yeung, D.Y.: obust Path-Based Spectral Clustering with Application to Image Segmentation. In: Proceedings of ICCV, pp. 278–285 (2005)
Google Scholar
Qiu, H.J., Hancock, E.R.: Clustering and Embedding Using Commute Times. IEEE Trans. Pattern Anal. Mach. Intell. 29(11), 1873–1890 (2007)
Article Google Scholar
Gower, J., Ross, G.: Minimum spanning trees and single linkage cluster analysis. Applied Statistics 18, 54–64 (1969)
Article MathSciNet Google Scholar
Wu, Z.Y., Leahy, R.: An optimal graph theoretic approach to data clustering: Theory and its application to image segmentation. IEEE Transaction on Pattern Analysis and Machine Intelligence 15(11), 1101–1113 (1993)
Article Google Scholar
Nene, S.K.N.S.A., Murase, H.: Columibia object image library (coil-20). Techinical Report CUCS-005-96 (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Beijing Jiaotong University, Beijing, 100044, P.R. China
Jian Yu & Caiyan Jia

Authors

Jian Yu
View author publications
You can also search for this author in PubMed Google Scholar
Caiyan Jia
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Management Science and Information Systems Department, Rutgers, the State University of New Jersey, 1, Washington Park, 07102, Newark, NJ, USA
Hui Xiong
Department of Industrial and Systems Engineering, The Hong Kong Polytechnic University, Hong Kong, China
W. B. Lee

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yu, J., Jia, C. (2011). Data Clustering by Scaled Adjacency Matrix. In: Xiong, H., Lee, W.B. (eds) Knowledge Science, Engineering and Management. KSEM 2011. Lecture Notes in Computer Science(), vol 7091. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25975-3_18

Download citation

DOI: https://doi.org/10.1007/978-3-642-25975-3_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25974-6
Online ISBN: 978-3-642-25975-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics