A Limited Incremental Clustering Algorithm with Respect to Cluster Stability

Zhu, Wenhao; Yao, Wenxin; Dai, Song; Lu, Zhiguo

doi:10.1007/978-3-319-32557-6_18

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 9576)

Included in the following conference series:

International Conference on High Performance Computing and Applications

Abstract

As one of the important techniques of data analysis, clustering is widely used in data mining, image recognition, information extraction, pattern recognition and other fields. In the era of big data, with the rapid development of web applications, much of the data to be processed is massive and grows dynamically. Against this background, clustering incremental data has become a challenging problem for clustering algorithms. In this paper, we propose a limited incremental clustering algorithm with respect to cluster stability. Based on the assumption that the categories of data are limited, we take advantage of an existing clustering structure and process incremental data steadily with respect to cluster stability. Cluster reconstruction is triggered when stability no longer holds or when a buffer pool for undetermined data is full. At the end of the paper, we implement the limited incremental clustering algorithm with K-means, and we use the average density of clusters and the global stability to choose a proper value of K.
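
The abstract does not define how stability is measured or how the buffer pool is managed, so the following is only a minimal sketch of the control flow it describes, not the authors' implementation. It assumes, purely for illustration, that a point is accepted by its nearest cluster when it lies within a fixed `radius`, that stability means no centroid has moved more than `max_drift` since the last reconstruction, and that reconstruction is a plain K-means run over all stored points. The names `LimitedIncrementalClusterer`, `radius`, `max_drift` and `buffer_size` are hypothetical.

```python
import math
import random


def dist(a, b):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean(points):
    """Coordinate-wise mean of a non-empty list of tuples."""
    n = len(points)
    return tuple(sum(c) / n for c in zip(*points))


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's K-means with a seeded random start."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    clusters = [[] for _ in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            j = min(range(k), key=lambda i: dist(p, centroids[i]))
            clusters[j].append(p)
        # An empty cluster keeps its previous centroid.
        centroids = [mean(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids, clusters


class LimitedIncrementalClusterer:
    """Hypothetical sketch: absorb, buffer, or reconstruct."""

    def __init__(self, points, k, radius, max_drift, buffer_size):
        self.k = k
        self.radius = radius            # assumed acceptance distance
        self.max_drift = max_drift      # assumed stability bound
        self.buffer_size = buffer_size  # capacity of the buffer pool
        self.rebuilds = 0
        self._rebuild(list(points))

    def _rebuild(self, points):
        self.centroids, self.clusters = kmeans(points, self.k)
        self.anchors = list(self.centroids)  # centroids at last rebuild
        self.buffer = []

    def add(self, p):
        j = min(range(self.k), key=lambda i: dist(p, self.centroids[i]))
        if dist(p, self.centroids[j]) <= self.radius:
            # Close enough: reuse the existing clustering structure.
            self.clusters[j].append(p)
            self.centroids[j] = mean(self.clusters[j])
        else:
            # Undetermined for now: park the point in the buffer pool.
            self.buffer.append(p)
        drifted = any(dist(c, a) > self.max_drift
                      for c, a in zip(self.centroids, self.anchors))
        if drifted or len(self.buffer) >= self.buffer_size:
            everything = [q for c in self.clusters for q in c] + self.buffer
            self._rebuild(everything)
            self.rebuilds += 1
```

In this sketch a point near an existing centroid is absorbed without any reconstruction, while points far from every centroid accumulate in the buffer until it reaches `buffer_size`, at which point K-means is rerun over all stored points. Choosing K from the average cluster density and global stability, as the abstract mentions, is not modelled here because the abstract gives no formula for either quantity.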

Acknowledgements

This work is partially supported by the National Natural Science Foundation of China (No. 61303097) and the Ph.D. Programs Foundation of the Ministry of Education of China (No. 20123108120026).

Author information

Authors and Affiliations

School of Computer Engineering and Science, Shanghai University, Shanghai, China
Wenhao Zhu, Wenxin Yao & Song Dai
Shanghai University Library, Shanghai University, Shanghai, China
Zhiguo Lu

Corresponding author

Correspondence to Zhiguo Lu.

Editor information

Editors and Affiliations

School of Computer Engineering, Shanghai University, Shanghai, China
Jiang Xie
Chemical and Petroleum Engineering, University of Calgary, Calgary, Alberta, Canada
Zhangxin Chen
Mathematics Department, University of Wyoming, Laramie, Wyoming, USA
Craig C. Douglas
School of Computer Engineering, Shanghai University, Shanghai, China
Wu Zhang
Mathematics and Informatics, South China Agricultural University, Guangzhou, China
Yan Chen

About this paper

Cite this paper

Zhu, W., Yao, W., Dai, S., Lu, Z. (2016). A Limited Incremental Clustering Algorithm with Respect to Cluster Stability. In: Xie, J., Chen, Z., Douglas, C., Zhang, W., Chen, Y. (eds) High Performance Computing and Applications. HPCA 2015. Lecture Notes in Computer Science, vol 9576. Springer, Cham. https://doi.org/10.1007/978-3-319-32557-6_18

DOI: https://doi.org/10.1007/978-3-319-32557-6_18
Published: 20 July 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32556-9
Online ISBN: 978-3-319-32557-6
eBook Packages: Computer Science, Computer Science (R0)
