SDCC: A New Stable Double-Centroid Clustering Technique Based on K-Means for Non-spherical Patterns

Chang, Juifang

doi:10.1007/978-3-642-01510-6_89

Juifang Chang¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5552))

Included in the following conference series:

International Symposium on Neural Networks

989 Accesses

Abstract

Numerous existing partitioning clustering algorithms, such as K-means, are developed to discover clusters that fit some of the static models. These algorithms may fail if it chooses a set of incorrect parameters in the static model with respect to the objects being clustered, or when the objects consist of patterns that are of non-spherical or not the same size. Furthermore, they could produce an instable result. This investigation presents a new partition clustering algorithm named SDCC, which can improve the problem of instable results in partitioning-based clustering, such as K-means. As a hybrid approach that utilizes double-centroid concept, the proposed algorithm can eliminate the above-mentioned drawbacks to produce stable results while recognizing the non-spherical patterns and clusters that are not the same size. Experimental results illustrate that the new algorithm can identify non-spherical pattern correctly, and efficiently reduces the problem of long computational time when applying KGA and GKA. It also indicates that the proposed approach produces much smaller errors than K-means, KGA and GKA approaches in most cases examined herein.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

McQueen, J.B.: Some Methods of Classification and Analysis of Multivariate Observations. In: 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297. Univ. of Calif. Press, California (1967)
Google Scholar
Krishna, K., Murty, M.N.: Genetic K-means Algorithm. IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics 29, 433–439 (1999)
Article Google Scholar
Bandyopadhyay, S., Maulik, U.: An Evolutionary Technique Based on K-means Algorithm for Optimal Clustering in R^N. Information Sciences 146, 221–237 (2002)
Article MathSciNet MATH Google Scholar
Tsai, C.F., Tsai, C.W., Wu, H.C., Yang, T.: ACODF: A Novel Data Clustering Approach for Data Mining in Large Databases. Journal of Systems and Software 73, 133–145 (2004)
Article Google Scholar
Wang, T.P., Tsai, C.F.: GDH: An Effective and Efficient Approach to Detect Arbitrary Patterns in Clusters with Noises in Very Large Databases. Degree of master at National Pingtung University of Science and Technology, Taiwan (2006)
Google Scholar
Tsai, C.F., Yen, C.C.: ANGEL: A New Effective and Efficient Hybrid Clustering Technique for Large Databases. In: Zhou, Z.-H., Li, H., Yang, Q. (eds.) PAKDD 2007. LNCS, vol. 4426, pp. 817–824. Springer, Heidelberg (2007)
Chapter Google Scholar
Tsai, C.F., Yen, C.C.: G-TREACLE: A New Grid-Based and Tree-Alike Pattern Clustering Technique for Large Databases. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds.) PAKDD 2008. LNCS, vol. 5012, pp. 739–748. Springer, Heidelberg (2008)
Chapter Google Scholar
Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In: 2nd International Conference on Knowledge Discovery and Data Mining, pp. 226–231. AAAI, Portland (1996)
Google Scholar
Hinneburg, A., Keim, D.A.: An Efficient Approach to Clustering in Large Multimedia Databases with Noise. In: 4th International Conference on Knowledge Discovery and Data Mining, pp. 58–65. AAAI, New York (1998)
Google Scholar
Agrawal, R., Gehrke, J., Gunopulos, D., Raghavan, P.: Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications. In: ACM SIGMOD International Conference on Management of Data, pp. 94–105. ACM Press, Seattle (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of International Business, National Kaohsiung University of Applied Sciences, 80778, Kaohsiung, Taiwan
Juifang Chang

Authors

Juifang Chang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Departamento de Control Automático,, CINVESTAV-IPN,, A.P. 14-740, Av.IPN 2508,, D.F., 07360,, México, México
Wen Yu
Deptartment of Electrical and Computer Engineering,, Stevens Institute of Technology,, NJ 07030,, Hoboken,, USA
Haibo He
Dept. of Electrical and Computer Engineering,, South Dakota School of Mines & Technology,, 501 E. St. Joseph Street,, SD 57701,, Rapid City,, USA
Nian Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chang, J. (2009). SDCC: A New Stable Double-Centroid Clustering Technique Based on K-Means for Non-spherical Patterns. In: Yu, W., He, H., Zhang, N. (eds) Advances in Neural Networks – ISNN 2009. ISNN 2009. Lecture Notes in Computer Science, vol 5552. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01510-6_89

Download citation

DOI: https://doi.org/10.1007/978-3-642-01510-6_89
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01509-0
Online ISBN: 978-3-642-01510-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics