Abstract
In the field of cluster analysis, most of the available algorithms were designed for small data sets, which cannot efficiently deal with large scale data set encountered in data mining. However, some sampling-based clustering algorithms for large scale data set cannot achieve ideal result. For this purpose, a FCM-based clustering ensemble algorithm is proposed. Firstly, it performs the atom clustering algorithm on the large data set. Then, randomly select a sample from each atom as representative to reduce the data amount. And the ensemble learning technique is used to improve the clustering performance. For the complex large data sets, the new algorithm has high classification speed and robustness. The experimental results illustrate the effectiveness of the proposed clustering algorithm.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Qing, H.: Advance of the theory and application of fuzzy clustering analysis. Fuzzy System and Fuzzy Mathematics 12(2), 89–94 (1998) (in Chinese)
Gao, X.: Optimization and Applications Research on Fuzzy Clustering Algorithms, Doctoral Thesis, Xidian University, Xi’an 710071, China (1999)
Anderberg, M.R.: Cluster Analysis for Applications. Academic Press, London (1973)
Kaufman, L., Rousseeuw, P.J.: Finding groups in data: an introduction to cluster analysis (1990)
Dietterich, T.G.: Machine learning research: Four current directions. AI Magazine 18(4), 97–136 (1997)
Everitt, B.: Cluster Analysis, pp. 45–60. Heinemann Educational Books Ltd., New York (1974)
Bezdek, J.C.: Pattern Recognition with Fuzzy Object Function Algorithms. Plenum, New York (1981)
Gao, X., Li, J., Ji, H.: An automatic multi-threshold image segmentation algorithm based on weighting FCM and statistical test. Acta Electronica Sinica 32(4), 661–664 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, J., Gao, X., Tian, C. (2006). FCM-Based Clustering Algorithm Ensemble for Large Data Sets. In: Wang, L., Jiao, L., Shi, G., Li, X., Liu, J. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2006. Lecture Notes in Computer Science(), vol 4223. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881599_66
Download citation
DOI: https://doi.org/10.1007/11881599_66
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45916-3
Online ISBN: 978-3-540-45917-0
eBook Packages: Computer ScienceComputer Science (R0)