A Novel Typical-Sample-Weighted Clustering Algorithm for Large Data Sets

Li, Jie; Gao, Xinbo; Jiao, Licheng

doi:10.1007/11596448_103

A Novel Typical-Sample-Weighted Clustering Algorithm for Large Data Sets

Jie Li²⁶,
Xinbo Gao²⁶ &
Licheng Jiao²⁶

Conference paper

1700 Accesses
5 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3801))

Abstract

In the field of cluster analysis, most of existing algorithms are developed for small data sets, which cannot effectively process the large data sets encountered in data mining. Moreover, most clustering algorithms consider the contribution of each sample for classification uniformly. In fact, different samples should be of different contribution for clustering result. For this purpose, a novel typical-sample-weighted clustering algorithm is proposed for large data sets. By the atom clustering, the new algorithm extracts the typical samples to reduce the data amount. Then the extracted samples are weighted by their corresponding typicality and then clustered by the classical fuzzy c-means (FCM) algorithm. Finally, the Mahalanobis distance is employed to classify each original sample into obtained clusters. It is obvious that the novel algorithm can improve the speed and robustness of the traditional FCM algorithm. The experimental results with various test data sets illustrate the effectiveness of the proposed clustering algorithm.

This work was supported by National Natural Science Foundation of China (No.60202004), the Key project of Chinese Ministry of Education (No.104173) and the program for New Century Excellent Talents in University of China.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Qing, H.: Advance of the theory and application of fuzzy clustering analysis. Fuzzy System and Fuzzy Mathematics 12(2), 89–94 (1998) (in Chinese)
Google Scholar
Gao, X.: Optimization and Applications Research on Fuzzy Clustering Algorithms. Doctoral Thesis, Xidian University, Xi’an 710071, China (1999)
Google Scholar
Anderberg, M.R.: Cluster Analysis for Applications. Academic Press, London (1973)
MATH Google Scholar
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, Chichester (1990)
Google Scholar
Everitt, B.: Cluster Analysis, pp. 45–60. Heinemann Educational Books Ltd., New York (1974)
Google Scholar
Gao, X., Li, J., Ji, H.: An automatic multi-threshold image segmentation algorithm based on weighting FCM and statistical test. Acta Electronica Sinica 32(4), 661–664 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electronic Engineering, Xidian University, Xi’an, 710071, China
Jie Li, Xinbo Gao & Licheng Jiao

Authors

Jie Li
View author publications
You can also search for this author in PubMed Google Scholar
Xinbo Gao
View author publications
You can also search for this author in PubMed Google Scholar
Licheng Jiao
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microelectronic Instiute, Xidian University, 710071, Xi’an, China
Yue Hao
Department of Computer Science, Hong Kong Baptist University, Kowloon Tong, Hong Kong
Jiming Liu
School of Computer Science and Technology, Xidian University, Xi’an, China
Yuping Wang
Department of Computer Science, Hong Kong Baptist University, Hong Kong,
Yiu-ming Cheung
School of Electrical and Electronic Engineering, University of Manchester, UK
Hujun Yin
Life Science Research Center, School of Electronic Engineering, Xidian University, 710071, Xi’an, Shaanxi, China
Licheng Jiao
Key Laboratory of Computer Networks and Information Security (Ministry of Education), Xidian University, 710071, Xi’an, China
Jianfeng Ma
National Laboratory of Antennas and Microwave Technology, Xidian University, 710071, Xi’an, Shanxi, P.R. China
Yong-Chang Jiao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, J., Gao, X., Jiao, L. (2005). A Novel Typical-Sample-Weighted Clustering Algorithm for Large Data Sets. In: Hao, Y., et al. Computational Intelligence and Security. CIS 2005. Lecture Notes in Computer Science(), vol 3801. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11596448_103

Download citation

DOI: https://doi.org/10.1007/11596448_103
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30818-8
Online ISBN: 978-3-540-31599-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics