Abstract
K-means algorithm has the performance degradation problem due to improper initial centroids. In order to solve the problem, we suggest BK-means (Balanced K-means) algorithm to cluster documents. This algorithm uses the value, α, to adjust each cluster weight which is first defined in this paper. We compared the algorithm to the general K-means algorithms on Reutor-21578. The experimental results show about 11% higher performance than that of the general K-means algorithm with the balanced F Measure (BFM).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Arthur, D., Vassilvitskii, S.: K-means++: the advantages of careful seeding. In: Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 1027–1035. Society for Industrial and Applied Mathematics Philadelphia (2007)
Manning, C., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press (2008)
Liu, T., Liu, S., Chen, Z., Ma, W.Y.: An evaluation on feature selection for text clustering. In: Proceedings of the Twentieth International Conference, pp. 488–495. AAAI Press, Washington, DC (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jo, H., Park, Sc. (2014). BK-means Algorithm with Minimal Performance Degradation Caused by Improper Initial Centroid. In: Park, J., Adeli, H., Park, N., Woungang, I. (eds) Mobile, Ubiquitous, and Intelligent Computing. Lecture Notes in Electrical Engineering, vol 274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40675-1_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-40675-1_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40674-4
Online ISBN: 978-3-642-40675-1
eBook Packages: EngineeringEngineering (R0)