Abstract
In this paper, it designs an improved sequential clustering approach, which compensates for shortcomings in existing algorithms. This method uses bisecting k-means clustering framework and reduces the computing time through adding the cosine similarity comparison when sequences can not satisfy the pruning condition, while the accuracy is still in an acceptable range.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Jain, A.K.: Data Clustering: 50 Years beyond K-Means. Pattern Recognition Letters (2009)
Gesfield, D.: Algorithms on strings, trees, and sequences. Cambridge University Press, Cambridge (1997)
Kahveci, T., Singh, A.K.: An efficient index structure for string databases. In: Proc.Of the 27th Int’l Conf. On Vear Large Data Bases, VLDB, pp. 351–360 (2001)
Ukkonen, E.: Approximate string matching with q-gram and maximal matching. Theory Computer Science 1, 191–211 (1992)
Steinbach, M., Karypis, G., Kumar, V. : A Comparison of Document Clustering Techniques. In: KDD Workshop on Text Mining (2000)
Dai, D.-B., Tang, S.-L., Xiong, Y.: Research on sequence data similarity query. PhD thesis. Fudan University (2009)
Dai, D.-B., Tang, S.-L., Xiong, Y.: Clustering Algorithms Based on Global and Local Similarity. Journal of Software Sequence 21(4) (2010)
Kashef, R., Kamel, M.S.: Enhanced bisecting k-means clustering using intermediate cooperation. Pattern Recognition 42(11), 2557–2569 (2009)
Datasets: http://fimi.ua.ac.be/data/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, Y., Gao, B., Zhang, X. (2011). An Improved Sequential Clustering Algorithm. In: Deng, H., Miao, D., Lei, J., Wang, F.L. (eds) Artificial Intelligence and Computational Intelligence. AICI 2011. Lecture Notes in Computer Science(), vol 7002. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23881-9_58
Download citation
DOI: https://doi.org/10.1007/978-3-642-23881-9_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23880-2
Online ISBN: 978-3-642-23881-9
eBook Packages: Computer ScienceComputer Science (R0)