Abstract
We investigate document clustering through adaptation of Zamir and Etzioni’s method for Suffix Tree Clustering. We modified it with substantial improvements in effectiveness and efficiency compared to the original algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chim, H., Deng, X.: A New Suffix Tree Similarity Measure for Document Clustering. In: Proceedings of the 16th international conference on World Wide Web, pp. 121–130. ACM, New York (2007)
Zu Eissen, S.M., Stein, B., Potthast, M.: The Suffix Tree Document Model Revisited. In: Tochtermann, M. (ed.) Proceedings of the I-KNOW 2005, Graz 5th International Conference on Knowledge Management Journal of Universal Computer Science, pp. 596–603 (2005)
Gulla, Borch, Ingvaldsen: Contextualized Clustering in Exploratory Web Search. In: do Prado, H.A., Ferneda, E. (eds.) Emerging Technologies of Text Mining: Techniques and Applications, pp. 184–207. IGI Global (2007)
Moe, R., Elgesem, D.: Compact trie clustering for overlap detection in news. In: Proceedings of NIK 2013 (2013)
Smyth, B.: Computing Patterns in Strings. Addison Wesley (2003)
Zamir, O., Etzioni, O.: Web Document Clustering: A Feasibility Demonstration. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 46–54. ACM, New York (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Moe, R.E. (2014). Improvements to Suffix Tree Clustering. In: de Rijke, M., et al. Advances in Information Retrieval. ECIR 2014. Lecture Notes in Computer Science, vol 8416. Springer, Cham. https://doi.org/10.1007/978-3-319-06028-6_73
Download citation
DOI: https://doi.org/10.1007/978-3-319-06028-6_73
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06027-9
Online ISBN: 978-3-319-06028-6
eBook Packages: Computer ScienceComputer Science (R0)