Abstract
Searching for information on the Web has attracted great attention in many research communities. Most Chinese web search engines return thousands or even millions of documents for a query, so efficient interfaces for search and navigation are critically needed. In this paper, we propose an interactive search results clustering system that facilitates browsing Chinese web pages in a more compact and thematic form. Users can select the clusters that best match the implicit meanings of their queries and personalize the search results on the fly. Our experiments show that this highly efficient approach outperforms traditional Chinese search engines.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, W., Xue, GR., Huang, S., Yu, Y. (2005). Interactive Chinese Search Results Clustering for Personalization. In: Fan, W., Wu, Z., Yang, J. (eds) Advances in Web-Age Information Management. WAIM 2005. Lecture Notes in Computer Science, vol 3739. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11563952_63
DOI: https://doi.org/10.1007/11563952_63
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29227-2
Online ISBN: 978-3-540-32087-6
eBook Packages: Computer Science, Computer Science (R0)