Abstract
The growing amount of online news posted on the WWW demands new algorithms that support topic detection, search, and navigation of news documents. This work presents an algorithm for topic detection that considers the temporal evolution of news and the structure of web documents. Then, it uses the results of the topic detection algorithm for searching and navigating in an online news source. An experimental evaluation with a collection of online news in Spanish indicates the advantages of incorporating the temporal aspect and structure of documents in the topic detection of news. In addition, topic-based clusters are well suited for guiding the search and navigation of news.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allan, J., Carbonell, J., Doddington, G., Yamron, J., Yang, Y.: Topic detection and tracking pilot study: Final report. In: DARPA Broadcast News Trasncription and Understanding Workshop, pp. 194–218 (September 1998), http://citeseer.ist.psu.edu/article/allan98topic.html
Allan, J., feng, A., Bolivar, A.: Flexible intrinsic evaluation of hierarchical clustering for TDT. In: Twelfth International Conference on Information and Knowledge Management, pp. 263–270. ACM Press, New York (2003)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. ACM Press, New York (1999)
EMOL. El mercurio online, http://www.emol.com/
Ferragina, P., Gulli, A.: A perzonalized search engine based on web-snippet hierarchical clustering. In: International Conference in the World Wide Web WWW 2005, China, Japan, pp. 801–810. ACM Press, New York (2005)
Fukumoto, F., Suzuki, Y.: Event tracking based on domain dependency. In: 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Athens, Greece, pp. 24–28. ACM Press, New York (2000)
AbsInt Angewandte Informatik GmbH. GDL: aiSee graph visualization software: User manual unix version 2.2.07 (September 2005), http://www.aisee.com/manual/unix/
Makkonen, J., Ahonen-Myka, H., Salmenkivi, M.: Topic detection and tracking with spatio-temporal evidence. In: Sebastiani, F. (ed.) ECIR 2003. LNCS, vol. 2633, pp. 251–265. Springer, Heidelberg (2003)
Mostafa, J.: Seeking better web searches. Scientific American Digital (2005), http://www.sciam.com/
Ram, S., Shankaranarayanan, G.: Modeling and navigation of large information spaces: A semantic based approach. In: International Conference on System Science, IEEE CS Press, Los Alamitos (1999), http://computer.org/proceedings/hicss/0001/00016/00016020abs.htm
Roussinov, D., McQuaid, M.: Information navigation by clustering and summary query results. In: International Conference on System Sciences, p. 3006. IEEE CS Press, Los Alamitos (2000)
Papka, R., Allan, J., Lavrenko, V.: Umass approaches to detection and tracking at tdt2. In: Proceedings of the DARPA Broadcast News (1999), http://www.nist.gov/speech/publications/darpa99/index.htm
UMASS. Topic Detection and Tracking TDT (2005), http://ciir.cs.umass.edu/projects/tdt/index.html
W3C. Scalable vector graphics (svg) 1.1 specification, http://www.w3.org/tr/svg/
Walls, F., Jin, H., Sista, S., Schwatz, R.: Topic detection in broadcast news. In: Proceedings of the DARPA Broadcast News (1999), http://www.nist.gov/speech/publications/darpa99/index.htm
Yang, Y., Carbonelli, J., Brown, R., Pierce, T., Archibald, B., Liu, X.: Learning approaches for detection and tracking news events. IEEE Intelligent Systems Special Isuue on Applications of Intelligent Information Retrieval 14, 32–43 (1999)
Zhang, Y., Ji, X., Chu, C.-H., Zha, H.: Correlating summarization of multi-source news with k-way graph bi-clustering. ACM SIGKDD Explorations Newsletter 6(2), 34–42 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Smith, S.C., Rodríguez, M.A. (2006). Clustering-Based Searching and Navigation in an Online News Source. In: Lalmas, M., MacFarlane, A., Rüger, S., Tombros, A., Tsikrika, T., Yavlinsky, A. (eds) Advances in Information Retrieval. ECIR 2006. Lecture Notes in Computer Science, vol 3936. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11735106_14
Download citation
DOI: https://doi.org/10.1007/11735106_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33347-0
Online ISBN: 978-3-540-33348-7
eBook Packages: Computer ScienceComputer Science (R0)