Skip to main content

Clustering-Based Searching and Navigation in an Online News Source

  • Conference paper
Advances in Information Retrieval (ECIR 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3936))

Included in the following conference series:

Abstract

The growing amount of online news posted on the WWW demands new algorithms that support topic detection, search, and navigation of news documents. This work presents an algorithm for topic detection that considers the temporal evolution of news and the structure of web documents. Then, it uses the results of the topic detection algorithm for searching and navigating in an online news source. An experimental evaluation with a collection of online news in Spanish indicates the advantages of incorporating the temporal aspect and structure of documents in the topic detection of news. In addition, topic-based clusters are well suited for guiding the search and navigation of news.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Allan, J., Carbonell, J., Doddington, G., Yamron, J., Yang, Y.: Topic detection and tracking pilot study: Final report. In: DARPA Broadcast News Trasncription and Understanding Workshop, pp. 194–218 (September 1998), http://citeseer.ist.psu.edu/article/allan98topic.html

  2. Allan, J., feng, A., Bolivar, A.: Flexible intrinsic evaluation of hierarchical clustering for TDT. In: Twelfth International Conference on Information and Knowledge Management, pp. 263–270. ACM Press, New York (2003)

    Google Scholar 

  3. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. ACM Press, New York (1999)

    Google Scholar 

  4. EMOL. El mercurio online, http://www.emol.com/

  5. Ferragina, P., Gulli, A.: A perzonalized search engine based on web-snippet hierarchical clustering. In: International Conference in the World Wide Web WWW 2005, China, Japan, pp. 801–810. ACM Press, New York (2005)

    Google Scholar 

  6. Fukumoto, F., Suzuki, Y.: Event tracking based on domain dependency. In: 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Athens, Greece, pp. 24–28. ACM Press, New York (2000)

    Google Scholar 

  7. AbsInt Angewandte Informatik GmbH. GDL: aiSee graph visualization software: User manual unix version 2.2.07 (September 2005), http://www.aisee.com/manual/unix/

  8. Makkonen, J., Ahonen-Myka, H., Salmenkivi, M.: Topic detection and tracking with spatio-temporal evidence. In: Sebastiani, F. (ed.) ECIR 2003. LNCS, vol. 2633, pp. 251–265. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  9. Mostafa, J.: Seeking better web searches. Scientific American Digital (2005), http://www.sciam.com/

  10. Ram, S., Shankaranarayanan, G.: Modeling and navigation of large information spaces: A semantic based approach. In: International Conference on System Science, IEEE CS Press, Los Alamitos (1999), http://computer.org/proceedings/hicss/0001/00016/00016020abs.htm

    Google Scholar 

  11. Roussinov, D., McQuaid, M.: Information navigation by clustering and summary query results. In: International Conference on System Sciences, p. 3006. IEEE CS Press, Los Alamitos (2000)

    Google Scholar 

  12. Papka, R., Allan, J., Lavrenko, V.: Umass approaches to detection and tracking at tdt2. In: Proceedings of the DARPA Broadcast News (1999), http://www.nist.gov/speech/publications/darpa99/index.htm

  13. UMASS. Topic Detection and Tracking TDT (2005), http://ciir.cs.umass.edu/projects/tdt/index.html

  14. W3C. Scalable vector graphics (svg) 1.1 specification, http://www.w3.org/tr/svg/

  15. Walls, F., Jin, H., Sista, S., Schwatz, R.: Topic detection in broadcast news. In: Proceedings of the DARPA Broadcast News (1999), http://www.nist.gov/speech/publications/darpa99/index.htm

  16. Yang, Y., Carbonelli, J., Brown, R., Pierce, T., Archibald, B., Liu, X.: Learning approaches for detection and tracking news events. IEEE Intelligent Systems Special Isuue on Applications of Intelligent Information Retrieval 14, 32–43 (1999)

    Google Scholar 

  17. Zhang, Y., Ji, X., Chu, C.-H., Zha, H.: Correlating summarization of multi-source news with k-way graph bi-clustering. ACM SIGKDD Explorations Newsletter 6(2), 34–42 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Smith, S.C., Rodríguez, M.A. (2006). Clustering-Based Searching and Navigation in an Online News Source. In: Lalmas, M., MacFarlane, A., Rüger, S., Tombros, A., Tsikrika, T., Yavlinsky, A. (eds) Advances in Information Retrieval. ECIR 2006. Lecture Notes in Computer Science, vol 3936. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11735106_14

Download citation

  • DOI: https://doi.org/10.1007/11735106_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-33347-0

  • Online ISBN: 978-3-540-33348-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics