Skip to main content

Navigating Among Search Results: An Information Content Approach

  • Conference paper
  • 1130 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4831))

Abstract

Total or partial duplication of documents affects the effectiveness of the visualization of search results. In this paper we propose a navigation strategy that sorts a list of documents such that the first documents contain more information content decreasing considerably duplication. The strategy defines a content relation between documents based on their equivalence and omission and estimates the new information content obtained from visiting documents. In this paper, we describe the strategy and experimentally evaluate it. These results indicate the potential use of this strategy for the visualization of thematically related documents that are relevant to a query.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. ACM (2006), http://www.acm.org/class

  2. Baeza-Yates, R.A., Ribeiro-Neto, B.A.: Modern Information Retrieval. ACM Press, Addison-Wesley (1999)

    Google Scholar 

  3. Chowdhury, A., Frieder, O., Grossman, D.A., McCabe, M.C.: Collection statistics for fast duplicate document detection. ACM Trans. Inf. Syst. 20(2), 171–191 (2002)

    Article  Google Scholar 

  4. Geffet, M., Feitelson, D.G.: Hierarchical indexing and document matching in bow. In: JCDL, pp. 259–267. ACM, New York, NY, USA (2001)

    Chapter  Google Scholar 

  5. Google (2007), http://www.google.com

  6. Halkidi, M., Nguyen, B., Varlamis, I., Vazirgiannis, M.: Thesus: Organizing web document collections based on link semantics. VLDB J. 12(4), 320–332 (2003)

    Article  Google Scholar 

  7. Hammouda, K.M., Kamel, M.S.: Efficient phrase-based document indexing for web document clustering. IEEE Trans. Knowl. Data Eng. 16(10), 1279–1296 (2004)

    Article  Google Scholar 

  8. Hearst, M.A.: Modern Information Retrieval, chapter User Interfaces and Visualization, pp. 257–324. ACM Press, New York, NY, USA (1999)

    Google Scholar 

  9. Pereira Jr., Á.R., Ziviani, N.: Syntactic similarity of web documents. In: LA-WEB, p. 194. IEEE Computer Society Press, Los Alamitos, CA, USA (2003)

    Google Scholar 

  10. Kartoo, S.A.: Kartoo Metaseach Engine (2006), http://www.kartoo.com

  11. Koshman, S., Spink, A., Jansen, B.J.: Web searching on the vivisimo search engine. JASIST 57(14), 1875–1887 (2006)

    Article  Google Scholar 

  12. Kou, H., Gardarin, G.: Similarity model and term association for document categorization. In: DEXA Workshops, pp. 256–260. IEEE Computer Society, Los Alamitos, CA, USA (2002)

    Google Scholar 

  13. Ouyang, B.: Delivering knowledge to nasa scientist and engineers: using phrase matching to determine document similarity. In: Guimarães, M. (ed.) ACM Southeast Regional Conference (1), pp. 384–385. ACM, New York, NY, USA (2005)

    Google Scholar 

  14. Roussinov, D., McQuaid, M.: Information navigation by clustering and summarizing query results. In: HICSS (2000)

    Google Scholar 

  15. Yahoo! (2007), http://www.yahoo.com

  16. Zhang, Y., Ji, X., Chu, C.-H, Zha, H.: Correlating summarization of multi-source news with k-way graph bi-clustering. ACM SIGKDD Explorations Newsletter 6(2), 34–42 (2004)

    Article  Google Scholar 

  17. Zhang, Y., Chu, C.-H., Ji, X., Zha, H.: Correlating summarization of multi-source news with k-way graph bi-clustering. SIGKDD Explorations 6(2), 34–42 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Boualem Benatallah Fabio Casati Dimitrios Georgakopoulos Claudio Bartolini Wasim Sadiq Claude Godart

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bilbao, R., Rodríguez, M.A. (2007). Navigating Among Search Results: An Information Content Approach. In: Benatallah, B., Casati, F., Georgakopoulos, D., Bartolini, C., Sadiq, W., Godart, C. (eds) Web Information Systems Engineering – WISE 2007. WISE 2007. Lecture Notes in Computer Science, vol 4831. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76993-4_58

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-76993-4_58

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-76992-7

  • Online ISBN: 978-3-540-76993-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics