University of Otago at INEX 2010

  • Conference paper
Comparative Evaluation of Focused Retrieval (INEX 2010)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6932)

Abstract

In this paper, we describe the University of Otago’s participation in the Ad Hoc, Link-the-Wiki, Efficiency, and Data Centric Tracks of INEX 2010. In the Link-the-Wiki Track, we show that the simpler relevance summation method works better for producing Best Entry Points (BEPs). In the Ad Hoc Track, we discuss the effect of various stemming algorithms. In the Efficiency Track, we compare three query pruning algorithms and discuss other efficiency-related issues. Finally, in the Data Centric Track, we compare the BM25 and Divergence ranking functions.

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jia, XF., Alexander, D., Wood, V., Trotman, A. (2011). University of Otago at INEX 2010. In: Geva, S., Kamps, J., Schenkel, R., Trotman, A. (eds) Comparative Evaluation of Focused Retrieval. INEX 2010. Lecture Notes in Computer Science, vol 6932. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23577-1_23

  • DOI: https://doi.org/10.1007/978-3-642-23577-1_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23576-4

  • Online ISBN: 978-3-642-23577-1

  • eBook Packages: Computer Science, Computer Science (R0)
