Abstract
Reorganising the index of a search engine based on access frequencies can significantly reduce query evaluation time while maintaining search effectiveness. In this paper we extend access-ordering and introduce a variant index organisation technique that we label access-reordering. We show that by access-reordering an inverted index, query evaluation time can be reduced by as much as 62% over the standard approach, while yielding highly similar effectiveness results to those obtained when using a conventional index.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Garcia, S., Turpin, A. (2006). Efficient Query Evaluation Through Access-Reordering. In: Ng, H.T., Leong, MK., Kan, MY., Ji, D. (eds) Information Retrieval Technology. AIRS 2006. Lecture Notes in Computer Science, vol 4182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11880592_9
DOI: https://doi.org/10.1007/11880592_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45780-0
Online ISBN: 978-3-540-46237-8
eBook Packages: Computer Science, Computer Science (R0)