Running Time Prediction for Web Search Queries

Rojas, Oscar; Gil-Costa, Veronica; Marin, Mauricio

doi:10.1007/978-3-319-32152-3_20

Oscar Rojas^19,20,
Veronica Gil-Costa^19,20 &
Mauricio Marin^19,20

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9574))

1201 Accesses

Abstract

Large scale Web search engines have to process thousands of queries per second and each query has to be solved within a fraction of a second. To achieve this goal, search engines rely on sophisticated services capable of processing large amounts of data. One of these services is the search service (or index service) which is in charge of computing the top-k document results for user queries. Predicting in advance the response time of queries has practical applications in efficient administration of hardware resources assigned to query processing. In this paper, we propose and evaluate a query running time prediction algorithm that is based on a discrete Fourier transform which models the index as a collection of signals to obtain patterns. Results show that our approach performs at least as effectively as well-known prediction algorithms in the literature, while significantly improving computational efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Broder, A.Z., Carmel, D., Herscovici, M., Soffer, A., Zien, J.Y.: Efficient query evaluation using a two-level retrieval process. In: CIKM, pp. 426–434 (2003)
Google Scholar
Macdonald, N.T.C., Ounis, I.: Learning to predict response times for online query scheduling. In: SIGIR, pp. 621–630 (2012)
Google Scholar
Chakrabarti, K., Chaudhuri, S., Ganti, V.: Interval-based pruning for top-k processing over compressed lists. In: ICDE, pp. 709–720 (2011)
Google Scholar
Cronen-Townsend, S., Zhou, Y., Croft, W.B.: Predicting query performance. In: SIGIR, pp. 299–306 (2002)
Google Scholar
Ding, S., Suel, T.: Faster top-k document retrieval using block-max indexes. In: SIGIR, pp. 993–1002 (2011)
Google Scholar
Kim, S., He, Y., Hwang, S., Elnikety, S., Choi, S.: Delayed-dynamic-selective (DDS) prediction for reducing extreme tail latency in web search. In: WSDM, pp. 7–16 (2015)
Google Scholar
Park, L., Ramamohanarao, K., Palaniswami, M.: Fourier domain scoring: a novel document ranking method. TKDE 16(5), 529–539 (2004)
Google Scholar
Rojas, O., Gil-Costa, V., Marin, M.: Efficient parallel block-max wand algorithm. In: Wolf, F., Mohr, B., an Mey, D. (eds.) Euro-Par 2013. LNCS, vol. 8097, pp. 394–405. Springer, Heidelberg (2013)
Chapter Google Scholar
Tonellotto, N., Macdonald, C., Ounis, I.: Efficient and effective retrieval using selective pruning. In: WSDM, pp. 63–72 (2013)
Google Scholar
Warren, T.: Clustering of time series data-a survey. JPR 38(11), 1857–1874 (2005)
MATH Google Scholar

Download references

Acknowledgments

This research was partially funded by Basal funds FB0001, Conicyt, Chile; PMI USA 1204 and PICT 2014-1146.

Author information

Authors and Affiliations

CITIAPS, DIINF, University of Santiago, Santiago, Chile
Oscar Rojas, Veronica Gil-Costa & Mauricio Marin
Center for Biotechnology and Bioengineering, Santiago, Chile
Oscar Rojas, Veronica Gil-Costa & Mauricio Marin

Authors

Oscar Rojas
View author publications
You can also search for this author in PubMed Google Scholar
Veronica Gil-Costa
View author publications
You can also search for this author in PubMed Google Scholar
Mauricio Marin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Veronica Gil-Costa .

Editor information

Editors and Affiliations

Czestochowa University of Technolog, Czestochowa, Poland
Roman Wyrzykowski
Department of Computer Science, University of Southern California, Marina Del Rey, California, USA
Ewa Deelman
Electrical Engineering & Comput. Science, University of Tennessee, Knoxville, Tennessee, USA
Jack Dongarra
Czestochowa University of Technology, Institute of Computer & Information Sci., Czestochowa, Poland
Konrad Karczewski
Department of Computer Science, AGH University of Science and Technology, Krakow, Poland
Jacek Kitowski
Systèmes d’informations, Big Data et Rec, AGH University of Science and Technology, Krakow, Poland
Kazimierz Wiatr

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rojas, O., Gil-Costa, V., Marin, M. (2016). Running Time Prediction for Web Search Queries. In: Wyrzykowski, R., Deelman, E., Dongarra, J., Karczewski, K., Kitowski, J., Wiatr, K. (eds) Parallel Processing and Applied Mathematics. Lecture Notes in Computer Science(), vol 9574. Springer, Cham. https://doi.org/10.1007/978-3-319-32152-3_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-32152-3_20
Published: 02 April 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32151-6
Online ISBN: 978-3-319-32152-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics