Skip to main content

Progressive Ranking of Range Aggregates

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3589))

Abstract

Ranking-aware queries have been gaining much attention recently in many applications such as search engines and data streams. They are, however, not only restricted to such applications but are also very useful in OLAP applications. In this paper, we introduce aggregation ranking queries in OLAP data cubes motivated by an online advertisement tracking data warehouse application. These queries aggregate information over a specified range and then return the ranked order of the aggregated values. They differ from range aggregate queries in that range aggregate queries are mainly concerned with an aggregate operator such as SUM and MIN/MAX over the selected ranges of all dimensions in the data cubes. Existing techniques for range aggregate queries are not able to process aggregation ranking queries efficiently. Hence, in this paper we propose new algorithms to handle this problem. The essence of the proposed algorithms is based on both ranking and cumulative information to progressively rank aggregation results. Furthermore we empirically evaluate our techniques and the experimental results show that the query cost is improved significantly.

This research is supported by the NSF grants under IIS-23022, CNF-0423336, and EIA-00-80134.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Babcock, B., Olston, C.: Distributed top-k monitoring. In: Proc. of Int. Conf. on Managment of Data (SIGMOD), pp. 563–574 (2003)

    Google Scholar 

  2. Bruno, N., Chaudhuri, S., Gravano, L.: Top-k selection queries over relational databases: Mapping strategies and performance evaluation. ACM Trans. on Database Systems 27(2), 153–187 (2002)

    Article  Google Scholar 

  3. Bruno, N., Gravano, L., Marian, A.: Evaluating top-k queries over web accessible databases. In: Proc. of Int. Conf. on Data Engineering (ICDE), pp. 369–380 (2002)

    Google Scholar 

  4. Chang, K.C., Hwang, S.: Minimal probing: Supporting expensive predicates for top-k queries. In: Proc. of Int. Conf. on Management of Data (SIGMOD), pp. 346–357 (2002)

    Google Scholar 

  5. Charikar, M., Chen, K., Farach-Colton, M.: Approximate frequency counts over data streams. In: Proc. of 29th Int. Colloq. on Automata, Languages and Programming, pp. 693–703 (2002)

    Google Scholar 

  6. Ho, C., Agrawal, R., Megiddo, N., Srikant, R.: Range queries in olap data cubes. In: Proc. of Int. Conf. on Management of Data (SIMGMOD), pp. 73–88 (1997)

    Google Scholar 

  7. Donjerkovic, D., Ramakrishnan, R.: Probabilistic optimization of top N queries. In: Proc. of Int. Conf. on Very Large Data Bases (VLDB), pp. 411–422 (1999)

    Google Scholar 

  8. Fagin, R.: Combining fuzzy information from multiple systems. In: Proc. of Symp. on Principles of Database Systems (PODS), pp. 216–226 (1996)

    Google Scholar 

  9. Golab, L., DeHaan, D., Demaine, E.D., Lopez-Ortiz, A., Munro, J.I.: Identifying frequent items in sliding windows over on-line packet streams. In: Proc. of the conference on Internet measurement conferenc, pp. 173–178 (2003)

    Google Scholar 

  10. Ilyas, I.F., Aref, W.G., Elmagarmid, A.K.: Joining ranked inputs in practice. In: Proc. of Int. Conf. on Very Large Data Bases (VLDB), pp. 950–961 (2002)

    Google Scholar 

  11. Ilyas, I.F., Aref, W.G., Elmagarmid, A.K.: Supporting top-k join queries in relational databases. In: Proc. of Int. Conf. on Very Large Data Bases (VLDB), pp. 754–765 (2003)

    Google Scholar 

  12. Gray, J., Bosworth, A., Layman, A., Pirahesh, H.: Data cube: A relational aggregation operator generalizing group-by, cross-tabs and sub-totals. In: Proc. of Int. Conf. on Data Engeering(ICDE), pp. 152–159 (1996)

    Google Scholar 

  13. Lee, S.Y., Ling, T.W., Li, H.-G.: Hierarchical compact cube for range-max queries. In: Proc. of Int. Conf. on Very Large Data Bases (VLDB), pp. 232–241 (2000)

    Google Scholar 

  14. Li, C., Chang, K.C.-C., Ilyas, I.F., Song, S.: Ranksql: Query algebra and opatimization for relational topk queries. In: Proc. of Int. Conf. on Management of Data, SIGMOD (2005)

    Google Scholar 

  15. Li, H.-G., Yu, H., Agrawal, D., El Abbadi, A.: Ranking aggregates. Technical Report 2004-07, University of California at Santa Barbara (2004), http://www.cs.ucsb.edu/research/trcs/docs/2004-07.pdf

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, HG., Yu, H., Agrawal, D., El Abbadi, A. (2005). Progressive Ranking of Range Aggregates. In: Tjoa, A.M., Trujillo, J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2005. Lecture Notes in Computer Science, vol 3589. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11546849_18

Download citation

  • DOI: https://doi.org/10.1007/11546849_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28558-8

  • Online ISBN: 978-3-540-31732-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics