Skip to main content

Supporting Efficient Distributed Top-k Monitoring

  • Conference paper
Advances in Web-Age Information Management (WAIM 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4016))

Included in the following conference series:

Abstract

This paper addresses the efficient processing of distributed top-k monitoring, which is continuously reporting the k largest values according to a user-specified ranking function over distributed data streams. To minimize communication requirements, the necessary data transmitting must be selected carefully. We study the optimization problem of which objects are necessary to be transmitted and present a new distributed top-k monitoring algorithm to reduce communication cost. In our approach, few objects are transmitted for maintaining the top-k set and communication cost is independent of k. We verify the effectiveness of our approach empirically using both real-world and synthetic data sets. We show that our approach reduces overall communication cost by a factor ranging from 2 to over an order of magnitude compared with the previous approach when k is no lees than 10.

This research is partly supported by the National High Technology Research and Development Plan (863 plan) of China under Grants No.2004AA112020 and No.2003AA111020.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arlitt, M., Jin, T.: 1998 world cup web site access logs (August 1998), Available at: http://www.acm.org/sigcomm/ITA/

  2. Bruno, N., Gravano, L., Marian, A.: Evaluating top-k queries over web-accessible databases. In: ICDE 2002 (2002)

    Google Scholar 

  3. Balke, W.-T., Nejdl, W., Siberski, W., et al.: Progressive Distributed Top-k Retrieval in Peerto- Peer Networks. In: ICDE 2005 (2005)

    Google Scholar 

  4. Babcock, B., Olston, C.: Distributed Top-K Monitoring. In: SIGMOD (2003)

    Google Scholar 

  5. Babcock, B., Olston, C.: Distributed top-k monitoring. Technical report, Stanford University Computer Science Department (2002), http://dbpubs.stanford.edu/pub/2002-61

  6. Carney, D., Cetintemel, U., Cherniack, M., et al.: Monitoring streams - a new class of data management applications. In: VLDB (2002)

    Google Scholar 

  7. Chen, J., DeWitt, D.J., Tian, F., et al.: NiagaraCQ: A scalable continuous query system for internet databases. In: SIGMOD (2000)

    Google Scholar 

  8. Chang, K.C.-C., Hwang, S.-W.: Minimal probing: supporting expensive predicates for top-k queries. In: SIGMOD (2002)

    Google Scholar 

  9. Carey, M.J., Kossmann, D.: On saying “Enough already!” in SQL. In: SIGMOD (1997)

    Google Scholar 

  10. Cao, P., Wang, Z.: Efficient top-k query calculation in distributed networks. In: PODC (2004)

    Google Scholar 

  11. Fagin, R.: Combining fuzzy information from multiple systems. J. Comput. System Sci. 58, 83–99 (1999)

    Article  MATH  MathSciNet  Google Scholar 

  12. Fagin, R., Lotem, A., Naor, M.: Optimal aggregation algorithms for middleware. In: PODS (2001)

    Google Scholar 

  13. Güntzer, U., Balke, W.-T., Kieβling, W.: Optimizing multi-feature queries for image databases. In: VLDB (2000)

    Google Scholar 

  14. Gibbons, P.B., Matias, Y.: New sampling-based summary statistics for improving approximate query answers. In: SIGMOD (1998)

    Google Scholar 

  15. Madden, S., Hellerstein, J.M., Shah, M., et al.: Continuously adaptive continuous queries over streams. In: SIGMOD (2002)

    Google Scholar 

  16. Michel, S., Triantafillou, P., Weikum, G.: KLEE: A Framework for Distributed Top-k Query Algorithms. In: VLDB (2005)

    Google Scholar 

  17. Nepal, S., Ramakrishna, M.V.: Query processing issues in image (multimedia) databases. In: ICDE (1999)

    Google Scholar 

  18. Zipf, G.K.: Human Behavior and the Principle of Least Effort. Addison-Wesley Press, Reading (1949)

    Google Scholar 

  19. Zhang, R., Koudas, N., Ooi, B.C., et al.: Multiple Aggregations Over Data Streams. In: SIGMOD (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Deng, B., Jia, Y., Yang, S. (2006). Supporting Efficient Distributed Top-k Monitoring. In: Yu, J.X., Kitsuregawa, M., Leong, H.V. (eds) Advances in Web-Age Information Management. WAIM 2006. Lecture Notes in Computer Science, vol 4016. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11775300_42

Download citation

  • DOI: https://doi.org/10.1007/11775300_42

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-35225-9

  • Online ISBN: 978-3-540-35226-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics