skip to main content
10.1145/2484838.2484864acmotherconferencesArticle/Chapter ViewAbstractPublication PagesssdbmConference Proceedingsconference-collections
research-article

Multi-query scheduling for time-critical data stream applications

Published:29 July 2013Publication History

ABSTRACT

Many data stream applications, such as network intrusion detection, on-line financial tickers and environmental monitoring, typically exhibit certain "real-time" traits. In such applications, people are interested in strategies that ensure on-time delivery of query results. In this paper, we point out that traditional operator-based query scheduling strategies are insufficient to handle this class of problem. Therefore we choose to approach the issue from a new angle by modeling multi-query scheduling as a job-scheduling problem, a classical problem in real-time computing. By taking advantage of the wisdom in the real-time computing community, we propose several new scheduling strategies and algorithms to enhance the overall data stream query scheduling performance. Through extensive experiments over both real and synthetic data, we identify the important factors for scheduling performance and verify the effectiveness of our approaches.

References

  1. Internet traffic archive. http://www.sigcomm.org/ITA.Google ScholarGoogle Scholar
  2. R. Avnur and J. M. Hellerstein. Eddies: Continuously adaptive query processing. In SIGMOD Conference, pages 261--272, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. B. Babcock, S. Babu, M. Datar, and R. Motwani. Chain: Operator scheduling for memory minimization in data stream systems. In SIGMOD Conference, pages 253--264, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. K. Baruah, G. Koren, D. Mao, B. Mishra, A. Raghunathan, L. E. Rosier, D. Shasha, and F. Wang. On the competitiveness of on-line real-time task scheduling. Real-Time Systems, 4(2):125--144, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. D. Carney, U. Çetintemel, A. Rasin, S. B. Zdonik, M. Cherniack, and M. Stonebraker. Operator scheduling in a data stream manager. In VLDB, pages 838--849, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. B. Gedik, K.-L. Wu, P. S. Yu, and L. Liu. Adaptive load shedding for windowed stream joins. In CIKM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. G. V. Gens and E. V. Levner. Fast approximation algorithm for job sequencing with deadlines. Discrete Applied Mathematics, 3:313--318, 1981.Google ScholarGoogle ScholarCross RefCross Ref
  8. R. Givan, E. K. P. Chong, and H. S. Chang. Scheduling multiclass packet streams to minimize weighted loss. Queueing Syst., 41(3):241--270, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Q. Jiang and S. Chakravarthy. Scheduling strategies for processing continuous queries over streams. In BNCOD, pages 16--30, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  10. R. M. Karp. Reducibility among combinatorial problems. In Complexity of Computer Computations, pages 85--103. Plenum, New York, 1972.Google ScholarGoogle ScholarCross RefCross Ref
  11. E. L. Lawler. Sequencing to minimize the weighted number of tardy jobs. RAIRO Operations Research, 10:27--33, 1976.Google ScholarGoogle Scholar
  12. E. L. Lawler and J. Moore. A functional equation and its application to resource allocation and sequencing problems. Management Science, 16:77--84, 1969.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. W. E. Leland, M. S. Taqqu, W. Willinger, and D. V. Wilson. On the self-similar nature of ethernet traffic (extended version). IEEE/ACM Trans. Netw., 2(1):1--15, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. C. D. Locke. Best-Effort Decision Making for Real-Time Scheduling. PhD thesis, Carnegie Mellon University, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Moore. An n job one machine sequencing algorithm for minimizing the number of late jobs. Management Science, 15(1):102--109, 1968.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. V. Paxson and S. Floyd. Wide area traffic: the failure of poisson modeling. IEEE/ACM Trans. Netw., 3(3):226--244, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Sahni. Algorithms for scheduling independent tasks. J. ACM, 23(1):116--127, 1976. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. A. Sharaf, P. K. Chrysanthis, A. Labrinidis, and K. Pruhs. Efficient scheduling of heterogeneous continuous queries. In VLDB, pages 511--522, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. U. Srivastava and J. Widom. Flexible time management in data stream systems. In PODS, pages 263--274, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. N. Tatbul, U. Çetintemel, S. B. Zdonik, M. Cherniack, and M. Stonebraker. Load shedding in a data stream manager. In VLDB, pages 309--320, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. P. A. Tucker, D. Maier, T. Sheard, and L. Fegaras. Exploiting punctuation semantics in continuous data streams. IEEE Trans. Knowl. Data Eng., 15(3):555--568, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. T. Urhan and M. J. Franklin. Dynamic pipeline scheduling for improving interactive query performance. In VLDB, pages 501--510, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. Wang, N. H. Chan, S. Papadimitriou, C. Faloutsos, and T. M. Madhyastha. Data mining meets performance evaluation: Fast algorithms for modeling bursty traffic. In ICDE, pages 507--516, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. J. Wu, K.-L. Tan, and Y. Zhou. Qos-oriented multi-query scheduling over data streams. In DASFAA, pages 215--229, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Y. Zhou, J. Wu, and A. K. Leghari. Multi-query scheduling for time-critical data stream applications. Unpublished manuscript available from the authors, 2013.Google ScholarGoogle Scholar

Index Terms

  1. Multi-query scheduling for time-critical data stream applications

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      SSDBM '13: Proceedings of the 25th International Conference on Scientific and Statistical Database Management
      July 2013
      401 pages
      ISBN:9781450319218
      DOI:10.1145/2484838

      Copyright © 2013 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 July 2013

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate56of146submissions,38%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader