Skip to main content

Grouped Processing of Relational Algebra Expressions over Data Streams

  • Conference paper
Advances in Databases and Information Systems (ADBIS 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3255))

  • 418 Accesses

Abstract

Implementation of the data stream processing applications requires a method for formal specification of the computations at a dataflow level. The logical models of stream processing hide the lower level implementation details. To solve this problem, we propose a new model of data stream processing based on the concepts of relational data stream, extensible system of elementary operations on relational streams, and data stream processing network integrating the dataflows and elementary operations. Next, we present the transformations of grouped data stream processing applications into data stream processing networks. The transformations proposed in the paper integrate the networks and optimize the implementations through elimination of the redundant elementary operations and dataflows. Finally, the paper introduces a timestamp based synchronization of data flows in our model and discusses its correctness.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and issues in data stream systems. In: Popa, L. (ed.) Proceedings of the Twenty-first ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pp. 1–16. ACM Press, New York (2002)

    Chapter  Google Scholar 

  2. Madden, S., Franklin, M.J.: Fjording the stream: An architecture for queries over streaming sensor data. In: 18th International Conference on Data Engineering, San Jose, California, February 26-March 1, IEEE, Los Alamitos (2002)

    Google Scholar 

  3. Arasu, A., Babcock, B., Babu, S., McAlister, J., Widom, J.: Characterizing memory requirements for queries over continuous data streams. In: Popa, L. (ed.) Proceedings of the Twenty-first ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pp. 221–232. ACM Press, New York (2002)

    Chapter  Google Scholar 

  4. Babu, S., Widom, J.: Continuous queries over data streams. SIGMOD Record 30, 109–120 (2001)

    Article  Google Scholar 

  5. Krishnamurthy, S., Chandrasekaran, S., Cooper, O., Deshpande, A., Franklin, M.J., Hellerstein, J.M., Hong, W., Madden, S.R., Reiss, F., Shah, M.A.: Telegraphcq: An architectural status report. Bulletin of the Technical Committee on Data Engineering 26, 11–18 (2003)

    Google Scholar 

  6. Hellerstein, J.M., Franklin, M.J., Chandrasekaran, S., Deshpande, A., Hildrum, K., Madden, S., Raman, V., Shah, M.A.: Adaptive query processing: Technology in evolution. Bulletin of the Technical Committee on Data Engineering 23, 7–18 (2000)

    Google Scholar 

  7. Cole, R.L.: A decision theoretic cost model for dynamic plans. Bulletin of the Technical Committee on Data Engineering 23, 34–41 (2000)

    Google Scholar 

  8. Bouganim, L., Fabret, F., Mohan, C.: A dynamic query processing architecture for data integration systems. Bulletin of the Technical Committee on Data Engineering 23, 42–48 (2000)

    Google Scholar 

  9. Ives, Z.G., Levy, A.Y., Weld, D.S., Florescu, D., Friedman, M.: Adaptive query processing for internet applications. Bulletin of the Technical Committee on Data Engineering 23, 19–26 (2000)

    Google Scholar 

  10. Urhan, T., Franklin, M.J.: Xjoin: A reactively-scheduled pipelined join operator. IEEE Data Engineering Bulletin 23(2), 27–33 (2000)

    Google Scholar 

  11. Terry, D., Goldberg, D., Nichols, D., Oki, B.: Continuous queries over append-only databases. In: Proceedings of the 1992 ACM SIGMOD International Conference on Management of Data, pp. 321–330 (1992)

    Google Scholar 

  12. Liu, L., Pu, C., Tang, W.: Continual queries for internet scale event-driven information delivery. IEEE Transactions on Knowledge and Data Engineering 11, 610–628 (1999)

    Article  Google Scholar 

  13. Hellerstein, A.R.: Eddies: Continuously adaptive query processing. In: Proc. ACMSIGMOD International Conference on Management of Data, pp. 106–117 (1998)

    Google Scholar 

  14. Fiat, A., Woeginger, G.J.: On Line Algorithms, The State of the Art. Springer, Heidelberg (1998)

    Google Scholar 

  15. Hellerstein, J.M., Haas, P.J., Wang, H.J.: Online aggregation. In: Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data. SIGMOD Record, pp. 171–182 (1997)

    Google Scholar 

  16. Hellerstein, J.M., Haas, P.J., Wang, H.J.: Online aggregation. In: SIGMOD 1997, Proceedings ACM SIGMOD International Conference on Management of Data, pp. 171–182. ACM Press, New York (1997)

    Chapter  Google Scholar 

  17. Lee, E.A., Parks, T.M.: Dataflow process networks. Technical report, Department of Electrical Engineering and Computer Science, University of California (1995)

    Google Scholar 

  18. Stephens, R.: A survey of stream processing. Technical Report CSRG95-05, Department of Electronic and Electrical Engineering, University of Surrey (1996)

    Google Scholar 

  19. Madden, S., Shah, M., Hellerstein, J.M., Raman, V.: Continuously adaptive continuous queries over streams. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, Wisconsin, June 4-6, pp. 49–60. ACM Press, New York (2002)

    Chapter  Google Scholar 

  20. Group, T.S.: Stream: The stanford stream data manager. Bulletin of the Technical Committee on Data Engineering 26, 19–26 (2003)

    Google Scholar 

  21. Avnur, R., Hellerstein, J.M.: Eddies: Continuously adaptive query processing. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 261–272. ACM, New York (2000)

    Chapter  Google Scholar 

  22. Stonebraker, M., Cherniack, M., Cetintemel, U., Balazinska, M., Balakrishnan, H.: The aurora and medusa projects. Bulletin of the Technical Committee on Data Engineering 26, 3–10 (2003)

    Google Scholar 

  23. Viglas, S.D., Naughton, J.F.: Rate-based query optimization for streaming information sources. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, pp. 37–48. ACM Press, New York (2002)

    Chapter  Google Scholar 

  24. Das, A., Gehrke, J., Riedewald, M.: Approximate join processing over data streams. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, June 9-12 (2003)

    Google Scholar 

  25. Ganguly, S., Garofalakis, M., Rastogi, R.: Processing set expressions over continuous update streams. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, June 9-12 (2003)

    Google Scholar 

  26. Getta, J., Vossough, E.: Optimization of data stream processing. Submitted for publication in SIGMOD Record (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Getta, J.R., Vossough, E. (2004). Grouped Processing of Relational Algebra Expressions over Data Streams. In: Benczúr, A., Demetrovics, J., Gottlob, G. (eds) Advances in Databases and Information Systems. ADBIS 2004. Lecture Notes in Computer Science, vol 3255. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30204-9_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30204-9_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23243-8

  • Online ISBN: 978-3-540-30204-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics