Skip to main content

A Survey of Data Stream Processing Tools

  • Conference paper
  • First Online:
Information Sciences and Systems 2014

Abstract

In current international context boundaries set for applications are being pushed by the emergence of bursty and time-varying data streams required to be processed in near real-time. Furthermore, traditional techniques for data mining cannot be applied to data streams. Thus, stream-based applications must exhibit to excel at a plurality of requirements. According to defined rules presented in previous promulgated researches on this subject we differ stream-based applications and evaluate their aptitude to stream sources management. By this work we intend to present features and drawbacks of existing software coming from both industry and academic world, along with outlining our contribution to this field.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Apache Samza official website. http://samza.incubator.apache.org/

  2. Apache Storm official website. http://storm.incubator.apache.org/

  3. D.J. Abadi, Y. Ahmad, M. Balazinska, U. Çetintemel, M. Cherniack, J.-H. Hwang, W. Lindner, A. Maskey, A. Rasin, E. Ryvkina, N. Tatbul, Y. Xing, S.B. Zdonik. The design of the borealis stream processing engine. in: CIDR, pp. 277–289, 2005

    Google Scholar 

  4. D.J. Abadi, D. Carney, U. Çetintemel, M. Cherniack, C. Convey, S. Lee, M. Stonebraker, N. Tatbul, S. Zdonik, Aurora: a new model and architecture for data stream management. VLDB J. 12(2), 120–139 (2003)

    Google Scholar 

  5. M. Ali, B. Chandramouli, J. Goldstein, and R. Schindlauer. The extensibility framework in microsoft streaminsight. in: Proceedings of the 2011 IEEE 27th International Conference on Data Engineering, ICDE ’11, IEEE Computer Society, pp. 1242–1253. 2011

    Google Scholar 

  6. A. Arasu, B. Babcock, S. Babu, J. Cieslewicz, M. Datar, K. Ito, R. Motwani, U. Srivastava, J. Widom, Stream: the stanford stream data manager. Technical Report 2003–21, Stanford InfoLab, 2003

    Google Scholar 

  7. R.S. Barga, J. Goldstein, M.H. Ali, M. Hong. Consistent streaming through time: a vision for event stream processing. in: CIDR, pp. 363–374. www.cidrdb.org

  8. M. Cammert, C. Heinz, J. Krmer, A. Markowetz, B. Seeger, Pipes: a multi-threaded publish-subscribe architecture for continuous queries over streaming data sources. Technical report, 2003

    Google Scholar 

  9. T.M. Ghanem, M.A. Hammad, M.F. Mokbel, W.G. Aref, A.K. Elmagarmid, Incremental evaluation of sliding-window queries over data streams. IEEE Trans. Knowl. Data Eng. 19(1), 57–72 (2007)

    Article  Google Scholar 

  10. M. Gorawski, A. Chrószcz, Query processing using negative and temporal tuples in stream query engines. in: Advances in Software Engineering Techniques—4th IFIP TC 2 Central and East European Conference on Software Engineering Techniques, CEE-SET 2009, Revised Selected Papers, volume 7054 of Lecture Notes in Computer Science, (Springer, 2009) pp. 70–83

    Google Scholar 

  11. M. Gorawski, A. Chrószcz, StreamAPAS: query language and data model. in: 2009 International Conference on Complex, Intelligent and Software Intensive Systems, CISIS 2009, IEEE Computer Society, pp. 75–82, 2009

    Google Scholar 

  12. M. Gorawski, A. Gorawska, Research on the Stream ETL Process. in: Beyond Databases, Architectures, and Structures, volume 424 of Communications in Computer and Information Science, (Springer, 2014) pp. 61–71

    Google Scholar 

  13. M. Gorawski, A. Gorawska, K. Pasterak, Evaluation and development perspectives of stream data processing systems, in: Computer Networks, vol. 370 (Springer, Berlin, 2013), pp. 300–311

    Google Scholar 

  14. R. Kuntschke, B. Stegmaier, A. Kemper, A. Reiser, Streamglobe: processing and sharing data streams in grid-based p2p infrastructures. in: Proceedings of the 31st International Conference on Very Large Data Bases, VLDB, pp. 1259–1262. ACM, 2005

    Google Scholar 

  15. M. Stonebraker, U. Çetintemel, S. Zdonik, The 8 requirements of real-time stream processing. SIGMOD Rec. 34(4), 42–47 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marcin Gorawski .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Gorawski, M., Gorawska, A., Pasterak, K. (2014). A Survey of Data Stream Processing Tools. In: Czachórski, T., Gelenbe, E., Lent, R. (eds) Information Sciences and Systems 2014. Springer, Cham. https://doi.org/10.1007/978-3-319-09465-6_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09465-6_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09464-9

  • Online ISBN: 978-3-319-09465-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics