skip to main content
10.1145/3328905.3329506acmconferencesArticle/Chapter ViewAbstractPublication PagesdebsConference Proceedingsconference-collections
research-article

Reinforcement Learning Based Policies for Elastic Stream Processing on Heterogeneous Resources

Published:24 June 2019Publication History

ABSTRACT

Data Stream Processing (DSP) has emerged as a key enabler to develop pervasive services that require to process data in a near real-time fashion. DSP applications keep up with the high volume of produced data by scaling their execution on multiple computing nodes, so as to process the incoming data flow in parallel. Workloads variability requires to elastically adapt the application parallelism at run-time in order to avoid over-provisioning. Elasticity policies for DSP have been widely investigated, but mostly under the simplifying assumption of homogeneous infrastructures. The resulting solutions do not capture the richness and inherent complexity of modern infrastructures, where heterogeneous computing resources are available on-demand. In this paper, we formulate the problem of controlling elasticity on heterogeneous resources as a Markov Decision Process (MDP). The resulting MDP is not easily solved by traditional techniques due to state space explosion, and thus we show how linear Function Approximation and Tile Coding can be used to efficiently compute elasticity policies at run-time. In order to deal with parameters uncertainty, we integrate the proposed approach with Reinforcement Learning algorithms. Our numerical evaluation shows the efficacy of the presented solutions compared to standard methods in terms of accuracy and convergence speed.

References

  1. Y. Al-Dhuraibi, F. Paraiso, N. Djarallah, and P. Merle. 2018. Elasticity in Cloud Computing: State of the Art and Research Challenges. IEEE Trans. Serv. Comput. 11 (2018), 430--447.Google ScholarGoogle ScholarCross RefCross Ref
  2. V. Cardellini, F. Lo Presti, M. Nardelli, and G. Russo Russo. 2018. Decentralized Self-Adaptation for Elastic Data Stream Processing. Future Gener. Comput. Syst. 87 (2018), 171--185.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. V. Cardellini, F. Lo Presti, M. Nardelli, and G. Russo Russo. 2018. Optimal Operator Deployment and Replication for Elastic Distributed Data Stream Processing. Concurr. Comput.: Pract. Exper. 30, 9 (2018), e4334.Google ScholarGoogle ScholarCross RefCross Ref
  4. M.D. de Assunção, A. da Silva Veith, and R. Buyya. 2018. Distributed data stream processing and edge computing: A survey on resource elasticity and future directions. J. Netw. Comput. Appl. 103 (2018), 1--17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. T. De Matteis and G. Mencagli. 2017. Proactive Elasticity and Energy Awareness in Data Stream Processing. J. Syst. Softw. 127 (2017), 302--319. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R.C. Fernandez, M. Migliavacca, E. Kalyvianaki, and P. Pietzuch. 2013. Integrating Scale Out and Fault Tolerance in Stream Processing Using Operator State Management. In Proc. ACM SIGMOD '13. 725--736. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. B. Gedik, S. Schneider, M Hirzel, and K. Wu. 2014. Elastic Scaling for Data Stream Processing. IEEE Trans. Parallel Distrib. Syst. 25, 6 (2014), 1447--1463. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Geramifard, T.J. Walsh, S. Tellex, G. Chowdhary, N. Roy, J.P. How, et al. 2013. A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning. Found. Trends in Mach. Learn. 6, 4 (2013), 375--451. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. P. Graubner, C. Thelen, M. Körber, A. Sterz, G. Salvaneschi, et al. 2018. Multimodal Complex Event Processing on Mobile Devices. In Proc. ACM DEBS '18. 112--123.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. V. Gulisano, R. Jiménez-Peris, M. Patiño Martinez, C. Soriente, and P. Valduriez. 2012. StreamCloud: An Elastic and Scalable Data Streaming System. IEEE Trans. Parallel Distrib. Syst. 23, 12 (2012), 2351--2365. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. He, Y. Chen, T. Z. J. Fu, X. Long, M. Winslett, L. You, and Z. Zhang. 2018. HaaS: Cloud-Based Real-Time Data Analytics with Heterogeneity-Aware Scheduling. In Proc. IEEE ICDCS '18. 1017--1028.Google ScholarGoogle Scholar
  12. T. Heinze, L. Aniello, L. Querzoni, and J. Zbigniew. 2014. Cloud-based Data Stream Processing. In Proc. ACM DEBS '14. 238--245. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. T. Heinze, V. Pappalardo, Z. Jerzak, and C. Fetzer. 2014. Auto-scaling Techniques for Elastic Data Stream Processing. In Proc. IEEE ICDEW '14. 296--302.Google ScholarGoogle Scholar
  14. M. Hirzel, R. Soulé, S. Schneider, B. Gedik, and R. Grimm. 2014. A Catalog of Stream Processing Optimizations. ACM Comput. Surv. 46, 4 (2014), 46:1--46:34. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Z. Jerzak and H. Ziekow. 2015. The DEBS 2015 Grand Challenge. In Proc. ACM DEBS '15. ACM, 266--268.Google ScholarGoogle Scholar
  16. A. Koliousis, M. Weidlich, R. Castro Fernandez, A.L. Wolf, P. Costa, and P. Pietzuch. 2016. SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures. In Proc. ACM SIGMOD '16. 555--569. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. M. Kretchmar and C. W. Anderson. 1997. Comparison of CMACs and Radial Basis Functions for Local Function Approximators in Reinforcement Learning. In Proc. ICNN '97, Vol. 2. 834--837.Google ScholarGoogle Scholar
  18. G. T. Lakshmanan, Y. Li, and R. Strom. 2008. Placement Strategies for Internet-scale Data Stream Systems. IEEE Internet Comput. 12, 6 (2008), 50--60. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. X. Liu, A.V. Dastjerdi, R.N. Calheiros, C. Qu, and R. Buyya. 2018. A Stepwise Auto-Profiling Method for Performance Optimization of Streaming Applications. ACM Trans. Auton. Adapt. Syst. 12, 4 (2018), 24:1--24:33. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Lohrmann, P. Janacik, and O. Kao. 2015. Elastic Stream Processing with Latency Guarantees. In Proc. IEEE ICDCS '15. 399--410.Google ScholarGoogle Scholar
  21. F. Lombardi, L. Aniello, S. Bonomi, and L. Querzoni. 2018. Elastic Symbiotic Scaling of Operators and Resources in Stream Processing Systems. IEEE Trans. Parallel Distrib. Syst. 29, 3 (2018), 572--585.Google ScholarGoogle ScholarCross RefCross Ref
  22. G. Mencagli. 2016. A Game-Theoretic Approach for Elastic Distributed Data Stream Processing. ACM Trans. Auton. Adapt. Syst. 11, 2 (2016), 13:1--13:34.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M.A.U. Nasir, G. De Francisci Morales, D. García-Soriano, N. Kourtellis, and M. Serafini. 2015. The Power of Both Choices: Practical Load Balancing for Distributed Stream Processing Engines. In Proc. IEEE ICDE '15. 137--148.Google ScholarGoogle Scholar
  24. M.L. Puterman. 1994. Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons. Google ScholarGoogle Scholar
  25. G. Russo Russo, M. Nardelli, V. Cardellini, and F. Lo Presti. 2018. Multi-Level Elasticity for Wide-Area Data Streaming Systems: A Reinforcement Learning Approach. Algorithms 11, 9 (2018), 134.Google ScholarGoogle Scholar
  26. F. Starks, V. Goebel, S. Kristiansen, and T. Plagemann. 2018. Mobile Distributed Complex Event Processing---Ubi Sumus? Quo Vadimus? In Mobile Big Data: A Roadmap from Models to Technologies. Springer, 147--180.Google ScholarGoogle Scholar
  27. R.S. Sutton. 1995. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding. In Proc. NIPS '95. MIT Press, 1038--1044. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. R.S. Sutton and A.G. Barto. 1998. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. C. Watkins and P. Dayan. 1992. Q-learning. Machine Learning 8, 3-4 (1992), 279--292. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. K.P. Yoon and C.-L. Hwang. 1995. Multiple Attribute Decision Making: an Introduction. Sage Pubs.Google ScholarGoogle Scholar

Index Terms

  1. Reinforcement Learning Based Policies for Elastic Stream Processing on Heterogeneous Resources

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          DEBS '19: Proceedings of the 13th ACM International Conference on Distributed and Event-based Systems
          June 2019
          291 pages
          ISBN:9781450367943
          DOI:10.1145/3328905

          Copyright © 2019 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 24 June 2019

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          DEBS '19 Paper Acceptance Rate13of47submissions,28%Overall Acceptance Rate130of553submissions,24%

          Upcoming Conference

          DEBS '24

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader