Abstract
In this article, we propose and implement a distributed autonomic manager that maintains service level agreements (SLA) for each application scenario. The proposed autonomic manager supports SLAs by configuring the bandwidth ratios for each application scenario and uses an overlay network as an infrastructure. The most important aspect of the proposed autonomic manager is its scalability which allows us to deal with geographically distributed cloud-based applications and a large volume of computation. This can be useful in look ahead optimization and in adaptations using complex models, such as machine learning. We formally prove the safety and liveness properties of the implemented distributed algorithms. Through experiments on the Amazon AWS cloud, using two different use cases, we demonstrate the elasticity and flexibility of the autonomic manager as a measure of its applicability to different cloud applications with different types of workloads. Experiments also demonstrate that increasing the size of a look ahead window, up to a certain size, improves the accuracy of the adaptation decisions by up to 50%.
- [1] . 1985. Actors: A Model of Concurrent Computation in Distributed Systems.
Technical Report . MASSACHUSETTS INST OF TECH CAMBRIDGE ARTIFICIAL INTELLIGENCE LAB.Google Scholar - [2] . 2018. Elasticity in cloud computing: State of the art and research challenges. IEEE Transactions on Services Computing 11, 2 (2018), 430–447.Google ScholarCross Ref
- [3] . 2017. Adaptive service management for cloud applications using overlay networks. In Proceedings of the 15th IEEE International Symposium on Integrated Network Management.Google ScholarDigital Library
- [4] . 2017. A model-based application autonomic manager with fine granular bandwidth control. In Proceedings of the IEEE 13th International Conference on Network and Service Management.Google ScholarCross Ref
- [5] . 2009. Statistical machine learning makes automatic control practical for internet datacenters. In Proceedings of the HotCloud.Google Scholar
- [6] . 2009. Engineering self-adaptive systems through feedback loops. Software Engineering for Self-adaptive Systems 5525 (2009), 48–70.Google ScholarDigital Library
- [7] . 2003. The maude 2.0 system. In Proceedings of the RTA. Springer, 76–87.Google ScholarCross Ref
- [8] . 2010. JAAF+ T: A framework to implement self-adaptive agents that apply self-test. In Proceedings of the 2010 ACM Symposium on Applied Computing. ACM, 928–935.Google ScholarDigital Library
- [9] . 2007. Weka: Practical machine learning tools and techniques with java implementations. AI Tools SeminarUniversity of Saarland, WS 6, 07 (2007).Google Scholar
- [10] . 2013. On estimating actuation delays in elastic computing systems. In Proceedings of the 2013 8th International Symposium on Software Engineering for Adaptive and Self-Managing Systems. IEEE, 33–42.Google ScholarDigital Library
- [11] . 2013. Assurance of self-adaptive controllers for the cloud. In Proceedings of the Assurances for Self-Adaptive Systems. Springer, 311–339.Google ScholarCross Ref
- [12] . 2012. Akka Essentials. Packt Publishing Ltd.Google Scholar
- [13] 2022. MinIO | High Performance, Kubernetes Native Object Storage. Retrieved from https://min.io/.Google Scholar
- [14] . 2003. The vision of autonomic computing. Computer 36, 1 (2003), 41–50.Google ScholarDigital Library
- [15] . 2007. Towards self-testing in autonomic computing systems. In Proceedings of the 8th International Symposium on Autonomous Decentralized Systems. IEEE, 51–58.Google ScholarDigital Library
- [16] . 2010. Migrating autonomic self-testing to the cloud. In Proceedings of the 2010 3rd International Conference on Software Testing, Verification, and Validation Workshops. IEEE, 438–443.Google ScholarDigital Library
- [17] . 2015. A survey on engineering approaches for self-adaptive systems. Pervasive and Mobile Computing 17 (2015), 184–206.Google ScholarDigital Library
- [18] . 2005. Generalized consensus and paxos. (2005).Google Scholar
- [19] . 2011. Using reinforcement learning for controlling an elastic web application hosting platform. In Proceedings of the 8th ACM International Conference on Autonomic Computing. ACM, 205–208.Google ScholarDigital Library
- [20] . 2012. Comparison of decision-making strategies for self-optimization in autonomic computing systems. ACM Transactions on Autonomous and Adaptive Systems 7, 4 (2012), 36.Google ScholarDigital Library
- [21] . 2012. A performance study on the vm startup time in the cloud. In Proceedings of the 2012 IEEE 5th International Conference on Cloud Computing. IEEE, 423–430.Google ScholarDigital Library
- [22] . 2017. IoT middleware: A survey on issues and enabling technologies. IEEE Internet of Things Journal 4, 1 (2017), 1–20.Google Scholar
- [23] . 2015. The raft consensus algorithm. Lecture Notes CS 190 (2015).Google Scholar
- [24] . 1999. An architecture-based approach to self-adaptive software. IEEE Intelligent Systems and Their Applications 14, 3 (1999), 54–62.Google ScholarDigital Library
- [25] . 2018. Auto-scaling web applications in clouds: A taxonomy and survey. ACM Computing Surveys 51, 4 (2018), 73.Google Scholar
- [26] . 2017. Self-test framework for self-adaptive software architecture. In Proceedings of the 2017 International Conference of Electronics, Communication and Aerospace Technology. IEEE, 669–674.Google ScholarCross Ref
- [27] . 2016. A survey on resource scheduling in cloud computing: Issues and challenges. Journal of Grid Computing 14, 2 (2016), 217–264.Google ScholarDigital Library
- [28] . 2004. Modeling and verification of reactive systems using rebeca. Fundamenta Informaticae 63, 4 (2004), 385–410.Google ScholarDigital Library
- [29] . 2017. Distributed Systems. Maarten van Steen Leiden, The Netherlands.Google Scholar
- [30] . 2015. Cloud resource management: A survey on forecasting and profiling models. Journal of Network and Computer Applications 47 (2015), 99–106.Google ScholarDigital Library
- [31] . 2018. Perpetual Assurances in Self-Adaptive Systems.
Lecture Notes in Computer Science , Vol. 9640. Springer, 31–63.DOI: Google ScholarCross Ref - [32] . 2019. Online Shopping Store - Web Server Logs.
DOI: Google ScholarCross Ref - [33] . 2019. Look ahead distributed planning for application management in cloud. In Proceedings of the 15th International Conference on Network and Service Management.Google ScholarCross Ref
- [34] . 2016. Designing adaptive applications deployed on cloud environments. ACM Transactions on Autonomous and Adaptive Systems 10, 4 (2016), 25.Google ScholarDigital Library
Index Terms
- Formally Verified Scalable Look Ahead Planning For Cloud Resource Management
Recommendations
Enabling scalable scientific workflow management in the Cloud
Cloud computing is gaining tremendous momentum in both academia and industry. In this context, we define the term "Cloud Workflow" as the specification, execution and provenance tracking of large-scale scientific workflows, as well as the management of ...
Topics in cloud incident management
Continuous advancement of cloud technologies, alongside their ever increasing stability, adoption, and ease of use, has led to a rise in native cloud applications, possibly over a larger pool of heterogeneous resources or in multi-cloud approaches. This,...
Autonomic Cloud Storage: Challenges at Stake
CISIS '10: Proceedings of the 2010 International Conference on Complex, Intelligent and Software Intensive SystemsWhile the cloud computing paradigm is progressively being adopted by companies aiming to deliver large-scale distributed services, such as Amazon, IBM, Google or Yahoo!, the service level provided for data storage remains rather basic. This talk will ...
Comments