ABSTRACT
Distributed Computing (DC) involves a collection of tasks (or modules) executed in parallel on different compute nodes connected through a network. Cloud Service providers (CSP) such as Azure[1], Amazon[2], and Google[3] are providing DC platforms as PaaS (Platform As A Service) offerings. These cloud platforms reduce implementation costs but have a significant drawback as these services can be configured to spawn only a single type of compute node for executing all the tasks in the DC environment. These drawback lead to inefficiency in execution cost and time as each task will have specific compute node requirements. This paper presents a novel framework called TreeOptimizer(TO) to resolve these shortcomings. TO uses a classifier-based dynamic task scheduling to determine the best available node to perform the task. The framework has been tested in Azure Batch[1] for an Oil Industry use case for extracting data from scanned images. Experimental results indicate that TO significantly reduces the overall execution cost by 68% and processing time by 8%. Although this paper uses Batch Service to explain the proposed framework, it can be applied to other PaaS DC platforms.
- [1] Microsoft Batch - https://learn.microsoft.com/en-us/azure/batch/batch-technical-overview.Google Scholar
- [2] AWS Batch -https://docs.aws.amazon.com/batch.Google Scholar
- [3] Google Batch - https://cloud.google.com/batch/docs/get-started.Google Scholar
- [4] Scanned Well Files - https://data.bsee.gov/Other/DiscMediaStore/ScanWellFiles.aspxGoogle Scholar
Index Terms
- TreeOptimizer: A classifier-based task scheduling framework
Recommendations
An Improved and Efficient Distributed Computing Framework with Intelligent Task Scheduling
Distributed Computing and Intelligent TechnologyAbstractDistributed Computing platforms involve multiple processing systems connected through a network and support the parallel execution of applications. They enable huge computational power and data processing with a quick response time. Examples of ...
Cloud Multi-Tenancy: Issues and Developments
UCC '17 Companion: Companion Proceedings of the10th International Conference on Utility and Cloud ComputingCloud Computing (CC) is a computational paradigm that provides pay-per use services to customers from a pool of networked computing resources that are provided on demand. Customers therefore does not need to worry about infrastructure or storage. Cloud ...
PaaS dependability integration architecture based on cloud brokering
SAC '16: Proceedings of the 31st Annual ACM Symposium on Applied ComputingCloud computing has revolutionized the way IT is provisioned nowadays since it exposes computing capabilities as rental resources to consumers. The emergence of cloud computing services hasn't though prevented outages in these environments even among ...
Comments