ABSTRACT
MapReduce and Hadoop represent an economically compelling alternative for efficient large-scale data processing and advanced analytics in the enterprise. A key challenge in shared MapReduce clusters is the ability to automatically tailor and control resource allocations to different applications so that they achieve their performance goals. Currently, no job scheduler for MapReduce environments can, given a job completion deadline, allocate the appropriate amount of resources to the job so that it meets the required Service Level Objective (SLO). In this work, we propose a framework, called ARIA, to address this problem. It comprises three inter-related components. First, for a production job that is routinely executed on new datasets, we build a job profile that compactly summarizes critical performance characteristics of the underlying application during the map and reduce stages. Second, we design a MapReduce performance model that, for a given job (with a known profile) and its SLO (soft deadline), estimates the amount of resources required for job completion within the deadline. Finally, we implement a novel SLO-based scheduler in Hadoop that determines job ordering and the amount of resources to allocate for meeting the job deadlines.
We validate our approach using a set of realistic applications. The new scheduler effectively meets the jobs' SLOs until the job demands exceed the cluster resources. The results of the extensive simulation study are validated through detailed experiments on a 66-node Hadoop cluster.
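The resource estimation described above can be illustrated with a simplified sketch. The idea follows Graham-style makespan bounds for n independent tasks on k slots: the completion time lies between (n/k)·avg and ((n-1)/k)·avg + max, where avg and max are the average and maximum task durations from the job profile. Inverting the upper bound gives a conservative slot count for a deadline. The function names and the single-stage model below are illustrative assumptions, not ARIA's exact formulation, which handles map and reduce stages separately:

```python
import math

def makespan_bounds(n_tasks, avg, mx, slots):
    """Lower/upper bounds on completion time of n_tasks independent tasks
    (average duration avg, maximum duration mx) on `slots` slots."""
    low = (n_tasks / slots) * avg
    up = ((n_tasks - 1) / slots) * avg + mx
    return low, up

def min_slots_for_deadline(n_tasks, avg, mx, deadline):
    """Smallest slot count whose upper-bound makespan meets the deadline;
    None if the deadline is infeasible (shorter than the longest task)."""
    if deadline <= mx:
        return None
    # Solve ((n_tasks - 1) / k) * avg + mx <= deadline for k.
    k = math.ceil((n_tasks - 1) * avg / (deadline - mx))
    return max(k, 1)
```

For example, 100 tasks with avg = 10s and max = 20s against a 120s deadline yield a conservative allocation of 10 slots, since ((100-1)/10)·10 + 20 = 119s ≤ 120s, while 9 slots would bound completion at 130s.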