research-article

Providing fairness on shared-memory multiprocessors via process scheduling

Authors:

Zhenjiang WangAuthors Info & Claims

SIGMETRICS '12: Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems

Pages 295 - 306

https://doi.org/10.1145/2254756.2254792

Published: 11 June 2012 Publication History

Abstract

Competition for shared memory resources on multiprocessors is the most dominant cause for slowing down applications and makes their performance varies unpredictably. It exacerbates the need for Quality of Service (QoS) on such systems. In this paper, we propose a fair-progress process scheduling (FPS) policy to improve system fairness. Its strategy is to force the equally-weighted applications to have the same amount of slowdown when they run concurrently. The basic approach is to monitor the progress of all applications at runtime. When we find an application suffered more slowdown and accumulated less effective work than others, we allocate more CPU time to give it a better parity. Our policy also allows different weights to different threads, and provides an effective and robust tuner that allows the OS to freely make tradeoffs between system fairness and higher throughput. Evaluation results show that FPS can significantly improve system fairness by an average of 53.5% and 65.0% on a 4-core processor with a private cache and a 4-core processor with a shared cache, respectively. The penalty is about 1.1% and 1.6% of the system throughput. For memory-intensive workloads, FPS also improves system fairness by an average of 45.2% and 21.1% on 4-core and 8-core system respectively at the expense of a throughput loss of about 2%.

References

[1]

S. Zhuravlev, S. Blagodurov, and A. Fedorova. Addressing shared resource contention in multicore processors via scheduling. In ASPLOS-19, 2010.

Digital Library

[2]

E. Ebrahimi, C. J. Lee, O. Mutlu, and Y. N. Patt. Fairness via source throttling: a configurable and high-performance fairness substrate for multi-core memory systems. In ASPLOS-19, 2010.

Digital Library

[3]

A. Fedorova, M. Seltzer, and M. D. Smith. Improving Performance Isolation on Chip Multiprocessors via an Operating System Scheduler. In PACT-16, 2007.

Digital Library

[4]

D. Xu, C. Wu, and P. C. Yew. On mitigating memory bandwidth contention through bandwidth-aware scheduling. In PACT-19, 2010.

Digital Library

[5]

H. Y. Cheng, C. H. Lin, J. Li, and C. L. Yang. Memory Latency Reduction via Thread Throttling. In MICRO-43, 2010.

Digital Library

[6]

O. Mutlu and T. Moscibroda. Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors. In MICRO-40, 2007.

Digital Library

[7]

R. Iyer, L. Zhao, F. Guo, R. Illikkal, S. Makineni, D. Newell, Y. Solihin, L. Hsu, and S. Reinhardt. QoS policies and architecture for cache/memory in CMP platforms. In SIGMETRICS, 2007.

Digital Library

[8]

R. Iyer. CQoS: a framework for enabling QoS in shared caches of CMP platforms. In ICS-18, 2004.

Digital Library

[9]

K. Luo, J. Gummaraju, and M. Franklin. Balancing throughput and fairness in SMT processors. In ISPASS, 2001.

[10]

K. J. Nesbit, N. Aggarwal, J. Laudon, and J. E. Smith. Fair Queuing Memory Systems. In MICRO-39, 2006.

Digital Library

[11]

A. Snavely and D. M. Tullsen. Symbiotic job scheduling for a simultaneous multithreaded processor. In ASPLOS-9, 2000.

Digital Library

[12]

C. D. Antonopoulos, D. S. Nikolopoulos, and T. S. Papatheodorou. Scheduling algorithms with bus bandwidth con-siderations for smps. In ICPP, 2003.

[13]

E. Koukis and N. Koziris. Memory and network bandwidth aware scheduling of multiprogrammed workloads on clusters of SMPs. In ICPADS-12, 2006.

Digital Library

[14]

T. Sherwood, E. Perelman, G. Hamerly, and B. Calder. Automatically characterizing large scale program behavior. In ASPLOS-10, 2002.

Digital Library

[15]

A. S. Dhodapkar and J. E. Smith. Comparing program phase detection techniques. In MICRO-36, 2003.

Digital Library

[16]

T. Sherwood, S. Sair, and B. Calder. Phase tracking and prediction. In ISCA-30, 2003.

Digital Library

[17]

T. Sherwood, E. Perelman, B. Calder. Basic Block Distribution Analysis to Find Periodic Behavior and Simulation Points in Applications. In PACT-10, 2001.

Digital Library

[18]

D. Chandra, F. Guo, S. Kim, and Y. Solihin. Predicting Inter-Thread Cache Contention on a Chip Multi-Processor Architecture. In HPCA-11, 2005.

Digital Library

[19]

C. CaBcaval and D. A. Padua. Estimating cache misses and locality using stack distances. In ICS-17, 2003.

Digital Library

[20]

S. Kim, D. Chandra, and Y. Solihin. Fair Cache Sharing and Partitioning in a Chip Multiprocessor Architecture. In PACT-13, 2004.

Digital Library

[21]

O. Mutlu and T. Moscibroda. Parallelism-Aware Batch Scheduling: Enhancing both Performance and Fairness of Shared DRAM Systems. In ISCA-35, 2008

Digital Library

Cited By

Jin WPeng X(2023)SLITS: Sparsity-Lightened Intelligent Thread SchedulingACM SIGMETRICS Performance Evaluation Review10.1145/3606376.359356851:1(21-22)Online publication date: 27-Jun-2023
https://doi.org/10.1145/3606376.3593568
Bilbao CSaez JPrieto-Matias M(2023)Divide&Content: A Fair OS-Level Resource Manager for Contention Balancing on NUMA MulticoresIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2023.330999934:11(2928-2945)Online publication date: Nov-2023
https://doi.org/10.1109/TPDS.2023.3309999
Garcia-Garcia ASaez JCastro FPrieto-Matias M(2019)LFOCProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337925(1-10)Online publication date: 5-Aug-2019
https://dl.acm.org/doi/10.1145/3337821.3337925
Show More Cited By

Index Terms

Providing fairness on shared-memory multiprocessors via process scheduling
1. Software and its engineering
  1. Software organization and properties
    1. Contextual software domains
      1. Operating systems
        Process management
        Scheduling

Recommendations

Providing fairness on shared-memory multiprocessors via process scheduling
Performance evaluation review

Competition for shared memory resources on multiprocessors is the most dominant cause for slowing down applications and makes their performance varies unpredictably. It exacerbates the need for Quality of Service (QoS) on such systems. In this paper, we ...
Scalable directory architecture for distributed shared memory chip multiprocessors

Traditional Directory-based cache coherence protocol is far from optimal for large-scale cache coherent shared memory multiprocessors due to the increasing latency to access directories stored in DRAM memory. Instead of keeping directories in main ...
Cache memory design and performance issues in shared-memory multiprocessors

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGMETRICS '12: Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems

June 2012

450 pages

ISBN:9781450310970

DOI:10.1145/2254756

General Chair:
Peter Harrison
Imperial College London, United Kingdom
,
Program Chairs:
Martin Arlitt
HP Labs, USA and University of Calgary, Canada
,
Giuliano Casale
Imperial College London, United Kingdom

ACM SIGMETRICS Performance Evaluation Review Volume 40, Issue 1
Performance evaluation review
June 2012
433 pages
ISSN:0163-5999
DOI:10.1145/2318857
Issue’s Table of Contents

Copyright © 2012 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMETRICS: ACM Special Interest Group on Measurement and Evaluation
IFIP: International Federation for Information Processing

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2012

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

SIGMETRICS '12

Sponsor:

SIGMETRICS
IFIP

SIGMETRICS '12: ACM SIGMETRICS/PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems

June 11 - 15, 2012

London, England, UK

Acceptance Rates

Overall Acceptance Rate 459 of 2,691 submissions, 17%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

23
Total Citations
View Citations
545
Total Downloads

Downloads (Last 12 months)18
Downloads (Last 6 weeks)1

Reflects downloads up to 25 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Jin WPeng X(2023)SLITS: Sparsity-Lightened Intelligent Thread SchedulingACM SIGMETRICS Performance Evaluation Review10.1145/3606376.359356851:1(21-22)Online publication date: 27-Jun-2023
https://doi.org/10.1145/3606376.3593568
Bilbao CSaez JPrieto-Matias M(2023)Divide&Content: A Fair OS-Level Resource Manager for Contention Balancing on NUMA MulticoresIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2023.330999934:11(2928-2945)Online publication date: Nov-2023
https://doi.org/10.1109/TPDS.2023.3309999
Garcia-Garcia ASaez JCastro FPrieto-Matias M(2019)LFOCProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337925(1-10)Online publication date: 5-Aug-2019
https://dl.acm.org/doi/10.1145/3337821.3337925
Garcia-Garcia ASaez JPrieto-Matias M(2018)Contention-Aware Fair Scheduling for Asymmetric Single-ISA Multicore SystemsIEEE Transactions on Computers10.1109/TC.2018.283641867:12(1703-1719)Online publication date: 1-Dec-2018
https://doi.org/10.1109/TC.2018.2836418
Garcia-Garcia ASaez JPrieto-Matias M(2018)Delivering Fairness on Asymmetric Multicore Systems via Contention-Aware SchedulingEuro-Par 2017: Parallel Processing Workshops10.1007/978-3-319-75178-8_49(610-622)Online publication date: 8-Feb-2018
https://doi.org/10.1007/978-3-319-75178-8_49
Feliu JSahuquillo JPetit SDuato J(2017)Perf&FairIEEE Transactions on Computers10.1109/TC.2016.262097766:5(905-911)Online publication date: 1-May-2017
https://dl.acm.org/doi/10.1109/TC.2016.2620977
Zhao JCui HXue JFeng X(2016)Predicting Cross-Core Performance Interference on Multicore Processors with Regression AnalysisIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2015.244298327:5(1443-1456)Online publication date: 1-May-2016
https://dl.acm.org/doi/10.1109/TPDS.2015.2442983
Ayodele ARao JBoult T(2016)Towards Application-centric Fairness in Multi-tenant Clouds with Adaptive CPU Sharing Model2016 IEEE 9th International Conference on Cloud Computing (CLOUD)10.1109/CLOUD.2016.0056(367-375)Online publication date: Jun-2016
https://doi.org/10.1109/CLOUD.2016.0056
Wu SPeng YJin H(2016)Time Donating Barrier for efficient task scheduling in competitive multicore systemsFuture Generation Computer Systems10.1016/j.future.2015.04.00554(469-477)Online publication date: Jan-2016
https://doi.org/10.1016/j.future.2015.04.005
He WCui HLu BZhao JLi SRuan GXue JFeng XYang WYan YBhuyan LChong FSarkar V(2015)Hadoop+Proceedings of the 29th ACM on International Conference on Supercomputing10.1145/2751205.2751236(143-153)Online publication date: 8-Jun-2015
https://dl.acm.org/doi/10.1145/2751205.2751236
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten