skip to main content
10.1145/3167132.3167196acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
research-article

Controller-aware memory coloring for multicore real-time systems

Published:09 April 2018Publication History

ABSTRACT

Memory latencies vary in non-uniform memory access (NUMA) systems so that execution times may become unpredictable in a multicore real-time system. This results in overly conservative scheduling with low utilization due to loose bounds on the worst-case execution time (WCET) of tasks. This work contributes a controller/node-aware memory coloring (CAMC) allocator inside the Linux kernel for the entire address space to reduce access conflicts and latencies by isolating tasks from one another. CAMC improves timing predictability and performance over Linux' buddy allocator and prior coloring methods. It provides core isolation with respect to banks and memory controllers for real-time systems. To our knowledge, this work is first to consider multiple memory controllers in real-time systems, combine them with bank coloring, and assess its performance on a NUMA architecture.

References

  1. Manu Awasthi, David W Nellans, Kshitij Sudan, Rajeev Balasubramonian, and Al Davis. Handling the problems and opportunities posed by multiple on-chip memory controllers. In International Conference on Parallel Architectures and Compilation Techniques, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Christian Bienia, Sanjeev Kumar, Jaswinder Pal Singh, and Kai Li. The parsec benchmark suite: Characterization and architectural implications. In PACT, October 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Sergey Blagodurov, Sergey Zhuravlev, Alexandra Fedorova, and Ali Kamali. A case for numa-aware contention management on multicore systems. In International Conference on Parallel Architectures and Compilation Techniques, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Micaiah Chisholm, Bryan C Ward, Namhoon Kim, and James H Anderson. Cache sharing and isolation tradeoffs in multicore mixed-criticality systems. In IEEE Real-Time Systems Symposium, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Pengcheng Huang, Georgia Giannopoulou, Rehan Ahmed, Davide B. Bartolini, and Lothar Thiele. An isolation scheduling model for multi-cores. In IEEE Real-Time Systems Symposium, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Hyoseung Kim, Dionisio de Niz, Björn Andersson, Mark Klein, Onur Mutlu, and Ragunathan Raj Rajkumar. Bounding memory interference delay in cots-based multi-core systems. In IEEE Real-Time Embedded Technology and Applications Symposium, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  7. Renaud Lachaize, Baptiste Lepers, Vivien Quéma, et al. Memprof: A memory profiler for numa multicore systems. In USENIX Annual Technical Conference, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Hui Li, Sudarsan Tandri, Michael Stumm, and Kenneth C Sevcik. Locality and loop scheduling on numa multiprocessors. In International Conference on Parallel Processing, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Lei Liu, Zehan Cui, Mingjie Xing, Yungang Bao, Mingyu Chen, and Chengyong Wu. A software memory partition approach for eliminating bank-level interference in multicore systems. In International Conference on Parallel Architectures and Compilation Techniques, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Zoltan Majo and Thomas R Gross. Matching memory access patterns and data placement for numa systems. In International Symposium on Code Generation and Optimization, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Zoltan Majo and Thomas R Gross. (mis) understanding the numa memory system performance of multithreaded workloads. In International Symposium on Workload Characterization, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  12. Renato Mancuso, Rodolfo Pellizzoni, Caccamo Marco, Lui Sha, and Heechul Yun. Wcet(m) estimation in multi-core systems using single core equivalence. In Euromicro Conference on Real-Time Systems, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Jaydeep Marathe, Vivek Thakkar, and Frank Mueller. Feedback-directed page placement for ccnuma via hardware-generated memory traces. Journal of Parallel and Distributed Computing, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Collin McCurdy and Jeffrey Vetter. Memphis: Finding and fixing numa-related performance problems on multi-core platforms. In International Symposium on Performance Analysis of Systems & Software, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  15. Takeshi Ogasawara. Numa-aware memory manager with dominant-thread-based copying gc.Google ScholarGoogle Scholar
  16. Xing Pan, Yasaswini J. Gownivaripalli, and Frank Mueller. Tintmalloc: Reducing memory access divergence via controller-aware coloring. In International Parallel and Distributed Processing Symposium, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  17. Rodolfo Pellizzoni and Heechul Yun. Memory servers for multicore systems. In IEEE Real-Time Embedded Technology and Applications Symposium, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  18. Xiao Zhang Sandhya Dwarkadas Kai Shen. Hardware execution throttling for multi-core resource management. In USENIX Annual Technical Conference, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Noriaki Suzuki, Hyoseung Kim, Dionisio de Niz, Bjorn Andersson, Lutz Wrage, Mark Klein, and Ragunathan Rajkumar. Coordinated bank and cache coloring for temporal protection of memory accesses. In International Conference on Computational Science and Engineering, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Bryan C. Ward. Relaxing resource-sharing constraints for improved hardware management and schedulability. In IEEE Real-Time Systems Symposium, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Zheng Pei Wu, Yogen Krish, and Rodolfo Pellizzoni. Worst case analysis of dram latency in multi-requestor systems. In IEEE Real-Time Systems Symposium, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Heechul Yun, Renato Mancuso, Zheng-Pei Wu, and Rodolfo Pellizzoni. Palloc: Dram bank-aware memory allocator for performance isolation on multicore platforms. In IEEE Real-Time Embedded Technology and Applications Symposium, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  23. Heechul Yun, Rodolfo Pellizzoni, and Prathap Valsan, Kumar. Parallelism-aware memory interference delay analysis for cots multi-core systems. In Euromicro Conference on Real-Time Systems, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Heechul Yun, Gang Yao, Rodolfo Pellizzoni, Marco Caccamo, and Lui Sha. Memguard: Memory bandwidth reservation system for efficient performance isolation in multi-core platforms. In IEEE Real-Time Embedded Technology and Applications Symposium, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Controller-aware memory coloring for multicore real-time systems

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SAC '18: Proceedings of the 33rd Annual ACM Symposium on Applied Computing
      April 2018
      2327 pages
      ISBN:9781450351911
      DOI:10.1145/3167132

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 9 April 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,650of6,669submissions,25%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader