skip to main content
10.1145/3571306.3571402acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicdcnConference Proceedingsconference-collections
research-article

cHPCe: Data Locality and Memory Bandwidth Contention-aware Containerized HPC

Published:04 January 2023Publication History

ABSTRACT

The High-Performance Computing (HPC) community started using Operating System virtualization, aka Containerization, due to its near-native performance compared to the BareMetal environment. Despite several existing container solutions for HPC, users are still unconvinced about the suitability of container orchestration solutions for their extreme-scale applications due to the lack of thorough performance assessment on recent advances. A key concern in current generation containerized HPC is the inherent performance degradation due to resource interference by co-hosted applications. This paper proposes an analytical model for data locality and memory bandwidth contention-aware container placement strategy for our developed containerized High-Performance Computing environment (cHPCe). Performance is evaluated and compared against LXD, Docker Swarm, Kubernetes, and Singularity using HPC Challenge and NAS parallel benchmarks. To the best of our knowledge, no study has been reported yet for such a comparative analysis with insight into data locality and memory bandwidth contention-aware container placement. The experimental result shows that data locality and memory bandwidth contention-awareness reduce the overall execution time of the benchmark in cHPCe by 51.41% in the best case compared to the Docker Swarm.

References

  1. Surya Kant Garg and J Lakshmi. 2017. Workload performance and interference on containers. In IEEE SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI. 1–6.Google ScholarGoogle Scholar
  2. D Ghatrehsamani et al.2020. The art of cpu-pinning: Evaluating and improving the performance of virtualization and containerization platforms. In 49th ICPP. ACM, 1–11.Google ScholarGoogle Scholar
  3. A Hartstein et al.2006. Cache miss behavior: is it √2?. In Proc. of the 3rd conference on Computing frontiers. ACM, 313–320.Google ScholarGoogle Scholar
  4. Al Jawarneh et al.2019. Container orchestration engines: A thorough functional and performance comparison. In IEEE ICC. 1–6.Google ScholarGoogle Scholar
  5. Animesh Kuity and Sateesh K Peddoju. 2017. Performance evaluation of container-based high performance computing ecosystem using OpenPOWER. In International Conference on HiPC. Springer, 290–308.Google ScholarGoogle Scholar
  6. J Langguth et al.2018. Memory bandwidth contention: communication vs computation tradeoffs in supercomputers with multicore architectures. In 24th ICPADS. IEEE, 497–506.Google ScholarGoogle Scholar
  7. J Lin et al.2008. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. In 14th International Symposium on High Performance Computer Architecture. IEEE, 367–378.Google ScholarGoogle Scholar
  8. V Meyer et al.2020. An interference-aware application classifier based on machine learning to improve scheduling in clouds. In 28th Euromicro International Conference on PDP. IEEE, 80–87.Google ScholarGoogle Scholar
  9. V Meyer et al.2021. ML-driven classification scheme for dynamic interference-aware resource scheduling in cloud infrastructures. Journal of Systems Architecture 116 (2021), 102064.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Chiang RC. 2020. Contention-aware container placement strategy for docker swarm with machine learning based clustering algorithms. Cluster Computing (2020), 1–11.Google ScholarGoogle Scholar
  11. MG Xavier et al.2013. Performance evaluation of container-based virtualization for high performance computing environments. In 21st Euromicro International Conference on PDP. IEEE, 233–240.Google ScholarGoogle Scholar
  12. Di Xu et al.2010. On mitigating memory bandwidth contention through bandwidth-aware scheduling. In Proc. of the 19th international conference on PACT. ACM, 237–248.Google ScholarGoogle Scholar
  13. D Zhao et al.2018. Locality-aware scheduling for containers in cloud computing. IEEE Transactions on cloud computing 8, 2 (2018), 635–646.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. cHPCe: Data Locality and Memory Bandwidth Contention-aware Containerized HPC

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          ICDCN '23: Proceedings of the 24th International Conference on Distributed Computing and Networking
          January 2023
          461 pages
          ISBN:9781450397964
          DOI:10.1145/3571306

          Copyright © 2023 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 4 January 2023

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format