DOI: 10.1145/2370816.2370865
research-article

Multi2Sim: a simulation framework for CPU-GPU computing

Published: 19 September 2012

ABSTRACT

Accurate simulation is essential for the proper design and evaluation of any computing platform. With the current move toward CPU-GPU heterogeneous computing, researchers need a simulation framework that can model both kinds of computing devices and their interaction. In this paper, we present Multi2Sim, an open-source, modular, and fully configurable toolset that enables ISA-level simulation of an x86 CPU and an AMD Evergreen GPU. Focusing on a model of the AMD Radeon 5870 GPU, we address program emulation correctness, as well as architectural simulation accuracy, using AMD's OpenCL benchmark suite. Simulation capabilities are demonstrated with a preliminary architectural exploration study and workload characterization examples. The project source code, benchmark packages, and a detailed user's guide are publicly available at www.multi2sim.org.


    Published in

      PACT '12: Proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques
      September 2012
      512 pages
      ISBN: 9781450311823
      DOI: 10.1145/2370816

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Acceptance Rates

      Overall Acceptance Rate: 121 of 471 submissions, 26%
