skip to main content
10.1145/3061639.3062262acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article

HyCUBE: A CGRA with Reconfigurable Single-cycle Multi-hop Interconnect

Published:18 June 2017Publication History

ABSTRACT

CGRAs are promising as accelerators due to their improved energy-efficiency compared to FPGAs. Existing CGRAs support reconfigurability for operations, but not communications because of the static neighbor-to-neighbor interconnect, leading to both performance loss and increased complexity of the compiler. In this paper, we introduce HyCUBE, a novel CGRA architecture with a reconfigurable interconnect providing single-cycle communications between distant FUs, resulting in a new formulation of the application mapping problem that leads to the design of an efficient compiler. HyCUBE achieves 1.5X and 3X better performance-per-watt compared to a CGRA with standard NoC and a CGRA with neighbor-to-neighbor connectivity, respectively.

References

  1. Arm cortex-a5. https://goo.gl/pGytB2.Google ScholarGoogle Scholar
  2. Bouwens et al. Architectural exploration of the adres coarse-grained reconfigurable array. In ARC '07. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chen et al. Algorithmic optimizations for energy efficient throughput-oriented fft architectures on fpga. In IGCC '14.Google ScholarGoogle Scholar
  4. L. Chen et al. Graph minor approach for application mapping on cgras. TRETS '14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. B. De Sutter et al. Coarse-grained reconfigurable array architectures. In Handbook of signal processing systems. '13.Google ScholarGoogle Scholar
  6. M. R. Guthaus et al. Mibench. In WWC-4 '01.Google ScholarGoogle Scholar
  7. M. Hamzeh et al. Epimap: using epimorphism to map applications on cgras. In DAC '12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Hamzeh et al. Regimap: register-aware application mapping on coarse-grained reconfigurable architectures. In DAC '13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Kim et al. Ulp-srp: Ultra low power samsung reconfigurable processor for biomedical applications. In FPT '12.Google ScholarGoogle Scholar
  10. T. Krishna et al. Breaking the on-chip latency barrier using smart. In HPCA'13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. C. Lattner et al. Llvm: A compilation framework for lifelong program analysis & transformation. In CGO '04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. B. Mei et al. Adres: An architecture with tightly coupled vliw processor and coarse-grained reconfigurable matrix. In FPL '03.Google ScholarGoogle Scholar
  13. B. Mei et al. Dresc: A retargetable compiler for coarse-grained reconfigurable architectures. In FPT '02.Google ScholarGoogle Scholar
  14. B. Mei et al. Exploiting loop-level parallelism on coarse-grained reconfigurable architectures using modulo scheduling. IEE-Computers and Digital Techniques '03.Google ScholarGoogle Scholar
  15. Park et al. Efficient performance scaling of future cgras for mobile applications. In FPT '12.Google ScholarGoogle Scholar
  16. H. Park et al. Polymorphic pipeline array: a flexible multicore accelerator with virtualized execution for mobile multimedia applications. In MICRO '09. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. B. R. Rau. Iterative modulo scheduling: An algorithm for software pipelining loops. In MICRO '94. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Thomas et al. Cortexsuite. In IISWC '14.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    DAC '17: Proceedings of the 54th Annual Design Automation Conference 2017
    June 2017
    533 pages
    ISBN:9781450349277
    DOI:10.1145/3061639

    Copyright © 2017 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 18 June 2017

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate1,770of5,499submissions,32%

    Upcoming Conference

    DAC '24
    61st ACM/IEEE Design Automation Conference
    June 23 - 27, 2024
    San Francisco , CA , USA

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader