skip to main content
10.1145/3479876.3481591acmconferencesArticle/Chapter ViewAbstractPublication PagesnocsConference Proceedingsconference-collections
short-paper

NEWROMAP: mapping CNNs to NoC-interconnected self-contained data-flow accelerators for edge-AI

Published:08 October 2021Publication History

ABSTRACT

Conventional AI accelerators are limited by von-Neumann bottlenecks for edge workloads. Domain-specific accelerators (often neuromorphic) solve this by applying near/in-memory computing, NoC-interconnected massive-multicore setups, and data-flow computation. This requires an effective mapping of neural networks (i.e, an assignment of network layers to cores) to balance resources/memory, computation, and NoC traffic. Here, we introduce a mapping called Snake for the predominant convolutional neural networks (CNNs). It utilizes the feed-forward nature of CNNs by folding layers to spatially adjacent cores. We achieve a total NoC bandwidth improvement of up to 3.8X for MobileNet and ResNet vs. random mappings. Furthermore, NEWROMAP is proposed that continues to optimize Snake mapping through a meta-heuristic; it also simulates the NoC traffic and can work with TensorFlow models. The communication is further optimized with up to 22.52% latency improvement vs. pure snake mapping shown in simulations.

References

  1. A Hansson et al. 2007. Avoiding message-dependent deadlock in network-based systems on chip. VLSI design (2007).Google ScholarGoogle Scholar
  2. C Marcon et al. 2005. Exploring NoC mapping strategies: an energy and timing aware technique. In DATE. IEEE.Google ScholarGoogle Scholar
  3. CE Graves et al. 2019. Memristor TCAMs Accelerate Regular Expression Matching for Network Intrusion Detection. IEEE TNANO 18 (2019).Google ScholarGoogle Scholar
  4. F Akopyan et al. 2015. Truenorth: Design and tool flow of a 65 mw 1 million neuron programmable neurosynaptic chip. IEEE TCAD 34, 10 (2015).Google ScholarGoogle Scholar
  5. H Kwon et al. 2018. Maeri: Enabling flexible dataflow mapping over dnn accelerators via reconfigurable interconnects. ACM SIGPLAN Notices 53, 2 (2018).Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. JM Joseph et al. 2021. Ratatoskr: An open-source framework for in-depth power, performance and area analysis and optimization in 3D NoCs. ACM TOMACS 32, 1 (2021).Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M Davies et al. 2018. Loihi: A neuromorphic manycore processor with on-chip learning. Micro (2018).Google ScholarGoogle Scholar
  8. MF Reza et al. 2019. Energy-efficient and high-performance NoC architecture and mapping solution for deep neural networks. In NOCS. IEEE.Google ScholarGoogle Scholar
  9. O Moreira et al. 2020. NeuronFlow: A Hybrid Neuromorphic - Dataflow Processor Architecture for AI Workloads. In AICAS. IEEE.Google ScholarGoogle Scholar
  10. S Gupta et al. 2019. NNPIM: A Processing In-Memory Architecture for Neural Network Acceleration. IEEE Trans. Comput. 68, 9 (2019).Google ScholarGoogle Scholar
  11. S Han et al. 2015. Deep compression: Compressing deep neural networks with pruning, trained quantization and Huffman coding. arXiv:1510.00149 [cs.CV]Google ScholarGoogle Scholar
  12. SY Mirmahaleh et al. 2019. DNN pruning and mapping on NoC-Based communication infrastructure. Microelectronics Journal 94 (2019), 104655.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Y Chen, C Petti. 2016. ReRAM technology evolution for storage class memory application. In ESSDERC. IEEE.Google ScholarGoogle Scholar
  14. YH Chen et al. 2016. Eyeriss: A spatial architecture for energy-efficient dataflow for convolutional neural networks. SIGARCH Comp. Archit. News 44, 3 (2016).Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. NEWROMAP: mapping CNNs to NoC-interconnected self-contained data-flow accelerators for edge-AI
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            NOCS '21: Proceedings of the 15th IEEE/ACM International Symposium on Networks-on-Chip
            October 2021
            91 pages
            ISBN:9781450390835
            DOI:10.1145/3479876

            Copyright © 2021 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 8 October 2021

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • short-paper

            Acceptance Rates

            Overall Acceptance Rate14of44submissions,32%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader