skip to main content
10.1145/3491003.3498354acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicdcnConference Proceedingsconference-collections
demonstration

Memory Allocation for Neural Networks using Graph Coloring

Published:24 January 2022Publication History

ABSTRACT

The Memory Allocation problem for neural networks can be represented as a two-dimensional optimization problem. The neural network is allocated into limited memory space while allocating as much data as possible into the low latency memory. Our solution is based on a generalization of graph coloring, edge-to-node transformation and considers the order in which the graph nodes are colored. We observed improvement of more than 40% in SRAM memory bandwidth in various neural networks.

References

  1. A. Krizhevsky, I. Sutskever and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," Advances in Neural Information Processing Systems, pp. 1097-1105, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Zhang, S. H. Yeung, Y. Shu, B. He and W. Wang, "Efficient memory management for gpu-based deep learning systems," arXiv:1903.06631, 2019.Google ScholarGoogle Scholar
  3. S. V. Pemmaraju, R. Raman, and K. Varadarajan. "Buffer minimization using max-coloring," SODA, vol. 4, 2004.Google ScholarGoogle Scholar
  4. Y. Pisarchyk and J. Lee, "Efficient Memory Management for Deep Neural Net Inference," arXiv preprint arXiv:2001.03288, 2020.Google ScholarGoogle Scholar
  5. L. Li, H. Feng and J. Xue, "Compiler-directed Scratchpad Memory Management via Graph Coloring," ACM Trans. Archit. Code Optim., vol. 6, no. 3, pp. 1-17, 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H. A. Kierstead, "A polynomial time approximation algorithm for dynamic storage allocation," Discrete Mathematics, vol. 88, pp. 231-237, 1991.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Memory Allocation for Neural Networks using Graph Coloring
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICDCN '22: Proceedings of the 23rd International Conference on Distributed Computing and Networking
        January 2022
        298 pages
        ISBN:9781450395601
        DOI:10.1145/3491003

        Copyright © 2022 Owner/Author

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 24 January 2022

        Check for updates

        Qualifiers

        • demonstration
        • Research
        • Refereed limited

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format