ABSTRACT
The memory allocation problem for neural networks can be represented as a two-dimensional optimization problem. The neural network must be allocated into a limited memory space while as much data as possible is placed in the low-latency memory. Our solution is based on a generalization of graph coloring and an edge-to-node transformation, and it takes into account the order in which the graph nodes are colored. We observed an improvement of more than 40% in SRAM memory bandwidth across various neural networks.
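To make the coloring view concrete, here is a minimal sketch of offset assignment treated as a generalized coloring problem. It is not the authors' algorithm. It assumes each buffer has a known size and lifetime, treats two buffers as conflicting when their lifetimes overlap, and gives each buffer the lowest address range not used by a conflicting buffer. The `Buffer` type, the `assign_offsets` and `peak_memory` helpers, the largest-first ordering, and the example data are all hypothetical. The split between SRAM and slower memory and the edge-to-node transformation are not modeled.

```python
from typing import NamedTuple


class Buffer(NamedTuple):
    name: str
    size: int   # bytes
    start: int  # first step at which the buffer is live
    end: int    # last step at which the buffer is live (inclusive)


def lifetimes_overlap(a: Buffer, b: Buffer) -> bool:
    """Two buffers conflict (share a graph edge) if they are live at the same time."""
    return a.start <= b.end and b.start <= a.end


def assign_offsets(buffers: list[Buffer]) -> dict[str, int]:
    """Greedy first-fit placement; a 'color' here is an address range."""
    # Ordering heuristic (an assumption, not from the paper): largest first.
    order = sorted(buffers, key=lambda b: (-b.size, b.start))
    placed: list[tuple[Buffer, int]] = []
    offsets: dict[str, int] = {}
    for buf in order:
        # Address ranges already taken by conflicting buffers, sorted by start.
        busy = sorted(
            (off, off + other.size)
            for other, off in placed
            if lifetimes_overlap(buf, other)
        )
        offset = 0
        for lo, hi in busy:
            if offset + buf.size <= lo:
                break  # the gap below this range is large enough
            offset = max(offset, hi)
        offsets[buf.name] = offset
        placed.append((buf, offset))
    return offsets


def peak_memory(buffers: list[Buffer], offsets: dict[str, int]) -> int:
    """Highest address used by any buffer under the given placement."""
    return max((offsets[b.name] + b.size for b in buffers), default=0)


# Hypothetical example: four tensors with overlapping lifetimes.
buffers = [
    Buffer("a", 4, 0, 1),
    Buffer("b", 2, 1, 2),
    Buffer("c", 4, 2, 3),
    Buffer("d", 2, 0, 3),
]
offsets = assign_offsets(buffers)
peak = peak_memory(buffers, offsets)
```

In this example, buffers `a` and `c` are never live at the same time, so they can share the same address range. Changing the sort key in `assign_offsets` changes the placement, which is one way to see why the coloring order matters.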
Index Terms
- Memory Allocation for Neural Networks using Graph Coloring