Abstract
The design of Digital Signal Processing systems involving video data has many challenges other than the optimal design of the processing algorithms. The data handling starting from the raw data storage and retrieval for efficient processing to store back for further analysis, has many challenges. The memory data layout, access patterns, and interface design are critical factors that need more investigation in the architectural design of video processing systems in general. With the increase in video post-processing algorithms’ complexity, the memory access pattern is turning more and more irregular. Unless the memory data layout, i.e., data storage pattern across the Dynamic Random Access Memory (DRAM), is tailored to the application, the memory bandwidth and power dissipation will be affected adversely. Memory efficiency in terms of throughput and energy consumption plays a vital role in deciding the overall efficiency of such systems involving a massive volume of data. An exhaustive analysis of different memory data layouts and access patterns for video data received in raster scan order and processed in blocks is conducted in this work. A Field Programmable Gate Array (FPGA) based video stabilization system for marine surveillance using inbuilt memory controller Intellectual Property (IP) is the user application under consideration. The optimal memory data layout and access patterns for this system are proposed here, and we can extend the findings from this study to many other application scenarios.
Similar content being viewed by others
References
Bhati I, Chang M-T, Chishti Z, Shih-Lien L, Jacob B (2016) DRAM refresh mechanisms, penalties, and trade-offs. IEEE Trans Comput 65 (1):108–121
Chandrasekar K, Christian W, Yonghui L, Sven G, Matthias J, Omar N, Benny A, Norbert W, Kees G (2012) DRAMPower: Open-source DRAM power and energy estimation tool. http://www.drampower. info 22
Choi H, Dosun H, Jaesung L, Sungjoo Y (2020) Reducing DRAM refresh power consumption by runtime profiling of retention time and dual-row activation. Microprocessors Microsystems 72:102942
Delaluz V, Kandemir Mt, Vijaykrishnan N, Sivasubramaniam A, Irwin MJ (2001) Hardware and software techniques for controlling dram power modes. IEEE Trans Comput 50(11):1154–1173
(2015) DDR4 Point-to-Point Design GuideMicron Technical Note, TN-40-40
Jeong MK, Yoon DH, Sunwoo D, Sullivan M, Lee I, Erez M (2012) Balancing DRAM locality and parallelism in shared memory CMP systems. In: 2012 IEEE 18th international symposium on high performance computer architecture (HPCA), pp 1–12. IEEE
Jeong MK, Yoon DH, Sunwoo D, Sullivan M, Lee I, Erez M (2012) Balancing DRAM locality and parallelism in shared memory CMP systems. In: 2012 IEEE 18th International Symposium on High Performance Computer Architecture (HPCA), pp 1–12. IEEE
Lee K-B, Lin T-C, Jen C-W (2005) An efficient quality-aware memory controller for multimedia platform SoC. IEEE Trans Circ Syst Video Technol 15 (5):620–633
Marino MD, Kuan-Ching L (2018) RAMON: region-aware memory controller. IEEE Trans Very Large Scale Integr (VLSI) Syst 26(4):697–710
Micron Micron DDR3 Datasheet
Nedbailo YA, Petrov IA (2020) Increasing DDR4 SDRAM throughput in parallel workloads. In: 2020 Moscow workshop on electronic and networking technologies (MWENT), pp 1–4. IEEE
Unnikrishnan S, Sreelekha G (2019) Video stabilization performance enhancement for low-texture videos. J Real-Time Image Process 1–18
(2018) UltraScale Architecture-Based FPGAs Memory Interface Solutions v1.4LogiCore IP Product Guide, PG150
Wang X, Jian W, Xinxuan T, Meng Y, Jinmei L Design and Implementation of DDR3 SDRAM Controller. In: 2018 14th IEEE International Conference on Solid-State and Integrated Circuit Technology (ICSICT), pp 1–3. IEEE
Wilko G, Laurent O, Azeddine B (2021) Video stabilization: overview, challenges and perspectives. Signal Process: Image Commun 90:0923–5965
Youn JM, Cho D (2017) Improving memory system performance for multimedia applications. Multimed Tool Appl 76(4):5951–5963
Zulian EF, Weis C, Wehn N (2020) Access-aware per-bank DRAM refresh for reduced DRAM refresh overhead. In: 2020 IEEE International Symposium on Circuits and Systems (ISCAS), pp 1–5. IEEE
Funding
No funding was received to assist with the preparation of this manuscript. The authors have no competing interests to declare that are relevant to the content of this article.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Funding and Competing interests
No funding was received to assist with the preparation of this manuscript. The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Unnikrishnan, S., G, S. Performance analysis of memory data layout for sub-block data access. Multimed Tools Appl 82, 7229–7245 (2023). https://doi.org/10.1007/s11042-022-13564-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13564-x