Abstract
Digital signal processors (DSPs), with their powerful computing abilities, are commonly used for multimedia coding/decoding processes. Therefore, the SOC (System on Chip) industry integrates DSP with ARM (Advanced RISC Machine) for input/output processing, saving power, and building up a multi-core platform used in handheld devices. The computing ability of ARM has been substantially improved with some state of the art technique. The industry currently has regarded the integration of ARM and DSP into SOC as two independent cores to enhance the efficiency. Since one algorithm is added to process the distributed computing work of the dual cores, the efficiency must be doubled. The system will assign the work to the core with higher processing efficiency. Instead of the traditional Static task scheduling, this article proposed a new approach called Dynamic task scheduling, providing 29.88 % higher efficiency than that of Static task scheduling. The reason is that the static partition will finally send the heavy load of computing to DSP; therefore, it is not possible to achieve the enhanced efficiency of the multi-core. However, the dynamic task scheduling will consider the actual loading of each core for computing and communicate with each other; furthermore, it can share the work to assist the process of each core. Besides, the Direct Memory Access is integrated with the multi-core platform to reduce the time-consumption resulted from the communication between the dual cores. The experimental result shows that the dynamic partition operated by the heterogeneous dual core system can use 192 MHz pulse to decode the CIF video signal that the decoding speed can reach 30fps and the efficiency is improved 50 % with DMA (Direct Memory Access) technology incorporated.
Similar content being viewed by others
References
Byeong-Doo Choi, Kang-Sun Choi, Sung-Jea Ko, Aldo W Morales (2003) “Efficient Real-Time Implementation of MPEG-4 Audiovisual Decoder Using DSP and RISC Chips,” IEEE International Conference on Consumer Electronics (ICCE)
Casavant TL, Kuhl JG (1988) A taxonomy of scheduling in general-purpose distributed computing systems. IEEE Trans Softw Eng 14(2):141–154
Cheng-Nan Chiu, Chien-Tang Tseng, and Chun-Jen Tsai (2005) “Tightly-coupled MPEG-4 video encoder framework on asymmetric dual-core platforms,” Circuits and Systems. ISCAS 2005. Vol. 3, Pages: 2132–2135
Fengguang Song, Asim Yarkhan and Jack Dongarra (2009) “Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems,”, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis Article No. 19
Hakem Beitollahi and Geert Deconinck (2006) “Fault-Tolerant Partitioning Scheduling Algorithms in Real-Time Multiprocessor Systems,” Proceedings of the 12th Pacific Rim International Symposium on Dependable Computing, Pages 296–304
Heirich A, Arvo J (1998) A competitive analysisof load balancing strategies for parallel ray tracing. J Supercomput 12(1–2):57–68
Koziri M, Zacharis D, Katsavounidis I, Bellas N (2011) “Implementation of the AVS Video Decoder on a Heterogeneous Dual-Core SIMD Processor” IEEE Transactions on Consumer Electronics. Page (s): 673–681
Kyu Ha Lee, Keun-Sup Lee, Tae-Hoon Hwang, Young-Cheol Park and Dae Hee Youn (2001) “An Architecture and Implementation of MPEG Audio Layer III Decoder using Dual-Core DSP,” IEEE Transactions on Consumer Electronics, November
Lee JY, Lee J-J, Park SM (2010) “Multi-core platform for an efficient H.264 and VC-1video decoding based on macroblock row-level parallelism”. Circ, Devices Syst, IET 4(2):147–158
T Liu, and C Choudary (2004) “Content-aware streaming of lecture videos over wireless networks,” Proceedings of IEEE Sixth International Symposium on Multimedia Software Engineering, 2004, pp.458–465, 13–15
Daniel Cedermany and Philippas Tsigas (2008) “On dyna, ic load balancing on graphics processors,”, Proceedings of the 23rd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware, pp. 57–64
Sheila Rader, etc. “Mobile Etreme Convergence: A Streamlined Architecture to Deliver Mass-market Converged Mobile Device,” Freescale white paper.
Yu-Hsien Lin Chiaheng Tu Chi-Sheng, Shih Shih-Hao Hung (2009) “Zero-Buffer Inter-Core Process Communication Protocol for Heterogeneous Multi-core Platforms,” IEEE International Conference on Embedded and Real-Time Computing Systems and Applications. Page(s): 69–78
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Huang, YS., Chieu, BC. A video decoding optimization for heterogeneous dual-core platforms architecture. Multimed Tools Appl 75, 627–646 (2016). https://doi.org/10.1007/s11042-014-2312-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-014-2312-8