Skip to main content
Log in

A video decoding optimization for heterogeneous dual-core platforms architecture

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Digital signal processors (DSPs), with their powerful computing abilities, are commonly used for multimedia coding/decoding processes. Therefore, the SOC (System on Chip) industry integrates DSP with ARM (Advanced RISC Machine) for input/output processing, saving power, and building up a multi-core platform used in handheld devices. The computing ability of ARM has been substantially improved with some state of the art technique. The industry currently has regarded the integration of ARM and DSP into SOC as two independent cores to enhance the efficiency. Since one algorithm is added to process the distributed computing work of the dual cores, the efficiency must be doubled. The system will assign the work to the core with higher processing efficiency. Instead of the traditional Static task scheduling, this article proposed a new approach called Dynamic task scheduling, providing 29.88 % higher efficiency than that of Static task scheduling. The reason is that the static partition will finally send the heavy load of computing to DSP; therefore, it is not possible to achieve the enhanced efficiency of the multi-core. However, the dynamic task scheduling will consider the actual loading of each core for computing and communicate with each other; furthermore, it can share the work to assist the process of each core. Besides, the Direct Memory Access is integrated with the multi-core platform to reduce the time-consumption resulted from the communication between the dual cores. The experimental result shows that the dynamic partition operated by the heterogeneous dual core system can use 192 MHz pulse to decode the CIF video signal that the decoding speed can reach 30fps and the efficiency is improved 50 % with DMA (Direct Memory Access) technology incorporated.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Byeong-Doo Choi, Kang-Sun Choi, Sung-Jea Ko, Aldo W Morales (2003) “Efficient Real-Time Implementation of MPEG-4 Audiovisual Decoder Using DSP and RISC Chips,” IEEE International Conference on Consumer Electronics (ICCE)

  2. Casavant TL, Kuhl JG (1988) A taxonomy of scheduling in general-purpose distributed computing systems. IEEE Trans Softw Eng 14(2):141–154

    Article  Google Scholar 

  3. Cheng-Nan Chiu, Chien-Tang Tseng, and Chun-Jen Tsai (2005) “Tightly-coupled MPEG-4 video encoder framework on asymmetric dual-core platforms,” Circuits and Systems. ISCAS 2005. Vol. 3, Pages: 2132–2135

  4. Fengguang Song, Asim Yarkhan and Jack Dongarra (2009) “Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems,”, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis Article No. 19

  5. Hakem Beitollahi and Geert Deconinck (2006) “Fault-Tolerant Partitioning Scheduling Algorithms in Real-Time Multiprocessor Systems,” Proceedings of the 12th Pacific Rim International Symposium on Dependable Computing, Pages 296–304

  6. Heirich A, Arvo J (1998) A competitive analysisof load balancing strategies for parallel ray tracing. J Supercomput 12(1–2):57–68

    Article  Google Scholar 

  7. Koziri M, Zacharis D, Katsavounidis I, Bellas N (2011) “Implementation of the AVS Video Decoder on a Heterogeneous Dual-Core SIMD Processor” IEEE Transactions on Consumer Electronics. Page (s): 673–681

  8. Kyu Ha Lee, Keun-Sup Lee, Tae-Hoon Hwang, Young-Cheol Park and Dae Hee Youn (2001) “An Architecture and Implementation of MPEG Audio Layer III Decoder using Dual-Core DSP,” IEEE Transactions on Consumer Electronics, November

  9. Lee JY, Lee J-J, Park SM (2010) “Multi-core platform for an efficient H.264 and VC-1video decoding based on macroblock row-level parallelism”. Circ, Devices Syst, IET 4(2):147–158

    Article  Google Scholar 

  10. T Liu, and C Choudary (2004) “Content-aware streaming of lecture videos over wireless networks,” Proceedings of IEEE Sixth International Symposium on Multimedia Software Engineering, 2004, pp.458–465, 13–15

  11. Daniel Cedermany and Philippas Tsigas (2008) “On dyna, ic load balancing on graphics processors,”, Proceedings of the 23rd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware, pp. 57–64

  12. Sheila Rader, etc. “Mobile Etreme Convergence: A Streamlined Architecture to Deliver Mass-market Converged Mobile Device,” Freescale white paper.

  13. Yu-Hsien Lin Chiaheng Tu Chi-Sheng, Shih Shih-Hao Hung (2009) “Zero-Buffer Inter-Core Process Communication Protocol for Heterogeneous Multi-core Platforms,” IEEE International Conference on Embedded and Real-Time Computing Systems and Applications. Page(s): 69–78

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yung-Sung Huang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huang, YS., Chieu, BC. A video decoding optimization for heterogeneous dual-core platforms architecture. Multimed Tools Appl 75, 627–646 (2016). https://doi.org/10.1007/s11042-014-2312-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-014-2312-8

Keywords

Navigation