Run-time parallelization switching for resource optimization on an MPSoC platform

Abbas, Naeem; Ma, Zhe

doi:10.1007/s10617-014-9128-7

Published: 19 March 2014

Design Automation for Embedded Systems, volume 18, pages 279–293 (2014)

Naeem Abbas & Zhe Ma

Abstract

The recent growth of multimedia applications on mobile terminals has raised the need for flexible, scalable computing platforms that deliver considerable (application-specific) computational performance within a low cost and energy budget. The MPSoC, with its multi-disciplinary approach that addresses application mapping, platform architecture and run-time management, provides multiple heterogeneous, flexible processing elements to meet this need. In an MPSoC, the run-time manager takes design-time exploration information as input and selects an active Pareto point based on the quality requirement and the available platform resources, where a Pareto point corresponds to a particular parallelization of the target application. To make the best use of the system's scalability and to take the application's flexibility a step further, the resource management and Pareto point selection decisions need to be adjustable at run-time. This work experiments with run-time Pareto point switching for the MPEG-4 encoder. It involves design-time exploration, embedding two parallelizations of the MPEG-4 encoder into a single component, and enabling run-time switching between them. This gives run-time control over the performance-cost trade-off and over the allocation and deallocation of hardware resources. The new system can encode each video frame with a different parallelization. The results offer a number of operating points on the Pareto curve between the previous ones at the sequence-encoding level. Depending on the quality request, the run-time manager can improve application performance by up to 50 % or save up to 15 % of memory bandwidth.
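The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' run-time manager: the `ParetoPoint` fields, the selection rule (cheapest feasible point by bandwidth) and all numbers below are assumptions made for illustration only. The second function shows why switching parallelization per frame yields sequence-level operating points that lie between the two embedded ones.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class ParetoPoint:
    """One parallelization of the encoder (all fields are hypothetical)."""
    name: str
    frame_time_ms: float  # time to encode one frame
    bandwidth: float      # memory bandwidth cost, arbitrary units
    cores: int            # processing elements required


def select_point(points: Sequence[ParetoPoint],
                 deadline_ms: float,
                 free_cores: int) -> Optional[ParetoPoint]:
    """Pick the feasible point with the lowest bandwidth cost.

    A point is feasible if it meets the per-frame deadline and fits
    in the currently free cores. Returns None if nothing is feasible.
    """
    feasible = [p for p in points
                if p.frame_time_ms <= deadline_ms and p.cores <= free_cores]
    return min(feasible, key=lambda p: p.bandwidth, default=None)


def mixed_sequence_cost(fast: ParetoPoint,
                        slow: ParetoPoint,
                        fast_fraction: float) -> Tuple[float, float]:
    """Average per-frame (time, bandwidth) when a fraction of the frames
    is encoded with `fast` and the rest with `slow`."""
    if not 0.0 <= fast_fraction <= 1.0:
        raise ValueError("fast_fraction must be in [0, 1]")
    slow_fraction = 1.0 - fast_fraction
    time_ms = fast_fraction * fast.frame_time_ms + slow_fraction * slow.frame_time_ms
    bandwidth = fast_fraction * fast.bandwidth + slow_fraction * slow.bandwidth
    return time_ms, bandwidth


# Two illustrative points; the values are invented, not measured results.
POINT_A = ParetoPoint("A", frame_time_ms=40.0, bandwidth=100.0, cores=2)
POINT_B = ParetoPoint("B", frame_time_ms=20.0, bandwidth=115.0, cores=4)
POINTS = [POINT_A, POINT_B]
```

With these invented values, a tight deadline forces the faster, more bandwidth-hungry point, a relaxed deadline lets the manager fall back to the cheaper one, and alternating the two on successive frames produces an average cost between them.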

Author information

Naeem Abbas
Present address: SSUET, Karachi, Pakistan
Zhe Ma
Present address: BioCartis, Mechelen, Belgium

Authors and Affiliations

IMEC, Kapeldreef 75, 3000, Leuven, Belgium
Naeem Abbas & Zhe Ma

Corresponding author

Correspondence to Naeem Abbas.

About this article

Cite this article

Abbas, N., Ma, Z. Run-time parallelization switching for resource optimization on an MPSoC platform. Des Autom Embed Syst 18, 279–293 (2014). https://doi.org/10.1007/s10617-014-9128-7

Received: 28 September 2013
Accepted: 08 January 2014
Published: 19 March 2014
Issue Date: September 2014
DOI: https://doi.org/10.1007/s10617-014-9128-7
