research-article

OmpSs@Zynq all-programmable SoC ecosystem

Authors:

Antonio Filgueras,

Daniel Jimenez-Gonzalez,

Carlos Alvarez,

Xavier Martorell,

Juanjo Noguera,

Kees Vissers

FPGA '14: Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays

Pages 137 - 146

https://doi.org/10.1145/2554688.2554777

Published: 26 February 2014

Abstract

OmpSs is an OpenMP-like, directive-based programming model that supports heterogeneous execution (MIC, GPU, SMP, etc.) and runtime management of task dependencies. OmpSs has also strongly influenced the recent OpenMP 4.0 specification. The Zynq All-Programmable SoC combines the features of an SMP and an FPGA, and can exploit data-, instruction-, and thread-level parallelism (DLP, ILP, and TLP) to make efficient use of new technology improvements and chip resource capacities. In this paper, we focus on programmability and heterogeneous execution support, presenting a successful combination of the OmpSs programming model and the Zynq All-Programmable SoC platforms.
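To make "directive-based" and "runtime task dependencies management" concrete, here is a minimal sketch. It is an illustration, not code from the paper. It uses the standard OpenMP 4.0 `depend` clause, not OmpSs's own directive syntax, and it deliberately leaves out any FPGA offload annotation, since the abstract does not show one. The names `produce`, `scale`, `run_pipeline`, and `N` are hypothetical.

```c
#include <stddef.h>

#define N 8  /* illustrative vector length (assumption) */

/* Producer task body: fill the vector with 0, 1, 2, ... */
static void produce(float *v, size_t n) {
    for (size_t i = 0; i < n; i++) {
        v[i] = (float)i;
    }
}

/* Consumer task body: scale each input element by 2. */
static void scale(const float *in, float *out, size_t n) {
    for (size_t i = 0; i < n; i++) {
        out[i] = 2.0f * in[i];
    }
}

/* Runs a two-task pipeline and returns the sum of the result vector. */
float run_pipeline(void) {
    float a[N], b[N];
    float sum = 0.0f;

    /* One thread creates the tasks; the runtime schedules them. */
    #pragma omp parallel
    #pragma omp single
    {
        /* Task 1 writes a. */
        #pragma omp task depend(out: a)
        produce(a, N);

        /* Task 2 reads a and writes b, so the runtime orders it
           after task 1 without any explicit synchronization. */
        #pragma omp task depend(in: a) depend(out: b)
        scale(a, b, N);

        /* Wait for both tasks before leaving the block. */
        #pragma omp taskwait
    }

    for (size_t i = 0; i < N; i++) {
        sum += b[i];
    }
    return sum;
}
```

If the compiler is run without OpenMP support, the pragmas are ignored and the two calls simply run in program order, which gives the same result. In a heterogeneous model such as the one the abstract describes, the same dependence annotations would let a runtime decide where each task executes; how that is expressed for the FPGA is not covered by this sketch.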

Published In

FPGA '14: Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays

February 2014

272 pages

ISBN: 9781450326711

DOI: 10.1145/2554688

General Chair: Vaughn Betz, University of Toronto, Canada

Program Chair: George A. Constantinides, Imperial College London, UK

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGDA: ACM Special Interest Group on Design Automation

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

FPGA'14: The 2014 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays

Sponsor: SIGDA

February 26 - 28, 2014

Monterey, California, USA

Acceptance Rates

FPGA '14 Paper Acceptance Rate 30 of 110 submissions, 27%;

Overall Acceptance Rate 125 of 627 submissions, 20%

Bibliometrics

Article Metrics

Total Citations: 17
Total Downloads: 443
Downloads (Last 12 months): 7
Downloads (Last 6 weeks): 1

Reflects downloads up to 13 Feb 2025

Cited By

Rosso P, Petrica L, Lisa N, Pereira M, Rigo S, Yviquel H, Bonato V, Francesquini E, Araujo G (2024). Integrating Multi-FPGA Acceleration to OpenMP Distributed Computing. Advancing OpenMP for Future Accelerators, 49-63. Online publication date: 16-Sep-2024.
https://doi.org/10.1007/978-3-031-72567-8_4
Filgueras A, Vidal M, Mateu M, Jimenez-Gonzalez D, Alvarez C, Martorell X, Ayguade E, Theodoropoulos D, Pnevmatikatos D, Gai P, Garzarella S, Oro D, Hernando J, Bettin N, Pomella A, Procaccini M, Giorgi R (2021). The AXIOM Project: IoT on Heterogeneous Embedded Platforms. IEEE Design & Test 38:5, 74-81. Online publication date: Oct-2021.
https://doi.org/10.1109/MDAT.2019.2952335
Huthmann J, Podobas A, Sommer L, Koch A, Sano K (2020). Extending High-Level Synthesis with High-Performance Computing Performance Visualization. 2020 IEEE International Conference on Cluster Computing (CLUSTER), 371-380. Online publication date: Sep-2020.
https://doi.org/10.1109/CLUSTER49012.2020.00047
Huthmann J, Sommer L, Podobas A, Koch A, Sano K (2020). OpenMP Device Offloading to FPGAs Using the Nymble Infrastructure. OpenMP: Portable Multi-Level Parallelism on Modern Systems, 265-279. Online publication date: 1-Sep-2020.
https://doi.org/10.1007/978-3-030-58144-2_17
Mayer F, Knaust M, Philippsen M (2019). OpenMP on FPGAs: A Survey. OpenMP: Conquering the Full Hardware Spectrum, 94-108. Online publication date: 9-Aug-2019.
https://doi.org/10.1007/978-3-030-28596-8_7
Christodoulis G, Broquedis F, Muller O, Selva M, Desprez F (2018). An FPGA target for the StarPU heterogeneous runtime system. 2018 13th International Symposium on Reconfigurable Communication-centric Systems-on-Chip (ReCoSoC), 1-8. Online publication date: Jul-2018.
https://doi.org/10.1109/ReCoSoC.2018.8449373
Watanabe Y, Lee J, Boku T, Sato M (2018). Trade-Off of Offloading to FPGA in OpenMP Task-Based Programming. Evolving OpenMP for Evolving Architectures, 96-110. Online publication date: 29-Aug-2018.
https://doi.org/10.1007/978-3-319-98521-3_7
Zhang S, Angepat H, Chiou D (2017). HGum: Messaging framework for hardware accelerators. 2017 International Conference on ReConFigurable Computing and FPGAs (ReConFig), 1-8. Online publication date: Dec-2017.
https://doi.org/10.1109/RECONFIG.2017.8279799
Wagner M, Llort G, Filgueras A, Jiménez-González D, Servat H, Teruel X, Mercadal E, Álvarez C, Giménez J, Martorell X, Ayguadé E, Labarta J (2017). Monitoring Heterogeneous Applications with the OpenMP Tools Interface. Tools for High Performance Computing 2016, 41-57. Online publication date: 9-May-2017.
https://doi.org/10.1007/978-3-319-56702-0_3
Giorgi R, Palermo G, Feo J, Tumeo A, Franke H (2016). Exploring dataflow-based thread level parallelism in cyber-physical systems. Proceedings of the ACM International Conference on Computing Frontiers, 295-300. Online publication date: 16-May-2016.
https://dl.acm.org/doi/10.1145/2903150.2906829
