Cycle-accurate multi-FPGA platform for accelerated emulation of large on-chip networks

Romanov, Aleksandr Y.; Lerner, Anatoly; Amerikanov, Aleksandr A.

doi:10.1007/s11227-024-06306-3

Cycle-accurate multi-FPGA platform for accelerated emulation of large on-chip networks

Published: 23 June 2024

Volume 80, pages 22462–22478, (2024)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

Aleksandr Y. Romanov¹,
Anatoly Lerner¹ &
Aleksandr A. Amerikanov¹

258 Accesses
Explore all metrics

Abstract

On-chip networks (NoCs) have become a popular choice for designing large multiprocessor architectures. Software-based emulation is often used to perform the design verification. However, if the considered design is sufficiently large, software-based emulation becomes impractically slow. To avoid this limitation, multi-FPGA emulation was introduced, where multiple interconnected FPGAs collectively emulate a single circuit. The number of external FPGA pins is often insufficient for emulating large network-on-chip designs accurately. As a result, the overall emulation frequency has to be severely limited. We propose a method for accelerating multi-FPGA emulation by reducing the amount of data FPGAs need to transmit to each other. To achieve cycle-accurate emulation in the absence of constant transmission latency, synchronous messaging is implemented. The proposed method was tested on a functioning prototype. It is shown that the use of our method for multi-FPGA emulation of large NoC designs can reach several orders.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

P-NoC: Performance Evaluation and Design Space Exploration of NoCs for Chip Multiprocessor Architecture Using FPGA

Article 19 June 2020

A Low-Cost Test Solution for Reliable Communication in Networks-on-Chip

Article 15 April 2019

Communication-aware branch and bound with cluster-based latency-constraint mapping technique on network-on-chip

Article 06 May 2016

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

Data availability

The software associated with this article is freely available at https://github.com/alerner-6502/NoC_partitioner/

References

Sodani A, Gramunt R, Corbal J, Kim H-S et al (2016) Knights landing: second-generation Intel Xeon phi product. IEEE Micro 36(2):34–46. https://doi.org/10.1109/MM.2016.25
Article Google Scholar
Sepúlveda J, Marangozova-Martin V, Castrillon J (2017) Architecture, languages, compilation and hardware support for emerging manycore systems (alchemy): preface. Proced Comput Sci 108:1071–1072. https://doi.org/10.1016/j.procs.2017.05.276
Article Google Scholar
Abdallah L, Jan M, Ermont J, Fraboul C (2016) Reducing the Contention Experienced by Real-Time Core-to-I/O Flows over a Tilera-Like Network on Chip. In: 28th Euromicro Conference on Real-Time Systems (ECRTS), pp 86–96. https://doi.org/10.1109/ECRTS.2016.9
Somasundaram K (2020) Design of a virtual channel router architecture for low power on mesh-of-grid topology for network on Chip. Applied Soft Computing and Communication Networks: Proceedings of ACN 2019. Springer, Singapore, pp 63–79. https://doi.org/10.1007/978-981-15-3852-0_5
Chapter Google Scholar
Rocki K, Van Essendelft D, Sharapov I, Schreiber R et al (2020) Fast stencil-Code computation on a wafer-scale processor. In: SC20: International Conference for High Performance Computing, Networking, Storage and Analysis, pp 1–14. https://doi.org/10.1109/SC41405.2020.00062
Ditzel D, Espasa R, Aymerich N, Baum A et al (2021) Accelerating ML Recommendation with over a thousand RISC-V/tensor processors on Esperanto’s ET-SoC-1 Chip. In: IEEE Hot Chips 33 Symposium (HCS), pp 1–23. https://doi.org/10.1109/HCS52781.2021.9566904
Wolkotte PT, Holzenspies PKF, Smit GJM (2007) Fast, accurate and detailed NoC simulations. In: First International Symposium on Networks-on-Chip (NOCS'07), pp 323–332. https://doi.org/10.1109/NOCS.2007.18
Jain A, Kumar A, Shukla AP, Alshazly H et al (2022) Smart communication using 2d and 3d mesh network-on-chip. Intell Autom Soft Comput 34(3):2007–2021. https://doi.org/10.32604/iasc.2022.024770
Article Google Scholar
Romanov A, Ivannikov A (2018) SystemC Language Usage as the Alternative to the HDL and High-level Modeling for NoC Simulation. Int. J. Embed Real Time Commun Syst 9(2):18–31. https://doi.org/10.4018/IJERTCS.2018070102
Article Google Scholar
Jallouli K, Mazouzi M, Diguet J-P, Monemi A, Hasnaoui S (2022) MIMO-OFDM LTE system based on a parallel IFFT/FFT on NOC-based FPGA. Ann Telecommun 77(9–10):689–702. https://doi.org/10.1007/s12243-021-00901-8
Article Google Scholar
Kouadri-Mostefaoui A-M, Senouci B, Petrot F (2007) Scalable multi-FPGA platform for networks-on-chip emulation. In: IEEE International Conf on Application-specific Systems, Architectures and Processors (ASAP), pp 54–60. https://doi.org/10.1109/ASAP.2007.4429958
Dorai A, Sentieys O, Dubois H (2017) Evaluation of NoC on multi-FPGA interconnection using GTX transceiver. In: 24th IEEE International Conference on Electronics, Circuits and Systems (ICECS), pp 170–173. https://doi.org/10.1109/ICECS.2017.8292007
Yangfan L, Peng L, Yingtao J, Mei Y et al (2010) Building a multi-FPGA-based emulation framework to support networks-on-chip design and verification. Int J Electron 97:1241–1262. https://doi.org/10.1080/00207217.2010.512017
Article Google Scholar
Umer F, Bander A (2020) Exploring and optimizing partitioning of large designs for multi-FPGA based prototyping platforms. Computing 102:2361–2383. https://doi.org/10.1007/s00607-020-00834-5
Article Google Scholar
Tang Q, Mehrez H, Tuna M (2012) Design for prototyping of a parameterizable cluster-based multi-core system-on-chip on a multi-FPGA board. In: 23rd IEEE International Symposium on Rapid System Prototyping (RSP), pp 71–77. https://doi.org/10.1109/RSP.2012.6380693
Dreschmann M, Heisswolf J, Geiger M, Becker J, HauBecker M (2015) A framework for multi-FPGA interconnection using multi gigabit transceivers. In: 28th Symposium on Integrated Circuits and Systems Design (SBCCI), pp 1–6. https://doi.org/10.1145/2800986.2800993
Kouadri-Mostefaoui, Abdellah-Medjadji, Senouci B, Petrot F (2008) Large scale on-chip networks: an accurate multi-FPGA emulation platform. In: 11th EUROMICRO Conference on Digital System Design Architectures, Methods and Tools, pp 3–9.https://doi.org/10.1109/DSD.2008.130
Junyan T, Virginie F, Frederic R (2012) From mono-FPGA to multi FPGA emulation platform for NoC performance evaluations. Adv Parallel Comput 22:603–610. https://doi.org/10.3233/978-1-61499-041-3-603
Article Google Scholar
Farooq U, Chotin-Avot R, Azeem M, Ravoson M et al (2016) Inter-FPGA routing environment for performance exploration of multi-FPGA systems. In: International Symposium on Rapid System Prototyping (RSP), pp 1–7. https://doi.org/10.1145/2990299.2990317
Azeem MM, Chotin-Avot R, Farooq U, Ravoson M, Mehrez H (2016) Multiple FPGAs based prototyping and debugging with complete design flow. In: 2016 11th International Design & Test Symposium (IDT), pp 171–176. https://doi.org/10.1109/IDT.2016.7843035
Junyan T, Virginie F, Frederic R (2016) Rapid prototyping of Networks-on-Chip on multi-FPGA platforms. In: MATEC Web of Conferences, pp 12002. https://doi.org/10.1051/matecconf/20165412002
Tang Q (2015) Methodology of multi-FPGA prototyping platform generation. Dissertation, Pierre and Marie Curie Univ
Lezhnev EV, Zunin VV, Amerikanov AA, Romanov AY (2024) Electronic computer-aided design for low-level modeling of networks-on-chip. IEEE Access 12:48750–48763. https://doi.org/10.1109/ACCESS.2024.3382710
Article Google Scholar
Li Y, Zhou P (2023) Fast and accurate noc latency estimation for application-specific traffics via machine learning. IEEE Trans Circuits Syst II Express Briefs 70:3569–3573. https://doi.org/10.1109/TCSII.2023.3258700
Article Google Scholar
Mandal SK, Krishnakumar A, Ayoub R, et al (2020) Performance analysis of priority-aware NoCs with deflection routing under traffic congestion. In: Proceedings of the 39th International Conference on Computer-Aided Design. ACM, New York, NY, USA, pp 1–9. https://doi.org/10.1145/3400302.3415654
Pereira RVM, Seman LO, Berejuck MD et al (2021) MPI hardware framework for many-core based embedded systems. Int J Sens Networks 35:42. https://doi.org/10.1504/IJSNET.2021.112888
Article Google Scholar
Kumar AS, Rao TVKH (2022) Performance assessment of adaptive core mapping for NoC-based architectures. Int J Embed Syst 15:395. https://doi.org/10.1504/IJES.2022.127149
Article Google Scholar
Kumar AS, Rao TVKH (2020) Scalable benchmark synthesis for performance evaluation of NoC core mapping. Microprocess Microsyst 79:103272. https://doi.org/10.1016/j.micpro.2020.103272
Article Google Scholar
Montanana JM, De Andres D, Tirado F (2013) Fault tolerance on NoCs. In: 27th International Conference on Advanced Information Networking and Applications Workshops. pp 138–143. https://doi.org/10.1109/WAINA.2013.221
Benmessaoud Gabis A, Koudil M (2016) NoC routing protocols – objective-based classification. J Syst Archit 66–67:14–32. https://doi.org/10.1016/j.sysarc.2016.04.011
Article Google Scholar
Trik M, Pour Mozaffari S, Bidgoli AM (2021) Providing an adaptive routing along with a hybrid selection strategy to increase efficiency in NoC-based neuromorphic systems. Comput Intell Neurosci 2021:1–8. https://doi.org/10.1155/2021/8338903
Article Google Scholar
Abd El ghany MA, El-Moursy MA, Ismail M (2009) High throughput architecture for high performance NoC. In: 2009 IEEE International Symposium on Circuits and Systems. IEEE, pp 2241–2244. https://doi.org/10.1109/ISCAS.2009.5118244

Download references

Funding

This article is an output of a research project implemented as part of the Basic Research Program at the National Research University Higher School of Economics (HSE University).

Author information

Authors and Affiliations

HSE University, 34 Tallinskaya Ulitsa, Moscow, Russian Federation, 123458
Aleksandr Y. Romanov, Anatoly Lerner & Aleksandr A. Amerikanov

Authors

Aleksandr Y. Romanov
View author publications
You can also search for this author inPubMed Google Scholar
Anatoly Lerner
View author publications
You can also search for this author inPubMed Google Scholar
Aleksandr A. Amerikanov
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

Conceptualization was done by A.Y.R., A.L., and A.A.A.; methodology was done by A.Y.R.; validation was done by A.Y.R. and A.L.; writing—original draft preparation was done by A.Y.R. and A.L.; writing—review and editing was done by A.A.A. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Aleksandr Y. Romanov.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Romanov, A.Y., Lerner, A. & Amerikanov, A.A. Cycle-accurate multi-FPGA platform for accelerated emulation of large on-chip networks. J Supercomput 80, 22462–22478 (2024). https://doi.org/10.1007/s11227-024-06306-3

Download citation

Accepted: 13 June 2024
Published: 23 June 2024
Issue Date: October 2024
DOI: https://doi.org/10.1007/s11227-024-06306-3

Keywords

Part of a collection:

Section - Advanced Embedded Systems

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Cycle-accurate multi-FPGA platform for accelerated emulation of large on-chip networks

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

P-NoC: Performance Evaluation and Design Space Exploration of NoCs for Chip Multiprocessor Architecture Using FPGA

A Low-Cost Test Solution for Reliable Communication in Networks-on-Chip

Communication-aware branch and bound with cluster-based latency-constraint mapping technique on network-on-chip

Explore related subjects

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now