Abstract
The choice of the on-chip communication topology in many systems is of vital importance because it affects the entire inter-component data traffic and impacts significantly the overall system performance and cost. On the other hand, there is today a very large spectrum of on-chip interconnect topologies that potentially meet given communication requirements, determining various trade-offs between cost and performance. This work proposes an automated methodology to search the interconnect design space, avoiding a manual and time consuming try-and-error process. The methodology turns the description of the application communication requirements into an on-chip synthesizable interconnection structure satisfying given area constraints. Targeted at FPGA technologies, the approach combines crossbars and shared buses, connected through bridges, yielding a scalable, efficient structure. The resulting architecture improves the level of communication parallelism that can be exploited, while keeping area requirements low. The paper thoroughly describes the formalisms and the methodology used to derive such optimized heterogeneous topologies. It also discusses some case-studies emphasizing the impact of the proposed approach and highlighting the essential differences with a few other solutions presented in the technical literature.
Similar content being viewed by others
Notes
A beat is an individual data transfer within an AXI burst.
According to the component documentation, arbitration latencies typically do not impact data throughput when transactions average at least three data beats.
References
Ogras UY, Hu J, Marculescu R. (2005) Key research problems in NoC design: a holistic perspective. In: Proceedings of the 3rd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, ACM, pp 69–74
Pande PP, Grecu C, Jones M, Ivanov A, Saleh R (2005) Performance evaluation and design trade-offs for network-on-chip interconnect architectures. Comput IEEE Trans 54(8):1025–1040
Gerstlauer A, Haubelt C, Pimentel AD, Stefanov TP, Gajski DD, Teich J (2009) Electronic system-level synthesis methodologies. IEEE Trans Comput-Aided Des Integr Circuit Syst 28(10):1517–1530
Cilardo A, Gallo L, Mazzocca N, Mazzeo A. (2013) Efficient and scalable OpenMP-based system-level design. In: Design, Automation and Test in Europe Conference DATE’13
Cilardo A, Gallo L, Mazzocca N (2013) Design space exploration for high-level synthesis of multi-threaded applications. J Syst Archit 59(10):1171–1183
Cilardo A, Durante P, Lofiego C, Mazzeo A (2010) Early prediction of hardware complexity in HLL-to-HDL translation. In: Proceedings of the international conference on field programmable logic and applications (FPL), pp 483–488
Kim S, Ha S (2006) Efficient exploration of bus-based system-on-chip architectures. In: IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol 14(7), pp 681–692
Lahiri K, Raghunathan A, Dey S (2004) Design space exploration for optimizing on-chip communication architectures. IEEE Trans Comput-Aided Des Integr Circuits Syst 23(6):952–961
Dally WJ, Towles B. (2001) Route packets, not wires: On-chip interconnection networks. In: Proceedings of the IEE design automation conference, 2001, pp 684–689
Atienza D, Angiolini F, Murali S, Pullini A, Benini L, De Micheli G (2008) Network-on-chip design and synthesis outlook. INTEGRA VLSI J 41(3):340–359
Benini L, De Micheli G (2002) Networks on chips: a new SoC paradigm. Computer 35(1):70–78
Bertozzi D, Jalabert A, Murali S, Tamhankar R, Stergiou S, Benini L, De Micheli G (2005) NoC synthesis flow for customized domain specific multiprocessor systems-on-chip. IEEE Trans Parallel Distrib Syst 16(2):113–129
Murali S, Meloni P, Angiolini F, Atienza D, Carta S, Benini L, De Micheli G, Raffo L (2006) Designing application-specific networks on chips with floorplan information. In: Proceedings of the 2006 IEEE/ACM international conference on computer-aided design, ACM, pp 355–362
Reza Kakoee M, Angiolin F, Murali S, Pullini A, Seiculescu C, Benini L (2009) A floorplan-aware interactive tool flow for NoC design and synthesis. In: IEEE international SOC conference, SOCC, IEEE, 2009, pp 379–382
Todorov V, Mueller-Gritschneder D, Reinig H, Schlichtmann U (2013) A spectral clustering approach to application-specific network-on-chip synthesis. In: Design, automation & test in Europe conference & exhibition (DATE), IEEE, 2013, pp 1783–1788
Pasricha S, Dutt N, Ben-Romdhane M (2006) Constraint-driven bus matrix synthesis for MPSoC. In: Proceedings of the 2006 Asia and South Pacific design automation conference, IEEE Press, pp 30–35
Murali S, Benini L, De Micheli G (2007) An application-specific design methodology for on-chip crossbar generation. IEEE Trans Comput-Aided Des Integr Circuits Syst 26(7):1283–1296
Yoo J, Lee D, Yoo S, Choi K (2007) Communication architecture synthesis of cascaded bus matrix. In: Design automation conference, ASP-DAC’07, Asia and South Pacific, IEE, 2007, pp 171–177
Jun M, Woo D, Chung EY (2012) Partial connection-aware topology synthesis for on-chip cascaded crossbar network. IEEE Trans Comput 61(1):73–86
Jun M, Yoo S, Chung EY (2008) Mixed integer linear programming-based optimal topology synthesis of cascaded crossbar switches. In: Design automation conference, 2008. ASPDAC 2008. Asia and South Pacific, IEEE, pp 583–588
Jun M, Yoo S, Chung EY (2009) Topology synthesis of cascaded crossbar switches. IEEE Trans Comput-Aided Des Integr Circuits Syst 28(6):926–930
Yoo J, Yoo S, Choi K (2009) Topology/floorplan/pipeline co-design of cascaded crossbar bus. In: IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol 17(8), pp 1034–1047
Lahtinen V, Salminen E, Kuusilinna K, Hamalainen T. (2003) Comparison of synthesized bus and crossbar interconnection architectures. In: Proceedings of the 2003 international symposium on circuits and systems, ISCAS’03, vol. 5, 2003, IEEE pp. V-433
Cilardo A, Fusella E, Gallo L, Mazzeo A. (2013) Automated synthesis of FPGA-based heterogeneous interconnect topologies. In: 23rd international conference on field programmable logic and applications (FPL), IEEE, 2013, pp 1–8
Cilardo A, Fusella E, Gallo L, Mazzeo A (2014) Joint communication scheduling and interconnect synthesis for FPGA-based many-core systems. In: Design, automation & test in Europe conference & exhibition (DATE), IEEE, 2014
Cilardo A, Mazzocca N (2013) Exploiting vulnerabilities in cryptographic hash functions based on reconfigurable hardware. IEEE Trans Inf Forensics Secur 8(5):810–820
Cilardo A, Mazzeo A, Romano L, Saggese G (2005) An FPGA-based key-store for improving the dependability of security services. In: 10th IEEE international workshop on object-oriented real-time dependable systems, 2005. WORDS 2005, pp 389–396
Cilardo A (2013) Fast parallel \({GF}(2^m)\) polynomial multiplication for all degrees. IEEE Trans Comput 62(5):929–943
Devanur NR, Feige U (2011) An \({O} (n \log n)\) algorithm for a load balancing problem on paths. In: Algorithms and data structures, Springer, New York, pp 326–337
Xilinx: Zynq-7000 All Programmable SoC Overview (2012)
ARM: AMBA AXI and ACE Protocol Specification (2011)
Srinivasan K, Chatha KS (2006) A low complexity heuristic for design of custom network-on-chip architectures. In: Proceedings of the conference on Design, automation and test in Europe: Proceedings European Design and Automation Association pp. 130–135
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Cilardo, A., Fusella, E., Gallo, L. et al. Automated design space exploration for FPGA-based heterogeneous interconnects. Des Autom Embed Syst 18, 157–170 (2014). https://doi.org/10.1007/s10617-014-9134-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10617-014-9134-9