skip to main content
10.1145/2145694.2145706acmconferencesArticle/Chapter ViewAbstractPublication PagesfpgaConference Proceedingsconference-collections
research-article

Saturating the transceiver bandwidth: switch fabric design on FPGAs

Published:22 February 2012Publication History

ABSTRACT

Driven by the demand of communication systems, field programmable gate array (FPGA) devices have significantly enhanced their aggregate transceiver bandwidth, reaching terabits per second for the upcoming generation. This paper asks the question whether a single-chip switch fabric can be built that saturates the available transceiver bandwidth.

In answering this question, we propose a new switch fabric organization, called Grouped Crosspoint Queued switch, that brings significant memory efficiency over the state-of-the-art organizations. This makes it possible to build high bandwidth, high radix switches directly on FPGA that rivals ASIC performance. The proposal was validated at small scale by a 16x16 160Gps switch on the available Virtex-6 device, and simulated at a larger scale of fat-tree switching network with 5Tbps capacity.

References

  1. Booksim interconnection network simulator. http://nocs.stanford.edu/cgi-bin/trac.cgi/wiki/resources/booksim.Google ScholarGoogle Scholar
  2. Xilinx virtex-7 data sheet. http://www.xilinx.com/support/documentation/data_sheets/ds183_Virtex_7_Data_Sheet.pdf.Google ScholarGoogle Scholar
  3. F. Abel, C. Minkenberg, R. P. Luijten, M. Gusat, and I. IIiadis. A four-terabit packet switch supporting long round-trip times. 2002.Google ScholarGoogle Scholar
  4. Actel, Inc. Designing high-speed ATM switch fabrics by using Actel FPGAs. http://www.actel.com/documents/hispeedatm an.pdf, 1996.Google ScholarGoogle Scholar
  5. Altera, Inc. Integrating 100-GbE switching solutions on 28-nms FPGAs. http://www.altera.com/literature/wp/wp-01127-stxv-100gbe-switching.pdf, 2010.Google ScholarGoogle Scholar
  6. N. Binkert, A. Davis, N. P. Jouppi, M. McLaren, N. Muralimanohar, R. Schreiber, and J. H. Ahn. The role of optics in future high radix switch design. In Proceedings of the 38th annual international symposium on Computer architecture, ISCA '11, pages 437--448, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. U. Cummings, D. Daly, R. Collins, and V. Agarwal. Fulcrum's FocalPoint FM4000: A scalable, low-latency 10 gige switch for high-performance data centers. In 17th IEEE Symposium on High Performance Interconnects, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. W. J. Dally and B. Towles. Principles and Practices of Interconnection Networks. Morgan Kaufman, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. W. E. Denzel, A. P. J. Engbersen, and I. Iliadis. A flexible shared-buffer switch for ATM at Gb/s rates. Computer Networks & IDSN Systems, 27:611--624, January 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. P. J. Engbersen. Prizma switch technology. IBM Journal of Research and Development, March, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. S. Iyer. Load balancing and parallelism for the internet. PhD. Thesis, Standford University, July, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Iyer and N. McKeown. Using constraint sets to achieve delay bounds in CIOQ switches. IEEE Communications Letters, 7(6), June, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  13. Y. Kanizo, D. Hay, and I. Keslassy. The crosspoint-queued switch. In IEEE International Conference on Computer Communications, Rio de Janeiro, Brazil, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  14. M. J. Karol, M. G. Hluchyj, and S. P. Morgan. Input vs. output queuing on a space-division packet switch. IEEE Transactions on Communication, 35(12):1347--1356, 1987.Google ScholarGoogle ScholarCross RefCross Ref
  15. J. Kim, W. J. Dally, B. Towles, and A. K. Gupta. Microarchitecture of a high-radix router. In 32nd Annual International Symposium on Computer Architecture, New York, NY, USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. W. Lockwood, N. McKeown, G. Watson, G. Gibb, P. Hartke, J. Naous, R. Raghuraman, and J. Luo. NetFPGA - an open platform for gigabit-rate network switching and routing. In IEEE Microelectronic Systems Education, San Diego, CA, USA, June 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. N. McKeown. Scheduling algorithms for input-queued cell switches. PhD. Thesis, University of California at Berkeley, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. N. McKeown, M. Lzzard, A. Mekkittikul, W. Ellersick, and M. Horowitz. The tiny tera: A small high-bandwidth packet switch core. In Proceedings of Hot Interconnects IV.Google ScholarGoogle Scholar
  19. N. McKeown, A. Mekkittikul, V. Anantharam, and J. Walrand. Achieving 100% throughput in an input queue switch. In IEEE International Conference on Computer Communications, San Francisco, CA, USA, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. C. Minkenberg and T. Engbersen. A combined input and output queued packet switched system based on PRIZMA switch-on-a-chip technology. IEEE Communication Magazine, 38:70--77, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. D. Simos. Design of a 32x32 variable-packet-size buffered crossbar switch chip. MSc. Thesis, University of Crete, July, 2004.Google ScholarGoogle Scholar
  22. Y. Tamir and G. Frazier. High performance multi-queue buffers for VLSI communication switches. In 15th Annual International Symposium on Computer Architecture, HI, USA, June 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Xilinx, Inc. High-speed buffered crossbar switch design using Virtex-EM devices. http://japan.xilinx.com/support/documentation/application_notes/xapp240.pdf, 2000.Google ScholarGoogle Scholar
  24. Xilinx, Inc. Building crosspoint switches with CoolRunner-II CPLDs. http://www.xilinx.com/support/documentation/application_notes/xapp380.pdf, 2002.Google ScholarGoogle Scholar
  25. K. Yoshigoe, K. Christensen, and A. Jacob. The RR/RR CICQ switch: Hardware design for 10-Gbps link speed. In IEEE International Performance, Computing, and Communications Conference, 2003.Google ScholarGoogle Scholar

Index Terms

  1. Saturating the transceiver bandwidth: switch fabric design on FPGAs

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      FPGA '12: Proceedings of the ACM/SIGDA international symposium on Field Programmable Gate Arrays
      February 2012
      352 pages
      ISBN:9781450311557
      DOI:10.1145/2145694

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 22 February 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      FPGA '12 Paper Acceptance Rate20of87submissions,23%Overall Acceptance Rate125of627submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader