Skip to main content

Performance of the Achilles Router

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2823))

Abstract

The Achilles Router provides low latency, high bandwidth connections between processors, enabling a network of low-cost processors to perform as a high-performance parallel processor. It is also economical, being constructed from low cost Field Programmable Gate Arrays (FPGAs). These programmable devices allow it to be re-programmed for use in a variety of network topologies; they also permit ‘tuning’ the router for optimum performance in different applications. A key factor in its simplicity and performance is the 3-D structure: this allows us to build a full cross-bar switch with a wide, high-bandwidth datapath. The simple cross-bar circuit also has very low latency: we measured latencies of ~ 800ns in the hardware and 2.5μs when software overheads were included. We measured the basic performance of an inter-processor link using Achilles, and then, using a range of benchmarks with different characteristics, showed that Achilles clearly outperforms Fast Ethernet.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ang, B., Chiou, D.: StarT-Voyager. In: Proceedings of the MIT student workshop for scalable computing, Massachusetts, Cambridge (1996)

    Google Scholar 

  2. Boughton, G.: Arctic switch fabric. In: Proceedings of the 1997 Parallel Computing, Routing and Communication Workshop, GA, Atlanta, USA (1997)

    Google Scholar 

  3. Dolphin Interconnect Solutions Inc.: The Dolphin SCI interconnect. Technical report, Dolphin Interconnect Solutions Inc., California, Westlake Village, USA (1996)

    Google Scholar 

  4. Gillet, R., Kaufmann, R.: Using the Memory Channel network. IEEE Micro, 19–25 (1997)

    Google Scholar 

  5. Kluge, J., Bruning, U., Fischer, M., Rzymianowicz, L.: The ATOLL approach for a fast and reliable System Area Network. In: PDPTA 1999, Nevada (1999)

    Google Scholar 

  6. Myricom: Myrinet performance measurements (1997), http://www.myri.com

  7. von Eicken, T., Culler, D., Goldstein, S., Schauser, K.: Active messages: a mechanism for integrated communication and computation. In: 19th International Symposium on Computer Architecture, Gold Coast, Australia (1992)

    Google Scholar 

  8. Peterson, L.L., Davie, B.S.: Computer networks: a systems approach. Morgan Kauffman, San Francisco (2000)

    Google Scholar 

  9. Blumofe, R., Joerg, C., Kuszmaul, B., Leiserson, C., Randall, K., Zhou, Y.: Cilk: an efficient multithreaded runtime system. In: PPoPP 1995, Santa Barbara (1995)

    Google Scholar 

  10. Tham, C.K.: Achilles: A high bandwidth, low latency, low overhead network interconnect for high performance parallel processing using a network of workstations. PhD thesis, The University of Western Australia (2003)

    Google Scholar 

  11. Snir, M.: MPI: The complete reference. MIT Press, Cambridge (1996)

    Google Scholar 

  12. Xilinx Inc.: Xilinx component data sheets (2000), http://www.xilinx.com

  13. Elliott, J., Sachs, M.: Enterprise systems connection (ESCON) architecture. IBM journal of Research and Development 36 (1992)

    Google Scholar 

  14. Dutton, H., Lenhard, P.: High-speed networking technology: an introductory survey, 3rd edn. Prentice Hall, Inc., New Jersey (1995)

    Google Scholar 

  15. Paull, D.: The Need for speed. B.E. (Hons) thesis. Electrical and Electronic Engineering, University of Western Australia (1998)

    Google Scholar 

  16. Press, W.H., Flannery, B.P., Teukolsky, S.A., Vetterling, W.T.: Numerical Recipes: The Art of Scientific Computing. Cambridge University Press, Cambridge (1986)

    Google Scholar 

  17. Young, D.: Iterative solution of large linear systems. Academic Press, New York (1971)

    MATH  Google Scholar 

  18. Hageman, L., Young, D.: Applied Iterative Methods. Academic Press, New York (1981)

    MATH  Google Scholar 

  19. Gregg, R.R., Herbert, D., McCoull, J., Morris, J.: Thetis: A Parallel Processor Leveraging Commercial Technology. In: Proc. Australian Computer Science Conference, Adelaide (1995)

    Google Scholar 

  20. Tham, S., Morris, J., Gregg, R.: Achilles: High bandwidth, low latency, low overhead communication. In: Australasian Computer Architecture Conference, Auckland, New Zealand, pp. 173–184. Springer, Singapore (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tham, S., Morris, J. (2003). Performance of the Achilles Router. In: Omondi, A., Sedukhin, S. (eds) Advances in Computer Systems Architecture. ACSAC 2003. Lecture Notes in Computer Science, vol 2823. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39864-6_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-39864-6_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20122-9

  • Online ISBN: 978-3-540-39864-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics