Skip to main content
Log in

Full-system simulation of distributed memory multicomputers

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

In this paper we discuss environments for the full-system simulation of multicomputers. These environments are composed of a large collection of modules that simulate the compute nodes and the network, plus additional linking elements that perform communication and synchronization. We present our own environment, in which we integrate Simics with INSEE. We reuse as many Simics modules as possible to reduce the effort of hardware modeling, and also to simulate standard machines running unmodified operating systems. This way we avoid the error-prone effort of developing drivers and libraries. The environment we propose in this paper enables us to show some of the difficulties we found when integrating diverse tools, and how we were able to overcome them. Furthermore we show some important details to have into account in order to do a valid full-system simulation of multicomputers, mostly related with synchronization and timing. Thus, a trade-off has to be found between simulation speed and accuracy of results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Adiga, N.R., et al.: Blue Gene/L torus interconnection network. IBM J. Res. Dev. 49(2/3) (2005)

  2. Binkert, N.L., Hallnor, E.G., Reinhardt, S.K.: Network-oriented full-system simulation using M5. In: Sixth Workshop on Computer Architecture Evaluation using Commercial Workloads (CAECW), Feb. 2003

  3. Binkert, N.L., Dreslinski, R.G., Hsu, L.R., Lim, K.T., Saidi, A.G., Reinhardt, S.K.: The M5 simulator: modeling networked systems. IEEE Micro 26(4), 52–60 (2006)

    Article  Google Scholar 

  4. Boden, N.J., Cohen, D., Felderman, R.E., Kulawik, A.E., Seitz, C.L., : Myrinet. A gigabit per second local area network. IEEE Micro 15(1), 29–36 (1995)

    Article  Google Scholar 

  5. García, P.J., Quiles, F.J., Flich, J., Duato, J., Jhonson, I., Naven, F.: Efficient, scalable congestion management for interconnection networks. IEEE Micro 26(5), 52–66 (2006)

    Article  Google Scholar 

  6. IBM: IBM full-system simulator for the cell broadband engine processor. Available at http://alphaworks.ibm.com/tech/cellsystemsim (May 2008)

  7. Jacobson, V.: Congestion avoidance and control. Comput. Commun. Rev. 18(4), 314–329 (1988)

    Article  Google Scholar 

  8. Jain, R.: Congestion control in computer networks: issues and trends. IEEE Netw. 4(3), 24–30 (1990)

    Article  Google Scholar 

  9. Karlin, S., Peterson, L.: Maximum packet rates for full-duplex ethernet. Technical Report TR–645–02, Princeton University (February 2002)

  10. LA-MPI Home Page: The Los Alamos message passing interface. Available at http://public.lanl.gov/lampi/ (May 2008)

  11. LAM/MPI Home Page: LAM/MPI parallel computing. Available at http://www.lam-mpi.org/ (Apr. 2008)

  12. Liu, J., Wu, J., Panda, D.K.: High performance RDMA-based MPI implementation over infiniBand, Int. J. Parallel Program. (2004)

  13. Magnusson, P.S., Christensson, M., Eskilson, J., Forsgren, D., Hallberg, G., Hogberg, J., Larsson, F., Moestedt, A., Werner, B.: Simics: a full system simulation platform. IEEE Comput. 35(2), 50–58 (2002)

    Google Scholar 

  14. Martin, M.M.K., : Multifacet’s general execution-driven multiprocessor simulator (GEMS) toolset. Sigarch Comput. Archit. News 33(4), 92–99 (2005)

    Article  Google Scholar 

  15. Mauer, C.J., Hill, M.D., Wood, D.A.: Full-system timing-first simulation. In: ACM SIGMETRICS, June 2002

  16. Miguel-Alonso, J., Izu, C., Gregorio, J.A.: Improving the performance of large interconnection networks using congestion-control mechanisms. Perform. Eval. 65, 203–211 (2008)

    Article  Google Scholar 

  17. Miguel-Alonso, J., Navaridas, J., Ridruejo, F.J.: Interconnection network simulation using traces of MPI applications. Int. J. Parallel. Program. (to appear). DOI 10.1007/s10766-008-0089-y

  18. MPI Forum: MPICH home page. Available at http://www-unix.mcs.anl.gov/mpi/mpich/ (May 2008)

  19. Myricom Documentation and Software Downloads. Available at http://www.myri.com/scs/ (May 2008)

  20. NASA Advanced Supercomputing (NAS) division: NAS Parallel Benchmarks. Available at http://www.nas.nasa.gov/Resources/Software/npb.html (May 2008)

  21. Navaridas, J., Ridruejo, F.J., Miguel-Alonso, J.: Evaluation of interconnection networks using full-system simulators: lessons learned. In: Proc. 40th Annual Simulation Symposium, Norfolk, VA, March 26–28, 2007

  22. Navaridas, J., Miguel-Alonso, J., Ridruejo, F.J.: On synthesizing workloads emulating MPI applications. In: The 9th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC-08). April 14–18, 2008, Miami, Florida, USA

  23. OPNET Technologies, Inc. corporate web page, available at http://www.opnet.com (May 2008)

  24. Pai, V.S., Ranganathan, P., Adve, S.V.: RSIM: an execution-driven simulator for ILP-based shared-memory multiprocessors and uniprocessors. In: IEEE TCCA New., Oct. 1997

  25. Pfister, G.F.: Aspects of the InfiniBand(tm) architecture. In: Third IEEE International Conference on Cluster Computing (CLUSTER’01), October 2001, pp. 369

  26. Puente, V., Izu, C., Gregorio, J.A., Beivide, R., Vallejo, F.: The adaptive bubble router. J. Parallel Distrib. Comput. 61(9), 1180–1208 (2001)

    Article  MATH  Google Scholar 

  27. Puente, V., Gregorio, J.A., Beivide, R.: SICOSYS: an integrated framework for studying interconnection network in multiprocessor systems. In: Proceedings of the IEEE 10th Euromicro Workshop on Parallel and Distributed Processing, Gran Canaria, Spain (2002)

  28. Puente, V., Gregorio, J.A., Vallejo, F., Beivide, R.: Immunet: a cheap and robust fault-tolerant packet routing mechanism. In: International Symposium on Computer Architecture (ISCA), June 2004, pp. 198–211

  29. Ridruejo, F.J., Miguel-Alonso, J.: INSEE: an interconnection network simulation and evaluation environment. In: Proc. Euro-Par 2005. Lecture Notes in Computer Science, vol. 3648, pp. 1014–1023

  30. Ridruejo, F.J., Gonzalez, A., Miguel-Alonso, J.: TrGen: a traffic generation system for interconnection network simulators. In: International Conference on Parallel Processing, 2005. 1st. Int. Workshop on Performance Evaluation of Networks for Parallel, Cluster and Grid Computing Systems (PEN-PCGCS’05). ICPP 2005 Workshops, 14–17 June 2005, pp. 547–553

  31. Rosenblum, M., : Complete computer system simulation: the SimOS approach. IEEE Parallel Distrib. Tech. 3(4), 34–43 (1995)

    Article  Google Scholar 

  32. Schaelicke, L., Parker, M.: ML-RSIM reference manual. Tech. Report 02-10, Department of Computer Science and Engineering, Univ. of Notre Dame, Notre Dame, ID (2002)

  33. SMART group at the U. of Southern California. FlexSim 1.2. Available at http://ceng.usc.edu/smart/FlexSim/flexsim.html (May 2008)

  34. The Chaotic Routing Project at the U. of Washington. Chaos Router Simulator. Available at http://www.cs.washington.edu/research/projects/lis/chaos/www/chaos.html (May 2008)

  35. The Network Simulator ns-2. Available at http://www.isi.edu/nsnam/ns/ (May 2008)

  36. Thottethodi, M., Lebeck, A.R., Mukherjee, S.S.: Exploiting global knowledge to achieve self-tuned congestion control for k-ary n-cube networks. IEEE Trans. Parallel Distrib. Syst. 15(3), 257–272 (2004)

    Article  Google Scholar 

  37. Wenisch, T.F., Wunderlich, R.E., Ferdman, M., Ailamaki, A., Falsafi, B., Hoe, J.C.: SimFlex: statistical sampling of computer system simulation. IEEE Micro 26(4), 18–31 (2006)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jose Miguel-Alonso.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ridruejo, F.J., Miguel-Alonso, J. & Navaridas, J. Full-system simulation of distributed memory multicomputers. Cluster Comput 12, 309–322 (2009). https://doi.org/10.1007/s10586-009-0086-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-009-0086-y

Keywords

Navigation