Abstract
In this paper we discuss environments for the full-system simulation of multicomputers. These environments are composed of a large collection of modules that simulate the compute nodes and the network, plus additional linking elements that perform communication and synchronization. We present our own environment, in which we integrate Simics with INSEE. We reuse as many Simics modules as possible to reduce the effort of hardware modeling, and also to simulate standard machines running unmodified operating systems. This way we avoid the error-prone effort of developing drivers and libraries. The environment we propose in this paper enables us to show some of the difficulties we found when integrating diverse tools, and how we were able to overcome them. Furthermore we show some important details to have into account in order to do a valid full-system simulation of multicomputers, mostly related with synchronization and timing. Thus, a trade-off has to be found between simulation speed and accuracy of results.
Similar content being viewed by others
References
Adiga, N.R., et al.: Blue Gene/L torus interconnection network. IBM J. Res. Dev. 49(2/3) (2005)
Binkert, N.L., Hallnor, E.G., Reinhardt, S.K.: Network-oriented full-system simulation using M5. In: Sixth Workshop on Computer Architecture Evaluation using Commercial Workloads (CAECW), Feb. 2003
Binkert, N.L., Dreslinski, R.G., Hsu, L.R., Lim, K.T., Saidi, A.G., Reinhardt, S.K.: The M5 simulator: modeling networked systems. IEEE Micro 26(4), 52–60 (2006)
Boden, N.J., Cohen, D., Felderman, R.E., Kulawik, A.E., Seitz, C.L., : Myrinet. A gigabit per second local area network. IEEE Micro 15(1), 29–36 (1995)
García, P.J., Quiles, F.J., Flich, J., Duato, J., Jhonson, I., Naven, F.: Efficient, scalable congestion management for interconnection networks. IEEE Micro 26(5), 52–66 (2006)
IBM: IBM full-system simulator for the cell broadband engine processor. Available at http://alphaworks.ibm.com/tech/cellsystemsim (May 2008)
Jacobson, V.: Congestion avoidance and control. Comput. Commun. Rev. 18(4), 314–329 (1988)
Jain, R.: Congestion control in computer networks: issues and trends. IEEE Netw. 4(3), 24–30 (1990)
Karlin, S., Peterson, L.: Maximum packet rates for full-duplex ethernet. Technical Report TR–645–02, Princeton University (February 2002)
LA-MPI Home Page: The Los Alamos message passing interface. Available at http://public.lanl.gov/lampi/ (May 2008)
LAM/MPI Home Page: LAM/MPI parallel computing. Available at http://www.lam-mpi.org/ (Apr. 2008)
Liu, J., Wu, J., Panda, D.K.: High performance RDMA-based MPI implementation over infiniBand, Int. J. Parallel Program. (2004)
Magnusson, P.S., Christensson, M., Eskilson, J., Forsgren, D., Hallberg, G., Hogberg, J., Larsson, F., Moestedt, A., Werner, B.: Simics: a full system simulation platform. IEEE Comput. 35(2), 50–58 (2002)
Martin, M.M.K., : Multifacet’s general execution-driven multiprocessor simulator (GEMS) toolset. Sigarch Comput. Archit. News 33(4), 92–99 (2005)
Mauer, C.J., Hill, M.D., Wood, D.A.: Full-system timing-first simulation. In: ACM SIGMETRICS, June 2002
Miguel-Alonso, J., Izu, C., Gregorio, J.A.: Improving the performance of large interconnection networks using congestion-control mechanisms. Perform. Eval. 65, 203–211 (2008)
Miguel-Alonso, J., Navaridas, J., Ridruejo, F.J.: Interconnection network simulation using traces of MPI applications. Int. J. Parallel. Program. (to appear). DOI 10.1007/s10766-008-0089-y
MPI Forum: MPICH home page. Available at http://www-unix.mcs.anl.gov/mpi/mpich/ (May 2008)
Myricom Documentation and Software Downloads. Available at http://www.myri.com/scs/ (May 2008)
NASA Advanced Supercomputing (NAS) division: NAS Parallel Benchmarks. Available at http://www.nas.nasa.gov/Resources/Software/npb.html (May 2008)
Navaridas, J., Ridruejo, F.J., Miguel-Alonso, J.: Evaluation of interconnection networks using full-system simulators: lessons learned. In: Proc. 40th Annual Simulation Symposium, Norfolk, VA, March 26–28, 2007
Navaridas, J., Miguel-Alonso, J., Ridruejo, F.J.: On synthesizing workloads emulating MPI applications. In: The 9th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC-08). April 14–18, 2008, Miami, Florida, USA
OPNET Technologies, Inc. corporate web page, available at http://www.opnet.com (May 2008)
Pai, V.S., Ranganathan, P., Adve, S.V.: RSIM: an execution-driven simulator for ILP-based shared-memory multiprocessors and uniprocessors. In: IEEE TCCA New., Oct. 1997
Pfister, G.F.: Aspects of the InfiniBand(tm) architecture. In: Third IEEE International Conference on Cluster Computing (CLUSTER’01), October 2001, pp. 369
Puente, V., Izu, C., Gregorio, J.A., Beivide, R., Vallejo, F.: The adaptive bubble router. J. Parallel Distrib. Comput. 61(9), 1180–1208 (2001)
Puente, V., Gregorio, J.A., Beivide, R.: SICOSYS: an integrated framework for studying interconnection network in multiprocessor systems. In: Proceedings of the IEEE 10th Euromicro Workshop on Parallel and Distributed Processing, Gran Canaria, Spain (2002)
Puente, V., Gregorio, J.A., Vallejo, F., Beivide, R.: Immunet: a cheap and robust fault-tolerant packet routing mechanism. In: International Symposium on Computer Architecture (ISCA), June 2004, pp. 198–211
Ridruejo, F.J., Miguel-Alonso, J.: INSEE: an interconnection network simulation and evaluation environment. In: Proc. Euro-Par 2005. Lecture Notes in Computer Science, vol. 3648, pp. 1014–1023
Ridruejo, F.J., Gonzalez, A., Miguel-Alonso, J.: TrGen: a traffic generation system for interconnection network simulators. In: International Conference on Parallel Processing, 2005. 1st. Int. Workshop on Performance Evaluation of Networks for Parallel, Cluster and Grid Computing Systems (PEN-PCGCS’05). ICPP 2005 Workshops, 14–17 June 2005, pp. 547–553
Rosenblum, M., : Complete computer system simulation: the SimOS approach. IEEE Parallel Distrib. Tech. 3(4), 34–43 (1995)
Schaelicke, L., Parker, M.: ML-RSIM reference manual. Tech. Report 02-10, Department of Computer Science and Engineering, Univ. of Notre Dame, Notre Dame, ID (2002)
SMART group at the U. of Southern California. FlexSim 1.2. Available at http://ceng.usc.edu/smart/FlexSim/flexsim.html (May 2008)
The Chaotic Routing Project at the U. of Washington. Chaos Router Simulator. Available at http://www.cs.washington.edu/research/projects/lis/chaos/www/chaos.html (May 2008)
The Network Simulator ns-2. Available at http://www.isi.edu/nsnam/ns/ (May 2008)
Thottethodi, M., Lebeck, A.R., Mukherjee, S.S.: Exploiting global knowledge to achieve self-tuned congestion control for k-ary n-cube networks. IEEE Trans. Parallel Distrib. Syst. 15(3), 257–272 (2004)
Wenisch, T.F., Wunderlich, R.E., Ferdman, M., Ailamaki, A., Falsafi, B., Hoe, J.C.: SimFlex: statistical sampling of computer system simulation. IEEE Micro 26(4), 18–31 (2006)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ridruejo, F.J., Miguel-Alonso, J. & Navaridas, J. Full-system simulation of distributed memory multicomputers. Cluster Comput 12, 309–322 (2009). https://doi.org/10.1007/s10586-009-0086-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-009-0086-y