Abstract
Considerable research and development has been invested in software Distributed Shared Memory (DSM). The primary focus of this work has traditionally been on high performance and consistency protocols. Unfortunately, clusters present a number of challenges for any DSM system that cannot be solved through consistency protocols alone. These challenges relate to the ability of a DSM system to adjust to load fluctuations, to handle computers being added to or removed from the cluster, to deal with faults, and to use DSM objects larger than the available physical memory. This paper introduces the Synergy DSM System and its integration with the virtual memory, group communication and process migration services of the Genesis Cluster Operating System.
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hobbs, M., Silcock, J., Goscinski, A. (2003). Synergy: A Comprehensive Software Distributed Shared Memory System. In: Guo, M., Yang, L.T. (eds) Parallel and Distributed Processing and Applications. ISPA 2003. Lecture Notes in Computer Science, vol 2745. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-37619-4_25
DOI: https://doi.org/10.1007/3-540-37619-4_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40523-8
Online ISBN: 978-3-540-37619-4
eBook Packages: Springer Book Archive