A model for characterizing the scalability of distributed systems

Published: 01 July 2005

Abstract

Scalability is an important issue in the construction of distributed systems. A number of theoretical and experimental studies have examined the scalability of distributed systems. However, they have either focused on specific technologies or studied scalability in isolation. The main conjecture of our work is that scalability must be considered together with the related issues of availability, synchronization, and consistency. In this context, we propose a scalability model that characterizes scalability as dependent on these factors as well as on the workload and faultload. The model is generic and can be used to compare the scalability of similar systems. We illustrate this with a comparison between NFS and AFS, two well-known distributed file systems. The model is also useful for identifying scalability bottlenecks in distributed systems. We have applied the model to optimize Virat, a wide-area shared object space that we have built.

Published In

ACM SIGOPS Operating Systems Review, Volume 39, Issue 3
July 2005
93 pages
ISSN: 0163-5980
DOI: 10.1145/1075395

Publisher

Association for Computing Machinery, New York, NY, United States