Abstract
This paper proposes a novel replication architecture for stateful application servers that offers an integrated solution for fault-tolerance and load-distribution. Each application server replica is able to execute client requests and at the same time serves as backup for other replicas. We propose an effective load balancing mechanism that is only load-aware if a server is close to become overloaded. Furthermore, we present transparent reconfiguration algorithms that guarantee that each replica has the same number of backups in a dynamic environment where replicas can join or leave at any time. Our evaluation shows that our approach scales and distributes load across all servers even in heterogeneous environments while keeping the overhead for fault-tolerance and load-balancing small.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Narasimhan, P., Moser, L.E., Melliar-Smith, P.M.: Strongly consistent replication and recovery of fault-tolerant CORBA applications. Journal of Computer System Science and Engineering 32(8) (2002)
Barga, R., Lomet, D., Weikum, G.: Recovery guarantees for general multi-tier applications. In: Int. Conf. on Data Engineering, ICDE (2002)
Wu, H., Kemme, B.: Fault-tolerance for stateful application servers in the presence of advanced transactions patterns. In: IEEE Symp. on Reliable Distrib. Systems, SRDS (2005)
Schmidt, D.C., Levine, D.L., Mungee, S.: The design of the TAO real-time object request broker. Computer Communications 21(4) (1998)
Wu, H., Kemme, B., Maverick, V.: Eager replication for stateful J2EE servers. In: Int. Symp. on Distributed Objects and Applications, DOA (2004)
Chockler, G.V., Keidar, I., Vitenberg, R.: Group communication specifications: A comprehensive study. ACM Computing Surveys 33(4) (2001)
Fleury, M., Reverbel, F.: The JBoss extensible server. In: Middleware (2003)
Wu, H., Kemme, B.: A unified framework for load distribution and fault-tolerance of application servers. Technical Report SOCS-TR-2009.1, McGill University (2009)
Amir, Y., Danilov, C., Miskin-Amir, M., Schultz, J., Stanton, J.: The Spread toolkit: Architecture and performance. Technical Report CNDS-2004-1, Johns Hopkins Univ. (2004)
Andreolini, M., Colajanni, M., Morselli, R.: Performance study of dispatching algorithms in multi-tier web architectures. SIGMETRICS Performance Evaluation Review (2002)
Pai, V.S., Aron, M., Banga, G., Svendsen, M., Druschel, P., Zwaenepoel, W., Nahum, E.: Locality-aware request distribution in cluster-based network servers. In: Int. Conf. on Architectural Support for Programming Languages and Operating Systems (1998)
Liu, X., Zhu, X., Padala, P., Wang, Z., Singhal, S.: Optimal multivariate control for differentiated services on a shared hosting platform. In: IEEE Conf. on Decision and Contr. (2005)
Singh, A.V., Moser, L.E., Melliar-Smith, P.M.: Integrating fault tolerance and load balancing in distributed systems based on CORBA. In: Dal Cin, M., Kaâniche, M., Pataricza, A. (eds.) EDCC 2005. LNCS, vol. 3463, pp. 154–166. Springer, Heidelberg (2005)
Othman, O., Schmidt, D.C.: Optimizing distributed system performance via adaptive middleware load balancing. In: ACM SIGPLAN Workshop on Optimization of Middleware and Distributed Systems (2001)
Renesse, R.V., Schneider, F.B.: Chain replication for supporting high throughput and availability. In: Symp. on Operating System Design and Implementation, OSDI (2004)
Satyanarayanan, M., Kistler, J.J., Kumar, P., Okasaki, M.E., Siegel, E.H., Steere, D.C.: Coda: A highly available file system for a distributed workstation environment. IEEE Trans. Computers 39(4), 447–459 (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, H., Kemme, B. (2009). A Unified Framework for Load Distribution and Fault-Tolerance of Application Servers. In: Sips, H., Epema, D., Lin, HX. (eds) Euro-Par 2009 Parallel Processing. Euro-Par 2009. Lecture Notes in Computer Science, vol 5704. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03869-3_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-03869-3_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03868-6
Online ISBN: 978-3-642-03869-3
eBook Packages: Computer ScienceComputer Science (R0)