Abstract
Many Internet applications employ multi-tier software architectures. The performance of such applications is typically measured by their end-to-end response times. Most earlier work on modeling these response times has been limited to the mean. However, because user-perceived performance is strongly influenced by the variability in response times, their variance is important as well.
We first develop a simple model for the end-to-end response times of multi-tier Internet applications and validate it using real data from two large-scale applications that are widely deployed on the Internet. Second, we derive exact and approximate expressions for the mean and the variance, respectively, of the end-to-end response times. Extensive numerical validation shows that the approximations match simulation results very well. These results are therefore useful for capacity planning and performance prediction of large-scale multi-tier Internet applications.
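As a rough illustration of why the variance matters alongside the mean, the sketch below computes both for a deliberately simplified stand-in model. It is not the model developed in this paper. It assumes a tandem of independent M/M/1 first-come-first-served tiers with a common Poisson arrival rate, for which the per-tier sojourn times are known to be independent and exponentially distributed with rate `mu - lambda`. The function name `tandem_response_stats` and the example rates are hypothetical.

```python
import math
from typing import Sequence, Tuple


def tandem_response_stats(
    arrival_rate: float, service_rates: Sequence[float]
) -> Tuple[float, float]:
    """Mean and variance of the end-to-end response time.

    Assumption (not the paper's model): a tandem of M/M/1 FCFS tiers.
    The sojourn time at tier i is then exponential with rate
    (mu_i - arrival_rate), and sojourn times at different tiers are
    independent, so means and variances simply add up.
    """
    mean = 0.0
    variance = 0.0
    for mu in service_rates:
        if mu <= arrival_rate:
            raise ValueError("each tier needs service rate > arrival rate")
        slack = mu - arrival_rate      # rate of the exponential sojourn time
        mean += 1.0 / slack            # mean of Exp(slack)
        variance += 1.0 / slack ** 2   # variance of Exp(slack)
    return mean, variance


if __name__ == "__main__":
    # Hypothetical three-tier application: 2 requests/s arriving,
    # tiers serving 5, 4 and 10 requests/s.
    mean, variance = tandem_response_stats(2.0, [5.0, 4.0, 10.0])
    print(f"mean = {mean:.4f} s, std dev = {math.sqrt(variance):.4f} s")
```

In this toy setting the slowest tier dominates the variance even more strongly than it dominates the mean, because each tier contributes the square of its mean sojourn time.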
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bhulai, S., Sivasubramanian, S., van der Mei, R., van Steen, M. (2007). Modeling and Predicting End-to-End Response Times in Multi-tier Internet Applications. In: Mason, L., Drwiega, T., Yan, J. (eds) Managing Traffic Performance in Converged Networks. ITC 2007. Lecture Notes in Computer Science, vol 4516. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72990-7_47
DOI: https://doi.org/10.1007/978-3-540-72990-7_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72989-1
Online ISBN: 978-3-540-72990-7
eBook Packages: Computer Science (R0)