Abstract
Many Internet traffic properties follow long-tailed distributions, which are usually modeled theoretically by the Lognormal, Weibull, or Pareto distribution. However, these models are difficult to apply directly in traffic analysis and performance evaluation studies because of their complex representations and theoretical properties.
This paper proposes a Hyper-Erlang model (mixed Erlang distribution) for approximating such long-tailed network traffic. The model fits network traffic with long-tailed characteristics directly to a mixed Erlang distribution to facilitate further analysis. Compared with the well-known hyperexponential-based method, the mixed Erlang model fits the tail behavior more accurately and is also computationally efficient. Further investigation of the M/G/1 queueing behavior also demonstrates the efficiency of the mixed Erlang based approximation.
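For orientation, the sketch below shows what a mixed Erlang (Hyper-Erlang) distribution looks like in code and how its moments feed an M/G/1 calculation. It uses the textbook definition of a finite mixture of Erlang branches and the standard Pollaczek-Khinchine formula for the mean waiting time; it is not the fitting procedure of the paper, which the abstract does not describe. The function names and the parameters `weights`, `shapes`, and `rates` are assumptions introduced here for illustration.

```python
import math

# Assumed parametrization (not taken from the paper): branch i has mixing
# weight weights[i], integer shape shapes[i], and rate rates[i].

def hyper_erlang_pdf(x, weights, shapes, rates):
    """Density of a finite mixture of Erlang distributions at x >= 0."""
    return sum(
        p * lam**k * x**(k - 1) * math.exp(-lam * x) / math.factorial(k - 1)
        for p, k, lam in zip(weights, shapes, rates)
    )

def hyper_erlang_ccdf(x, weights, shapes, rates):
    """Tail probability P(X > x), the quantity compared in tail fitting."""
    total = 0.0
    for p, k, lam in zip(weights, shapes, rates):
        partial = sum((lam * x)**j / math.factorial(j) for j in range(k))
        total += p * math.exp(-lam * x) * partial
    return total

def hyper_erlang_moment(n, weights, shapes, rates):
    """n-th raw moment: sum of p * k(k+1)...(k+n-1) / rate**n per branch."""
    total = 0.0
    for p, k, lam in zip(weights, shapes, rates):
        rising = math.prod(k + i for i in range(n))
        total += p * rising / lam**n
    return total

def mg1_mean_wait(arrival_rate, weights, shapes, rates):
    """Mean queueing delay of an M/G/1 queue with Hyper-Erlang service
    times, from the Pollaczek-Khinchine formula."""
    m1 = hyper_erlang_moment(1, weights, shapes, rates)
    m2 = hyper_erlang_moment(2, weights, shapes, rates)
    rho = arrival_rate * m1  # server utilization
    if rho >= 1.0:
        raise ValueError("queue is unstable: utilization must be below 1")
    return arrival_rate * m2 / (2.0 * (1.0 - rho))

# Illustrative (hypothetical) two-branch mixture, not fitted to any trace.
weights, shapes, rates = [0.5, 0.5], [1, 2], [1.0, 2.0]
tail_at_3 = hyper_erlang_ccdf(3.0, weights, shapes, rates)
wait = mg1_mean_wait(0.5, weights, shapes, rates)
```

Because every quantity above is a finite sum of elementary terms, moments and queueing measures are available in closed form once the mixture parameters are known, which is the practical appeal of phase-type approximations over the Pareto or Weibull forms.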
Additional information
An abbreviated version of this manuscript was presented at the 3rd International Symposium on Parallel and Distributed Processing and Applications (ISPA’05), Nanjing, P.R. China, November 2005.
Cite this article
Wang, J., Zhou, H., Zhou, M. et al. A general model for long-tailed network traffic approximation. J Supercomput 38, 155–172 (2006). https://doi.org/10.1007/s11227-006-7944-7
DOI: https://doi.org/10.1007/s11227-006-7944-7