Skip to main content
Log in

A general model for long-tailed network traffic approximation

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

The long-tailed distribution characterizes many Internet traffic properties which are often modeled by Lognormal distribution, Weibull or Pareto distribution theoretically. However, it is rather difficult to directly apply these models in traffic analysis and performance evaluation studies due to their complex representations and theoretical properties.

This paper proposes a Hyper-Erlang Model (Mixed Erlang distribution) for such long-tailed network traffic approximation. It fits network traffic with long-tailed characteristic into a mixed Erlang distribution directly to facilitate our further analysis. Compared with the well-known hyperexponential based method, the mixed Erlang model is more accurate in fitting the tail behavior and also computationally efficient. Further investigations on the M/G/1 queueing behavior also prove the efficiency of the Mixed Erlang based approximation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Leland W, Taqqu M, Willinger W et al (1994) On the self-similar nature of ethernet traffic (extended version). IEEE/ACM Transactions on Networking 2(1):1–15

    Article  MATH  Google Scholar 

  2. Paxson V, Floyd S (1995) Wide-area traffic: The failure of poission modeling. IEEE/ACM Transactions on Networking 3(3):226–244

    Article  Google Scholar 

  3. Crovella ME, Bestavros A (1997) Self-similarity in world wide web traffic: Evidence and possible causes. IEEE/ACM Transactions on Networking 5(6):835–846

    Article  Google Scholar 

  4. Kalyanaraman S, Vandalore B, Jain R, et al (1998) Performance of TCP over ABR with long-range dependent VBR background traffic over terrestrial and satellite ATM networks. In Proceedings of the 23rd Annual Conference on Local Computer Networks, (LCN 1998), Lowell, MA, pp. 70–78

  5. Asaka T, Ori K, Yamamoto H (2003) Method of estimating flow duration distribution using active measurements. IEICE Transactions on Communications E86-B(10):3030–3037

    Google Scholar 

  6. Ata S, Murata M, Miyahara H (2000) Analysis of network traffic and its application to design of high-speed routers. IEICE Transactions on Information and Systems, E83-D(5):988–995

    Google Scholar 

  7. Downey AB (2001) Evidence for long-tailed distributions in the internet. In Proceedings of ACM SIGCOMM Internet Measurement Workshop 2001 (IMW 2001), San Diego, CA, USA

  8. Horvath A, Telek M (2000) Approximating heavy tailed behavior with phase type distribution. In Proceedings of the 3rd International Conference on Matrix-Analytic Methods in Stochastic Models, Leuven, Belgium.

  9. Feldmann A, Whitt W (1998) Fitting mixtures of exponentials to long-tailed distributions to analyze network performance models. Performance Evaluation, 31(3–4):245–279

    Article  Google Scholar 

  10. El R, Khayari A, Sadre R, Haverkort BR (2003) Fitting world-wide web request traces with the EM-algorithm. Performance Evaluation 52(2–3):175–191

    Google Scholar 

  11. Riska A, Diev V, Smirni E (2004) An EM-based technique for approximating long-tailed data sets with PH distributions. Performance Evaluation, 55(1–2):147–164

    Article  Google Scholar 

  12. Bilmes JA (1998) A gentle tutorial of the EM algorithm and its application to parameter estimation for gaussian mixture and hidden markov models. Technical Report, TR-97-021, International Computer Science Institue, Berkeley CA, April

  13. Fang Y, Chlamtac I (1999) Teletraffic analysis and mobility modeling of PCS networks. IEEE Transactions on Communications 47(7):1062–1073

    Article  Google Scholar 

  14. Kelly F (2004) Reversibility and stochastic networks. New York, Wiley, http://www.statslab.cam.ac.uk/frank/rsn.html

  15. Klemm A, Lindemann C, Lohmann M (2003) Modeling IP traffic using the batch markovian arrival process. Performance Evaluation, 54:149–173

    Article  Google Scholar 

  16. NASA HTTP Traces (2004) http://ita.ee.lbl.gov/html/contrib/NASA-HTTP.html.

  17. Tang Y, Tang X (2000) Queueing theory—Fundamentals and applications. Publishing house of University of Electric Science and Technology of China

  18. Shortle JF, Fischer MJ, Gross D, et al (2003) Using the transform approximation method to analyzed queues with heavy-tailed service. Journal of Probability and Statistical Science 1(1):15–27

    Google Scholar 

  19. Starobinski D, Sidi M (2000) Modeling and analysis of power-tail distributions via classical teletraffic methods. Queueing Systems 36(1–3):243–267

    Article  MATH  MathSciNet  Google Scholar 

  20. Greiner M, Jobmann M, Lipsky L (1999) The importance of power-tail distributions for modeling queueing systems. Operations Research 47(2):313–326

    Article  MATH  Google Scholar 

  21. Wang J, Zhou M, Zhou H (2004) Clock synchronization for internet measurements: A clustering algorithm. Computer Networks 45:731–741

    Article  MATH  Google Scholar 

  22. Markovitch NM, Krieger UR (2000) Nonparametric estimation of long-tailed density functions and its application to the analysis of world wide web traffic. Performance Evaluation 42:205–222

    Article  MATH  Google Scholar 

  23. Bertolotti L, Calzarossa MC (2001) Models of mail server workloads. Performance Evaluation 46:65–76

    Article  MATH  Google Scholar 

  24. Bobbio A, Horvath A, Telek M. (2004). The scale factor: A new degree of freedom in phase-time approximation. Performance Evaluation 56:121–144

    Article  Google Scholar 

  25. Bobbio A, Cumani A (1992) ML estimation of the parameters of a PH distribution in triangular canonical form. Performance Evaluation 33–46

  26. Garetto M, and Towsley D (2003) Modeling simulation and measurements of queuing delay under long-tail internet traffic. In Proceedings of the International Conference on Measurements and Modeling of Computer Systems, (ACM SIGMETRICS 2003), San Diego, CA, USA, pp. 47–57

  27. Hernández-Campos F, Marron JS, Samorodnitsky G. et al (2004) Variable heavy tails in Internet traffic. Performance Evaluation 58:261–284

    Article  Google Scholar 

  28. Markovich, NM (2005) High quantile estimation for heavy-tailed distributions. Performance Evaluation 62:178–192

    Article  Google Scholar 

  29. Xia CH, Liu Z, Squillante MS et al (2005) Web traffic modeling at finer time scales and performance implications. Performance Evaluation 61:181–201

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Junfeng Wang.

Additional information

An abbreviated version of this manuscript has been presented at the 3rd International Symposium on Parallel and Distributed Processing and Applications (ISPA’05), Nanjing, P.R.China, November 2005.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, J., Zhou, H., Zhou, M. et al. A general model for long-tailed network traffic approximation. J Supercomput 38, 155–172 (2006). https://doi.org/10.1007/s11227-006-7944-7

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-006-7944-7

Keywords

Navigation