Abstract
Packet header traces are widely used in network analysis. Header traces are the aggregate of traffic from many concurrent applications. We present a methodology, based on machine learning, that can break the trace down into clusters of traffic where each cluster has different traffic characteristics. Typical clusters include bulk transfer, single and multiple transactions and interactive traffic, amongst others. The paper includes a description of the methodology, a visualisation of the attribute statistics that aids in recognising cluster types and a discussion of the stability and effectiveness of the methodology.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Kolence, K., Kiviat, P.: Software Unit Profiles and Kiviat Figures. ACM SIGMETRICS, Performance Evaluation Review 2(3), 2–12 (1973)
Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (2000)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, Heidelberg (2001)
Smyth, P.: Clustering Using Monte Carlo Cross-Validation. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pp. 126–133. AAAI Press, Menlo Park (1996)
Hartigan, J.: Clustering Algorithms. John Wiley, Chichester (1975)
Dempster, A., Laird, N., Rubin, D.: Maximum Likelihood from Incomplete Data Via the EM Algorithm. Journal of the Royal Statistical Society Series B 30(1), 1–38 (1977)
Claffy, K., Braun, H.-W., Polyzos, G.: Internet traffic flow profiling Applied Network Research, San Diego Supercomputer Center (1994), Available at: http://www.caida.org/outreach/papers/
Mochalski, K., Micheel, J., Donnelly, S.: Packet Delay and Loss at the Auckland Internet Access Path. In: Proceedings of the PAM 2002 Passive and Active Measurement Conference, Fort Collins, Colorado USA (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
McGregor, A., Hall, M., Lorier, P., Brunskill, J. (2004). Flow Clustering Using Machine Learning Techniques. In: Barakat, C., Pratt, I. (eds) Passive and Active Network Measurement. PAM 2004. Lecture Notes in Computer Science, vol 3015. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24668-8_21
Download citation
DOI: https://doi.org/10.1007/978-3-540-24668-8_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21492-2
Online ISBN: 978-3-540-24668-8
eBook Packages: Springer Book Archive