Abstract
Fault management systems detect performance problems and intermittent failures by periodically examining a metric (such as the utilization of a link),and raising an alarm if the value is above a threshold. Such systems can generate numerous alarms. Various schemes have been proposed for reducing the number of alarms,or filtering out the important ones. The time over threshold detection algorithm reduces the volume of alarms at the source detector. This paper describes an experiment that compares time over threshold against simple threshold crossings. The experiment demonstrates that it reduces the number of alarms raised by a factor of 25 to 1 without any significant reduction in the problems detected.
Chapter PDF
Similar content being viewed by others
References
Bondavalli, A., Chiaradonna, S., Di Giandomenico, F., Grandoni, F.,Threshold-Based Mechanisms to Discriminate Transient from Intermittent Faults,IEEE Trans.on Computers,v 49, no 3, Mar 2000, pp.230–245.
Hellerstein, J. Zhang, F., and Shahabuddin, P., An Approach to Predictive Detection for Service Management,Integrated Network Management VI, Edited by Sloman, M., Mazumdar, S., and Lupu, E.,1999, IEEE Publishing, pp.309–322.
Huntington-Lee, J., Terplan, K., and Gibson, J.,HP OpenView:A Managers Guide, McGraw-Hill, New York, NY, 1997, pp.137–9.
Lelend, W., Taqqu, M., Willinger, W., Wilson, D.,On the Self-Similar Nature of Ethernet Traffic (ExtendedVersion), IEEE/ACM Trans.on Networking, v.2, no.1, Feb 1994, pp.1–15.
ISO/IEC 10164-11:1994 Information Technology —Open Systems Interconnection —Systems Management:Metric Objects and Attributes.
Maggiora, P., Elliott, C., Pavone, R., Phelps, K., and Thompson, J., Performance and Fault Management,Cisco Press, Indianapolis, IN, 2000, pp.91–97.
Thottan, M., and Ji, C., Adaptive Thresholding for Proactive Network Problem Detection, Third IEEE International Workshop on Systems Management, Newport, RI, Apr 22–24, 1998, pp.108–116.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sylor, M., Meng, L. (2000). Using Time over Threshold to Reduce Noise in Performance and Fault Management Systems. In: Ambler, A., Calo, S.B., Kar, G. (eds) Services Management in Intelligent Networks. DSOM 2000. Lecture Notes in Computer Science, vol 1960. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44460-2_13
Download citation
DOI: https://doi.org/10.1007/3-540-44460-2_13
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41427-8
Online ISBN: 978-3-540-44460-2
eBook Packages: Springer Book Archive