Abstract
Telecommunications networks are often managed by a large number of management centers, each responsible for a logically autonomous part of the network. This could be a small subnetwork such as an Ethernet, a Token Ring or an FDDI ring, or a large subnetwork comprising many smaller networks. In response to a single fault in a telecommunications network, many network elements may raise alarms, which are typically reported only to the subarea management center that contains the network element raising the alarm. As a result, a particular management center has a partial view of the status of the network. Management Centers must therefore cooperate in order to correctly infer the real cause of the failure. The algorithms proposed in this paper outline the way these management centers could collaborate in correlating alarms and identifying faults.
Similar content being viewed by others
References
A. Bouloutas, S. Calo, and A. Finkel, Alarm correlation and fault identification in communication networks.IEEE Trans. Comm. Vol. 42, 523–533, 1994.
Robert H. Deng, Aurel Lazar, and Weiguo Wang, A probabilistic approach to fault diagnosis in linear lightwave networks.IEEE JSAC, Vol. 11, 1438–1449, 1993.
J. F. Jordan and M. E. Paterok, Event correlation in heterogeneous networks using the OSI management framework.Third IEEE/IFIP Int'l. Symp. on Integrated Network Managment, San Francisco, April 18–23, 1993.
Irene Katzela and Mischa Schwartz, Schemes for fault identification in communication networks. Technical Report CU/CTR/TR 362-49-09, CTR-Columbia University, 1994.
Marc Reise. Diagnosis of communication systems: dealing with incompleteness and uncertainly.Qualitative Reasoning and Naive Physics, pp. 1480–1485.
Marc Reise, Model-Based diagnosis of networks: problem characterization and survey.OEGAI-91 Workshop on Model Based Reasoning, Viennai, 1991.
Clark Wang and Mischa Schwartz, Fault detection with multiple observers.IEEE Infocom, Vol. 3, 2187–2196, 1992.
Author information
Authors and Affiliations
Additional information
Work done while the author was with the IBM T. J. Watson Research Center, New York.
Work done while the author was with the IBM T. J. Watson Research, Center, New York.
Work done during the author's internship at the IBM T. J. Watson Research Center, New York, Summer 93.
Rights and permissions
About this article
Cite this article
Bouloutas, A.T., Calo, S.B., Finkel, A. et al. Distributed fault identification in telecommunication networks. J Netw Syst Manage 3, 295–312 (1995). https://doi.org/10.1007/BF02138931
Issue Date:
DOI: https://doi.org/10.1007/BF02138931