Abstract
As part of its HiPer-D Program, the United States Navy is developing an experimental distributed system that achieves survivability by dynamically reconfiguring itself using replicated components and resources. To enable reconfiguration, resource monitors observe the behavior of the system and report it to a resource manager, which makes reconfiguration decisions based on this information. Because every reconfiguration decision depends on data obtained from resource monitors, and because the network is the common resource linking all components of the distributed system, this paper focuses specifically on network resource monitoring. A generalized network resource monitor architecture is proposed, and two instantiations of it are presented. The first is based on custom-developed tools tailored to a specific application, while the second is based on commercially available products (e.g., SNMP and RMON). Scalability, intrusiveness, and fidelity are identified as the evaluation criteria against which implementation trade-offs are made. The paper presents the results of initial experiments as well as future research directions.
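The monitor-to-manager reporting loop described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the `NetworkReport` fields, the `ResourceManager` class, the path names, and the thresholds are all hypothetical, chosen only to show how a manager might act on monitor reports.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class NetworkReport:
    """One observation from a network resource monitor (hypothetical schema)."""
    path: str          # network path or replica link that was observed
    latency_ms: float  # measured latency in milliseconds
    loss_rate: float   # fraction of lost messages, 0.0 to 1.0


class ResourceManager:
    """Keeps the latest report per path and picks a healthy path."""

    def __init__(self, max_latency_ms: float, max_loss_rate: float) -> None:
        # Thresholds are illustrative assumptions, not values from the paper.
        self.max_latency_ms = max_latency_ms
        self.max_loss_rate = max_loss_rate
        self.latest: Dict[str, NetworkReport] = {}

    def report(self, r: NetworkReport) -> None:
        """Called by a monitor; only the most recent report per path is kept."""
        self.latest[r.path] = r

    def healthy(self, path: str) -> bool:
        """A path with no report yet is treated as unhealthy."""
        r = self.latest.get(path)
        return (r is not None
                and r.latency_ms <= self.max_latency_ms
                and r.loss_rate <= self.max_loss_rate)

    def choose_path(self, current: str, candidates: List[str]) -> Optional[str]:
        """Stay on the current path if it is healthy; otherwise return the
        healthy candidate with the lowest latency, or None if there is none."""
        if self.healthy(current):
            return current
        ok = [p for p in candidates if self.healthy(p)]
        if not ok:
            return None
        return min(ok, key=lambda p: self.latest[p].latency_ms)


manager = ResourceManager(max_latency_ms=10.0, max_loss_rate=0.01)
manager.report(NetworkReport("primary", latency_ms=42.0, loss_rate=0.0))
manager.report(NetworkReport("replica-a", latency_ms=3.5, loss_rate=0.0))
manager.report(NetworkReport("replica-b", latency_ms=2.0, loss_rate=0.2))
selected = manager.choose_path("primary", ["replica-a", "replica-b"])
print(selected)
```

In this toy run the primary path exceeds the latency threshold and one replica exceeds the loss threshold, so the manager falls back to the remaining replica. A real monitor would also have to weigh the scalability, intrusiveness, and fidelity criteria named in the abstract, which this sketch does not model.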
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Irey, P.M., Hott, R.W., Marlow, D.T. (1998). An architecture for network resource monitoring in a distributed environment. In: Rolim, J. (eds) Parallel and Distributed Processing. IPPS 1998. Lecture Notes in Computer Science, vol 1388. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64359-1_781
DOI: https://doi.org/10.1007/3-540-64359-1_781
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64359-3
Online ISBN: 978-3-540-69756-5
eBook Packages: Springer Book Archive