Abstract
Monitoring systems are nowadays ubiquitous in complex environments, such as Grids. Their use is fundamental for performance evaluation, problem spotting, advanced debugging and per-use accounting. Building such systems raises challenging issues, like data gathering from Grid components, low intrusiveness, ease of use, adaptive data visualization, fault-tolerance and self-maintenance. This paper presents a new layered architecture, named Toytle, specifically designed to address these issues in the context of control Grids. All their components, from computing and network resources to complete physical processes with soft time constraints, can be monitored with Toytle. The architecture’s layers, namely the distributed core, the hierarchical connections and the local monitors, have been designed to ensure scalability, high-speed sampling and efficient dealing with large data bursts. The future Toytle implementation will adapt existing tools and also create entirely new modules.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Balis, B., Bubak, M., Furnika, W., Szepieniec, T., Wissmueller, R., Radecki, M.: Monitoring Grid applications with Grid-enabled OMIS monitor. In: Proceedings of the First European Grids Conference, AxGrid 2003, Santiago de Compostela, Spain, February 2003, pp. 230–239. Springer, Heidelberg (2003)
Berman, F., Hey, A., Fox, G.: Grid Computing: Making The Global Infrastructure a Reality. Wiley Publishing House, Chichester (2003) ISBN: 0-470-85319-0
Fox, G.: Experience with distance education 1998-2003. Collection of resc., http://grids.ucs.indiana.edu/ptliupages/publications/disted/
Gerndt, M., Wismueller, R., Balaton, Z., et al.: Performance tools for the grid: State of the art and future. Technical report, APART WP3 (2004)
Network Working Group. eXternal Data Representation (XDR), IETF RFC (1832) (August 1995)
Iosup, A., Vialle, S.: Mobile robot navigation and self-localization system: Parallel and distributed experiments. In: The Dagstuhl Workshop on Plan-Based Control of Robotic Agents, Dagstuhl, Germany (June 2003)
Kacsuk, P.: Parallel program development and execution in the grid. In: IEEE International. PARELEC 2002, Warszaw, Poland, September 2002, pp. 131–141 (2002)
Krauter, K., Buyya, R., Maheswaran, M.: A taxonomy and survey of Grid resource management systems for distributed computing. Software Practice and Experience 32(2), 135–164 (2002)
Massie, M., Chun, B., Culler, D.: The Ganglia distributed monitoring system: Design, implementation, and experience. Parallel Computing 30(7), 817–840 (2004)
Nemeth, Z.s., Gombas, G., Balaton, Z.: Performance evaluation on Grids: Directions, issues, and open problems. In: Proceedings of the 12th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2004), A Coruna, Spain, February 2004, pp. 290–297. IEEE Computer Society Press, Los Alamitos (2004)
Newman, H.B., Legrand, I.C., Galvez, P., Voicu, R., Cirstoiu, C.: MonALISA: A distributed monitoring service architecture. In: CHEP 2003, La Jola, California (March 2003)
Sabatier, F., De Vivo, A., Vialle, S.: Grid programming for distributed remote robot control. In: International Workshops on Enabling Technologies: Infrastructures for Collaborative Enterprises (WETICE-2004) (June 2004)(to appear)
Stallings, W.: SNMP, SNMPv2, SNMPv3, and RMON 1 and 2, 3rd edn. Wiley Publishing House, Chichester (1998) ISBN: 0-201-48534-6
Stoica, I., Morris, R., Liben-Nowell, D., Karger, D., Kaashoek, M.F.: CHORD: A scalable peer-to-peer lookup protocol for Internet applications. IEEE/ACM Transactions on Networking 11, 17–32 (2003)
Tapus, N., Cristea, V., Burcea, M., Staicu, V.: RoGrid towards a Romanian computational Grid. In: Proceedings of the 14th International Conference on Control Systems and Computer Science (CSCS14), Romania (July 2004)
Tapus, N., Slusanschi, E., Popescu, T.: Distributed rendering engine. In: Grigoras, D., Nicolau, A., Toursel, B., Folliot, B. (eds.) IWCC 2001. LNCS, vol. 2326, pp. 207–215. Springer, Heidelberg (2002)
van Renesse, R., Birman, K., Vogels, W.: Astrolabe: A robust and scalable technology for distributed system monitoring, management, and data mining. ACM Transactions on Computer Systems 21(2), 164–206 (2003)
Vetter, J.S., Reed, D.A.: Real-time performance monitoring, adaptive control, and interctive steering of computational grids. The International Journal of High-Performance Computing Applications 14, 357–366 (Winter 2000)
Wolski, R.: Experiences with predicting resource performance on-line in computational grid settings. ACM SIGMETRICS Performance Evaluation Review 30(4), 41–49 (2003)
Yu, H., Vahdat, A.: Design and evaluation of a conit-based continuous consistency model for replicated services. ACM Transactions on Computer Systems 20(3), 239–282 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Iosup, A., Ţãpuş, N., Vialle, S. (2005). A Monitoring Architecture for Control Grids. In: Sloot, P.M.A., Hoekstra, A.G., Priol, T., Reinefeld, A., Bubak, M. (eds) Advances in Grid Computing - EGC 2005. EGC 2005. Lecture Notes in Computer Science, vol 3470. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508380_94
Download citation
DOI: https://doi.org/10.1007/11508380_94
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26918-2
Online ISBN: 978-3-540-32036-4
eBook Packages: Computer ScienceComputer Science (R0)