Abstract
In Grid environments, high-performance applications have to take into account the available network performance between the individual sites. Existing monitoring tools like the Network Weather Service (NWS) measure bandwidth and latency of end-to-end network paths. This information is necessary but not sufficient. With more than two participating sites, simultaneous transmissions may collide with each other on shared links of the wide-area network. If this occurs, applications may obtain lower network performance than predicted by NWS. In this paper, we describe Topo Mon, a monitoring tool for Grid networks that augments NWS with additional sensors for the routes between the sites of a Grid environment. Our tool conforms to the Grid Monitoring Architecture (GMA) denned by the Global Grid Forum. It unites NWS performance and topology discovery in a single monitoring architecture. Our topology consumer process collects route information between the sites of a Grid environment and derives the overall topology for utilization by application programs and communication libraries. The topology can also be visualized for Grid application developers.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
R. K. Ahuja, T. L. Magnanti, and J. B. Orlin. Network Flows. Prentice Hall, 1993.
Active Measurement Project (AMP), http://amp.nlanr.net.
R. Aydt, D. Gunter, W. Smith, M. Swany, V. Taylor, B. Tierney, and R. Wolski. A Grid Monitoring Architecture. Global Grid Forum, Performance Working Group, Grid Working Document GWD-Perf-16-l, 2001.
P. Dinda, T. Gross, R. Karrer, B. Lowekamp, N. Miller, P. Steenkiste, and D. Sutherland. The Architecture of the Remos System. In IEEE Symposium on High Performance Distributed Computing (HPDC10), San Francisco, CA, 2001.
I. Foster and C. Kesselman. Globus: A Metacomputing Infrastructure Toolkit. Int. Journal of Supercomputer Applications, 11(2):115–128, 1997.
P. Francis, S. Jamin, C. Jin, Y. Jin, D. Raz, Y. Shavitt, and L. Zhang. IDMaps: A Global Internet Host Distance Estimation Service. IEEE/ACM Transactions on Networking, 2001.
E. R. Gansner, E. Koutsofios, S. C. North, and K.-P. Vo. A Technique for Drawing Directed Graphs. IEEE Trans, of Software Engineering, 19(3):214–230, 1993.
N. T. Karonis, B. R. de Supinski, I. Foster, W. Gropp, E. Lusk, and J. Bresnahan. Exploiting Hierarchy in Parallel Computer Networks to Optimize Collective Operation Performance. In International Parallel and Distributed, Processing Symposium (IPDPS 2000), pages 377–384, Cancun, Mexico, May 2000. IEEE.
T. Kielmann, H. E. Bal, S. Gorlatch, K. Verstoep, and R. F. H. Hofman. Network Performance-aware Collective Communication for Clustered Wide Area Systems. Parallel Computing, 27(11):1431–1456, 2001.
T. Kielmann, R. F. H. Hofman, H. E. Bal, A. Plaat, and R. A. F. Bhoedjang. Mag-PIe: MPI’s Collective Communication Operations for Clustered Wide Area Systems. In Symposium on Principles and Practice of Parallel Programming (PPoPP), pages 131–140, Atlanta, GA, May 1999.
B. A. Mah. pchar. http://www.caida.org/tools/utilities/others/pathchar/.
Pathrate. http://www.pathrate.org/.
V. Paxson, J. Mahdavi, A. Adams, and M. Mathis. An Architecture for Large-scale Internet Measurement. IEEE Communications, 1988.
A. Plaat, H. E. Bal, R. F. Hofman, and T. Kielmann. Sensitivity of Parallel Applications to Large Differences in Bandwidth and Latency in Two-Layer Interconnects. Future Generation Computer Systems, 17(6):769–782, 2001.
Y. Rekhter and T. Li. A Border Gateway Protocol 4 (BGP-4). IETF Network Working Group, RFC 1771, 1995. http://www.faqs.org/rfcs/rfcl771.html.
G. Shao, F. Berman, and R. Wolski. Using Effective Network Views to Promote Distributed Application Performance. In Parallel and Distributed Processing Techniques and Applications (PDPTA), 1999.
R. Siamwalla, R. Sharma, and S. Keshav. Discovering Internet Topology. In IEEE INFOCOM, 1999.
W. Smith, D. Gunter, and D. Quesnel. A Simple XML Producer-Consumer Protocol. Global Grid Forum, Performance Working Group, Grid Working Document GWD-Perf-8-2, 2001.
Sprobe. http://sprobe.cs.washington.edu/.
M. Swany and R. Wolski. Topology Discovery for the Network Weather Service. Submitted for publication.
R. Wolski, N. Spring, and J. Hayes. The Network Weather Service: A Distributed Resource Performance Forecasting Service for Metacomputing. Future Generation Computing Systems, 15(5–6):757–768, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
den Burger, M., Kielmann, T., Bal, H.E. (2002). TopoMon: A Monitoring Tool for Grid Network Topology. In: Sloot, P.M.A., Hoekstra, A.G., Tan, C.J.K., Dongarra, J.J. (eds) Computational Science — ICCS 2002. ICCS 2002. Lecture Notes in Computer Science, vol 2330. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46080-2_58
Download citation
DOI: https://doi.org/10.1007/3-540-46080-2_58
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43593-8
Online ISBN: 978-3-540-46080-0
eBook Packages: Springer Book Archive