Abstract
Scheduling in large scale dynamic grids comprising eclectic collections of resources is increasingly difficult. Autonomous resource neighborhoods may wish to determine the level of grid offered load that they can or will accept; different sites may wish to attract different amounts of load, to satisfy some desired property within a grid economy. This changes the traditional notion of load sharing, which generally assumes that the desired equilibrium should be an equal distribution of load across all participating machines, because they are under the jurisdiction of a single site, and therefore more likely to implement one common policy. In large-scale grids, nodes and neighborhoods should instead get a portion of the load that best matches their local policies for supporting and admitting grid jobs. This article describes information dissemination protocols that can distribute load in this way, without using load rebalancing through job migration, which is more difficult and costly in large-scale heterogeneous grids. Essentially, nodes adjust their advertising rates and aggressiveness to influence where jobs get scheduled. We report experimental results with example resource configurations in which each resource neighborhood determines its ideal grid load and disseminates accordingly. In turn, each neighborhood attracts the requisite amount of resource requests from the grid. Moreover, performance does not degrade: overall query satisfaction rates are within 9% of both adaptive dissemination protocols that use static adaptation policies, and static dissemination protocols that may be custom-tailored to specific resource and load distributions.
Similar content being viewed by others
References
Abu-Ghazaleh N, Lewis MJ (2006) Short paper: toward self organizing grids. In: Proceedings of the 15th IEEE international symposium on high performance distributed computing (hot topics session)
Anderson DP (2004) Boinc: a system for public-resource computing and storage. In: GRID’04: proceedings of the fifth IEEE/ACM international workshop on grid computing (GRID’04), Washington, DC, USA, 2004. IEEE Computer Society, Los Alamitos, pp 4–10
Androutsellis-Theotokis S, Spinellis D (2004) A survey of peer-to-peer content distribution technologies. ACM Comput Surv 36(4):335–371. http://portal.acm.org/citation.cfm?id=1041681
Babaoglu O, Meling H, Montresor A (2002) Anthill: a framework for the development of agent-based peer-to-peer systems. In: Proceedings of the 22th international conference on distributed computing systems, Vienna, Austria, July 2002
Barak A, Laadan O (1998) The mosix multicomputer operating system for high performance cluster computing. Future Gener Comput Syst. http://www.mosix.cs.huji.ac.il
Barford P, Bestavros A, Bradley A, Crovella M (1998) Changes in web client access patterns: characteristics and caching implications. Technical report
Butt AR, Zhang R, Hu YC (2006) A self-organizing flock of condors. J Parallel Distrib Comput 66(1):145–161
Buyya R, Vazhkudai S (2001) Computer power market: towards a market-oriented grid. In: Proceedings of 1st IEEE international conference on cluster computing and the grid, CCGRID2001, May 2001
Buyya R, Abramson D, Giddy J (2000) Economy driven resource management architecture for computational power grids. In: Proceedings of the international conference on parallel and distributed processing techniques and applications (PDPTA2000), Las Vegas, USA
Cao J (2004) Self-organizing agents for grid load balancing. In: Proceedings of the fifth IEEE/ACM international workshop on grid computing (GRID’04), Pittsburgh, PA, November 2004
Casavant TL, Kuhl JG (1988) A taxonomy of scheduling in general-purpose distributed computing systems. IEEE Trans Softw Eng 14(2):141–154
Chiu D-M, Jain R (1989) Analysis of the increase and decrease algorithms for congestion avoidance in computer networks. Comput Netw ISDN Syst 17:1–14
Cowie J, Liu H, Liu J, Nicol D, Ogielski A (1999) Towards realistic million-node internet simulations. In: Proceedings of the international conference on parallel and distributed processing techniques and applications
Desai R, Tilak S, Gandhi B, Lewis MJ, Abu-Ghazaleh NB (2006) Analysis of query matching criteria and resource monitoring for grid application scheduling. In: Proceedings of CCGrid2006: IEEE international symposium on cluster computing and the grid
Eager DL, Lazowska ED, Zahorjan J (1986) A comparison of receiver-initiated and sender-initiated adaptive load sharing. Perform Eval 6(1):53–68
Epema DHJ, Livny M, van Dantzig R, Evers X, Pruyne J (1995) A worldwide flock of condors: load sharing among workstation clusters. Technical Report DUT-TWI-95-130, Delft, The Netherlands
Erdil DC (2007) Adaptive dissemination protocols for hybrid grid resource scheduling. PhD dissertation, dissertations & theses @ Binghamton University (SUNY), (AAT 3289113). http://www.proquest.com.proxy.binghamton.edu
Erdil DC, Lewis MJ, Abu-Ghazaleh NB (2006) An adaptive algorithm for information dissemination in self-organizing grids. In: Proceedings of the 2nd IEEE international conference on e-science and Grid Computing (eScience 2006), Amsterdam, the Netherlands, 4–6 December 2006
Erdil DC, Lewis MJ, Abu-Ghazaleh NB (2006) An adaptive approach to information dissemination in self-organizing grids. In: Proceedings of the international conference on autonomic and autonomous systems (ICAS’06), Silicon Valley, CA, July 2006
Erdil DC, Lewis MJ, Abu-Ghazaleh NB (2007) Proxy-based grid information dissemination. In: Proceedings of the workshop on large-scale and volatile desktop grids (PCGrid2007) (in conjunction with IPDPS2007), Long Beach, CA, March 2007
European Union EGEE Project Group. Enabling grids for e-science. http://www.eu-egee.org
Fedak G, Germain C, N’eri V, Cappello F (2001) Xtremweb: a generic global computing system. In: Proceedings of the IEEE international symposium on cluster computing and the grid (CCGRID’01), May 2001
Foster I, Kesselman C, Tuecke S (2001) The anatomy of the grid: enabling scalable virtual organizations. In: Lecture notes in computer science, vol 2150. Springer, Berlin
Iamnitchi A, Doraimani S, Garzoglio G (2006) Filecules in high-energy physics: characteristics and impact on resource management. In: Proceedings of the 15th IEEE international symposium on high performance distributed computing (HPDC-15), Paris, France, June 2006
Iyengar V, Tilak S, Abu-Ghazaleh NB, Lewis MJ (2004) Nonuniform information dissemination for dynamic grid resource discovery. In: Proceedings of IEEE NCA04: the 3rd IEEE international symposium on network computing and applications
Jung J, Krishnamurthy B, Rabinovich M (2002) Flash crowds and denial of service attacks: characterization and implications for cdns and web sites. In: Proceedings of the IEEE international world wide web conference
Kondo D, Taufer M, Brooks CL III, Casanova H, Chien AA (2004) Characterizing and evaluating desktop grids: an empirical study. In: Proceedings of the 18th international parallel and distributed processing symposium (IPDPS’04), April 2004
Kremien O, Kramer J (1992) Methodical analysis of adaptive load sharing algorithms. IEEE Trans Parallel Distrib Syst 3(6):747–760
Ledlie J, Seltzer M (2005) Distributed, secure load balancing with skew, heterogeneity, and churn. In: Proceedings of the 24th annual joint conference of the IEEE computer and communications societies (INFOCOM 2005), March 2005, vol 2, pp 1419–1430
Lenders VLC (2006) Field-based routing and its application to wireless ad hoc networks. PhD dissertation, dissertations & theses @ Swiss Federal Institute of Technology Zurich (Diss ETH No 16681). http://www.lenders.ch/publications/books/thesis.pdf
Li H, Groep D, Wolters L (2004) Workload characteristics of a multi-cluster supercomputer. In: Feitelson DG, Rudolph L, Schwiegelshohn U (eds) JSSPP. LNCS, vol 3277. Springer, Berlin, pp 176–194
Li L, Halpern J, Haas Z (2002) Gossip-based ad hoc routing. In: IEEE Infocom
Livny M, Melman M (1982) Load balancing in homogeneous broadcast distributed systems. In: Proceedings of the computer network performance symposium, New York, NY, USA. ACM, New York, pp 47–55
Lv Q, Cao P, Cohen E, Li K, Shenker S (2002) Search and replication in unstructured peer-to-peer networks. SIGMETRICS Performance Evaluation Review
Molina TE, Yang G, Lin AW, Peltier ST, Ellisman MH (2005) A generalized service-oriented architecture for remote control of scientific imaging instruments. In: E-SCIENCE’05: proceedings of the first international conference on e-science and grid computing, Washington, DC, USA. IEEE Computer Society, Los Alamitos, pp 550–556
Montresor A, Meling H, Babaoglu O (2002) Messor: load-balancing through a swarm of autonomous agents. Technical Report UBLCS-02-08, Department of Computer Science, University of Bologna
Silberstein M, Geiger D, Schuster A, Livny M (2006) Scheduling mixed workloads in multi-grids: the grid execution hierarchy. In: Proceedings of the 15th IEEE symposium on high performance distributed computing (HPDC-15), Paris, France, June 2006
Tcp congestion control. http://tools.ietf.org/html/rfc2581
TeraGrid Project Group. TeraGrid. http://www.teragrid.org
Wang Y-T, Morris R (1985) Load sharing in distributed systems. IEEE Trans Comput C-34(3):204–217
Wright D (2001) Cheap cycles from the desktop to the dedicated cluster: combining opportunistic and dedicated scheduling with Condor. In: Proceedings of the Linux clusters: the HPC revolution conference, Champaign, Urbana, IL, June 2001
Wrthimer D, Cobb J, Lebofsky M, Anderson D, Korpela E (2001) Seti@home: massively distributed computing for seti. Comput Sci Eng 3(1):78–83
Zegura E, Calvert K. GT internetwork topology models (GT-ITM). http://www.cc.gatech.edu/projects/gtitm
Zhang Q, Cherkasova L, Smirni E (2006) Flexsplit: a workload-aware, adaptive load balancing strategy for media clusters. In: Proceedings of SPIE, multimedia computing and networking, vol 6071
Zhou S (1988) A trace-driven simulation study of dynamic load balancing. IEEE Trans Softw Eng 14(9):1327–1341
Zhou D, Lo V (2006) Wavegrid: a scalable fast-turnaround heterogeneous peer-based desktop grid system. In: Proceedings of the 20th international parallel and distributed processing symposium (IPDPS’06)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Erdil, D.C., Lewis, M.J. Dynamic grid load sharing with adaptive dissemination protocols. J Supercomput 59, 1139–1166 (2012). https://doi.org/10.1007/s11227-010-0507-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-010-0507-y