Abstract
Grid computing environments are distributed systems composed by heterogeneous and geographically distributed resources. This type of systems mainly emerged to satisfy the increasing computing power demand within the scientific community. Despite the advantages of such paradigm, there are still several challenges related to the discovery, monitoring and selection of grid resources. Moreover, the dynamic nature and changing characteristics of such environments worsen the applications performance. Thus, improving their efficiency is a fundamental issue. The present contribution analyses two self-adaptive solutions focused on enhancing the grid resource selection process by using resources in an efficient way. On the one hand, the Efficient Resources Selection model which is defined from the user’s point of view (it avoids controlling or modifying the infrastructure) and it is based on the Scatter Search method for achieving a suitable selection of resources. On the other hand, Montera2, a framework designed for addressing an efficient execution of distributed applications on the grid; it defines and employs a dynamic scheduling algorithm to determine the size and number of tasks to be executed. Both approaches have been tested on a real European infrastructure belonging to the well-known European Grid Infrastructure (EGI) project. The study also compares both solutions with the standard scheduling technique that governs this infrastructure, the gLite WMS scheduler, showing a much better performance by reducing the final makespan by a factor of 20 if compared to the gLite WMS scheduler. An analysis of task and time overheads for both approaches is also included. Furthermore, comparisons with many other solutions proposed in the literature are presented, showing the advantages of our approaches.







Similar content being viewed by others
Notes
The Computing Element, CE, is a set of services located in every grid site that provides access for Grid jobs to a local resource management system.
When the performance of the infrastructure gets worse there are very few actions that users can carry out to mitigate it.
The Information System, IS, records information about both the status of resources and tasks.
References
Foster, I.: What is the grid? A three point checklist. GRIDtoday 1(6), 22–25 (2002)
Berman, F., et al.: Adaptive computing on the grid using AppLeS. IEEE Trans. Parallel Distrib. Syst. 14(4), 369–382 (2003)
Vadhiyar, S.S., Dongarra, J.J.: Self adaptivity in grid computing. Concurr. Comput. Pract. Exp. 17(2–4), 235–257 (2005)
Huedo, E., Montero, R.S., Llorente, I.M.: A framework for adaptive execution in grids. Softw. Pract. Exp. 34(7), 631–651 (2004)
Keung, H.N.L.C., Dyson, J.R.D., Jarvis, S.A., Nudd, G.R.: Self-adaptive and self-optimising resource monitoring for dynamic grid environments. In: DEXA’04 Proceedings of the Database and Expert Systems Applications, 15th International Workshop, IEEE Computer Society, Zaragoza, Spain, pp. 689–693 (2004)
Gao, Y., Rong, H., Huang, J.Z.: Adaptive grid job scheduling with genetic algorithms. Future Gener. Comput. Syst. 21(1), 151–161 (2005)
Xhafa, F., Abraham, A.: Metaheuristics for scheduling in distributed computing environments. Stud. Comput. Intell. 146, 1–37 (2008)
Xhafa, F., Abraham, A.: Computational models and heuristic methods for grid scheduling problems. Future Gener. Comput. Syst. 26(4), 608–621 (2010)
Yu, J., Buyya, R., Ramamohanarao, K.: Workflow scheduling algorithms for grid computing. Metaheuristics Sched. Distrib. Comput. Environ. 146, 173–214 (2008)
Yu, J., Buyya, R.: A taxonomy of workflow management systems for grid computing. J. Grid Comput. 3(3–4), 171–200 (2006)
Rahman, M., Ranjan, R., Buyya, R., Benatallah, B.: A taxonomy and survey on autonomic management of applications in grid computing environments. Concurr. Comput. Pract. Exp. 23(16), 1990–2019 (2011)
Deelman, E., Singh, G., Su, M., Blythe, J.: Pegasus: a framework for mapping complex scientific workflows onto distributed systems. Sci. Program. 13(3), 219–237 (2005)
Cameron, D., Gholami, A., Karpenko, D., Konstantinov, A.: Adaptive data management in the ARC grid middleware. J. Phys. Conf. Ser. 331, 1990–2018 (2011)
Batista, D.M., Da Fonseca, L.S.: A survey of self-adaptive grids. IEEE Commun. Mag. 48(7), 94–100 (2010)
Laguna, M., Martí, R.: Scatter Search. Metaheuristic Procedures for Training Neural Networks. Springer, Berlin (2006)
Resende, M., Ribeiro, C., Glover, F., Martí, R.: Scatter Search and Path Relinking: Fundamentals, Advances and Applications. Handbook of Metaheuristics. Springer, Berlin (2009)
Botón-Fernández, M., Vega-Rodríguez, M.A., Prieto Castrillo, F.: A self-adaptive selection model based on the scatter search for grid applications, computer aided systems theory. IUCTC. Universidad de Las Palmas de Gran Canaria, Las Palmas de Gran Canaria, Spain, pp. 233–235 (2013)
Rodríguez-Pascual, M., Martín Llorente, I., Mayo-García, R.: Montera: a framework for the efficient executions of Monte Carlo codes on the Grid. Comput. Inform. 32, 113–144 (2013)
Vazquez-Poletti, J.L., Huedo, E., Montero, R.S., Martín Llorente, I.: CD-HIT workflow execution on grids using replication heuristics. In: 8th IEEE International Symposium on Cluster Computing and the Grid, pp. 735–740 (2008)
Herrera Sanz, J., et al.: Loosely-coupled loop scheduling in computational grids. In: 20th Internationa Parallel and Distributed Processing Symposium, 6 (2006)
Groen, D., Harftst, S., Zwart, S.P.: On the origin of grid species: the living application. In: Proceedings of the 9th International Conference on Computational Science: Part I, LNCS, vol. 5544, Springer, Berlin, Heidelberg, pp. 205–212 (2009)
Murugavel, S.S., Vadhiyar, S.S., Nanjundiah, R.S.: Adaptive executions of multi-physics coupled applications on batch grids. J. Grid Comput. 9(4), 455–478 (2011)
Shin, W.C., Yang, C.-T., Tseng, S.-S.: Using a performance-based skeleton to implement divisible load applications on grid computing environments. J. Inf. Sci. Eng. 25(1), 59–81 (2009)
González-Vélez, H., Cole, M.: Adaptive structured parallelism for distributed heterogeneous architectures: a methodological approach with pipelines and farms. Concurr. Comput. Pract. Exp. 22, 2073–2094 (2010)
Kretsis, A., Kokkinos, P., Varvarigos, E.A.: Implementing and evaluating scheduling policies in gLite middleware. Concurr. Comput. Pract. Exp. 25(3), 349366 (2013)
Hirales-Carbajal, A., et al.: Multiple workflow scheduling strategies with user run time estimates on a grid. J. Grid Comput. 10(2), 325–346 (2012)
Ludascher, B et al.: Kepler: An extensible system for design and execution of scientific workflows. In: Proceedings of the 16th International Conference on Scientific and Statistical Database Management 2004, pp. 423–424 (2004)
Quezada-Pina, A., et al.: Adaptive parallel job scheduling with resource admissible allocation on two-level hierarchical grids. Future Gener. Comput. Syst. 28(7), 1–12 (2012)
Leal, K., Huedo, E.: A decentralized model for scheduling independent tasks in federated grids. Future Gener. Comput. Syst. 25(8), 840–852 (2009)
Li, Y., Yang, Y., Ma, M., Zhou, L.: A hybrid load balancing strategy of sequential tasks for grid computing environments. Future Gener. Comput. Syst. 25(8), 819–828 (2009)
Li, Y., Mascagni, M.: Grid-based Monte Carlo application. In: Grid, pp. 13–24 (2002)
Díaz, J., Reyes, S., Niño, A., Muñoz-Caro, C.: Derivation of self-scheduling algorithms for heterogeneous distributed computer systems: application to internet-based grids of computers. Future Gener. Comput. Syst. 25(6), 617–626 (2009)
Herrera Sanz, J.: Modelo de Programación Para Infraestructuras Grid Computacionales. Ph.D. Thesis. Universidad Complutense de Madrid, Madrid (2009)
Netto, M.A.S., Buyya, R.: Coordinated rescheduling of bag-of-tasks for executions on multiple resource providers. Concurr. Comput. Pract. Exp. 24(12), 1362–1376 (2011)
Camarasu-Pop, S., et al.: Monte Carlo simulation on heterogeneous distributed systems: a computing framework with parallel merging and checkpointing strategies. Future Gener. Comput. Syst. 29(3), 728–738 (2013)
Mościcki, J.T., Lamanna, M., Bubak, M., Sloot, P.M.A.: Processing moldable tasks on the grid: late job binding with lightweight user-level overlay. Future Gener. Comput. Syst. 27(6), 725–736 (2011)
Sim, J., Garrochinho, T., Veiga, L.: A checkpointing-enabled and resource-aware Java Virtual Machine for efficient and robust e-Science applications in grid environments. Concurr. Comput. Pract. Exp. 24(13), 1421–1442 (2012)
Hsu, C.-C., Huang, K.-C., Wang, F.-J.: Online scheduling of workflow applications in grid environments. Future Gener. Comput. Syst. 27(6), 860–870 (2011). doi:10.1016/j.future.2010.10.015
Trinder, P.W., et al.: Resource analyses for parallel and distributed coordination. Concurr. Comput. Pract. Exp. 25(3), 309–348 (2011)
Tao, Y., et al.: Dependable grid workflow scheduling based on resource availability. J. Grid Comput. 11(1), 47–61 (2012)
Pinel, F., Pecero, J.E., Bouvry, P., Khan, S.U.: A review on task performance prediction in multi-core based systems, pp. 615–620. Presented at the Proceedings of the 2011 IEEE 11th International Conference on Computer and Information Technology, Washington, DC, USA: IEEE Computer Society (2011)
Alfaro, M.E., Huelsenbeck, J.P.: Comparative performance of Bayesian and AIC-based measures of phylogenetic model uncertainty. Syst. Biol. 55(1), 89–96 (2006)
Andreo, P.: Monte Carlo techniques in medical radiation physics. Phys. Med. Biol. 36, 861 (1991)
Lemarinier, P., Bouteiller, A., Herault, T., Krawezik, G., Cappello, F.: Improved message logging versus improved coordinated checkpointing for fault tolerant MPI, pp. 115–124. Presented at the CLUSTER ’04: Proceedings of the 2004 IEEE International Conference on Cluster Computing, IEEE Computer Society (2004)
Coti, C., et al.: Blocking vs. non-blocking coordinated checkpointing for large-scale fault tolerant MPI, p. 18. Presented at the SC ’06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing, ACM (2006)
Bouteiller, A., Lemarinier, P., Krawezik, K., Capello, F.: Coordinated checkpoint versus message log for fault tolerant MPI, pp. 242–250. Presented at the Cluster Computing, 2003. Proceedings. 2003 IEEE International Conference on (2003)
Caron, E., Garonne, V., Tsaregorodtsev, A.: Evaluation of meta-scheduler architectures and task assignment policies for high throughput computing. Technical Report 2005–27, Ecole Normale Suprieure de Lyon, France (2005)
Banino, C., et al.: Scheduling strategies for master-slave tasking on heterogeneous processor platforms. IEEE Trans. Parallel Distrib. Syst. 15(4), 319–330 (2004)
Vazquez-Poletti, J.L., Huedo, E., Montero, R.S.: A comparison between two grid scheduling philosophies: EGEE WMS and Grid Way. Multiagent Grid Syst. 3(4), 429–439 (2007)
Bosa, K., Schreiner, W.: Austrian Grid: Report on Experiments with Globus 4 and gLite, Technical Report, Johannes Kepler University, Linz, Austria (2008)
Xhafa, F., Abraham, A.: Meta-heuristics for grid scheduling problems. In: Metaheuristics for Scheduling in Distributed Computing Environments, pp. 1–37 (2008)
Dong, F., Akl, S.G.: Scheduling Algorithms for Grid Computing: State of the Art and Open Problems. Technical Report 2006–504, Queen’s University, Kingston, Ontario, USA (2006)
Jha, S., Cole, M., Katz, D.S., Parashar, M., Rana, O., Weissman, J.: Distributed computing practice for large-scale science and engineering applications. Concurr. Comput. Pract. Exp. 25(11), 1559–1585 (2013)
Chtepen, M., Dhoedt, B., Cleays, F., Vanrolleghem, P.: Evaluation of replication and rescheduling heuristics for grid systems with varying resource availability, pp. 622–627. Presented at the Proceedings of the 18th IASTED international conference on parallel and distributed computing and systems (2006)
Yu, C., Marinescu, D.C.: Algorithms for divisible load scheduling of data-intensive applications. J. Grid Comput. 8(1), 133–155 (2010). doi:10.1007/s10723-009-9129-0
Olivier, S., Porterfield, A., Wheeler, K., Spiegel, M.: OpenMP task scheduling strategies for multicore NUMA systems. Int. J. High Perform. Comput. Appl. 26(2), 110–124 (2012)
Ramírez-Alcaraz, A., et al.: Job allocation strategies with user run time estimates for online scheduling in hierarchical grids. J. Grid Comput. 9(1), 95–116 (2011)
Rodríguez-Pascual, M., Guasp, J., Castejón Magaña, F., Rubio-Montero, A.J., Martín Llorente, I., Mayo-García, R.: Improvements on the Fusion Code FAFNER2. IEEE Trans. Plasma Sci. 38, 21022110 (2010). doi:10.1109/TPS.2010.2057450
Juve, G., et al.: Comparing futuregrid, amazon ec2, and open science grid for scientific workflows. Comput. Sci. Eng. 15(4), 20–29 (2013)
Wrzesinska, G., Maasen, J., Bal, H.E.: Self-adaptive applications on the grid. In: 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, San Jose, CA, USA, pp. 121–129 (2007)
Curnow, H.J., Wichmann, B.A.: A syntetic Benchmark. Comput. J. 19(1), 43–49 (1976)
Hockney, R.W., Jesshope, C.R.: Parallel Computers Two: Architecture, Programming and Algorithms. Taylor & Francis, New York (1988)
Montero, R.S., Huedo, E., Martín Llorente, I.: Benchmarking of high throughput computing applications on grids. Parallel Comput. 32(4), 267–279 (2006)
Trer, P., Domagalski, P.: Standardization of an API for distributed resource management systems. In: Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), pp. 619–626 (2007)
Vázquez, C., Huedo, E., Montero, R.S.: Federation of TeraGrid, EGEE and OSG infrastructures through a metascheduler. Future Gener. Comput. Syst. 26(7), 979–985 (2010)
Peris, A.D., Hernndez, J., Huedo, E., Martín Llorente, I.: Data location-aware job scheduling in the grid. Application to the GridWay metascheduler. J. Phys. Conf. Ser. 219, 62043 (2010)
Acknowledgments
Antonio Juan Rubio-Montero, from CIEMAT, was a great support on the creation and debugging of Montera2 and the administration of the local resources employed. Results obtained in this paper were computed on the European Grid Infrastructure (http://www.egi.eu) and its European Commission co-funded project EGI-InSPIRE (RI-261323). The authors thank the European Grid Infrastructure and supporting National Grid Initiatives for providing the technical support, computing and storage facilities. Authors have also count on the support from COST Action BETTY (IC1201). Last but not least, the authors would also like to acknowledge the support of the European Funds for Regional Development.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Botón-Fernández, M., Rodríguez-Pascual, M., Vega-Rodríguez, M.A. et al. A Comparative Analysis of Adaptive Solutions for Grid Environments. Int J Parallel Prog 43, 786–811 (2015). https://doi.org/10.1007/s10766-014-0342-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10766-014-0342-5