Abstract
We present a framework for a parallel programming model based on remote procedure calls that bridges large-scale computing resource pools managed by multiple Grid-enabled job scheduling systems. With this system, the user can exploit not only remote servers and clusters, but also the computing resources provided by Grid-enabled job scheduling systems at different sites. The framework requires that the Grid remote procedure call (RPC) system decouple the computation on a remote node from the Grid RPC mechanism, and it uses document-based rather than connection-based communication. We implemented the proposed framework as an extension of OmniRPC, a Grid RPC system for parallel programming. We designed a general interface that makes it easy to adapt OmniRPC to various Grid-enabled job scheduling systems, including XtremWeb, CyberGRIP, Condor and Grid Engine. We report preliminary performance results for these implementations using a phylogenetic application. We found that the proposed system achieves approximately the same performance as OmniRPC and can handle interruptions in worker programs on remote nodes.
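The idea of document-based communication behind a general scheduler interface can be illustrated with a minimal sketch. This is not the OmniRPC API: every name below (`SchedulerAdapter`, `InMemoryScheduler`, `call_async`, `wait`) and the JSON document layout are assumptions made for illustration. The client serializes a call into a request document, hands it to an adapter, and later polls for a result document, so no connection to the worker is held open.

```python
import json
import time
from abc import ABC, abstractmethod
from typing import Optional


class SchedulerAdapter(ABC):
    """Hypothetical general interface that each job scheduling system would implement."""

    @abstractmethod
    def submit(self, request_doc: str) -> str:
        """Submit a request document as a job and return a job id."""

    @abstractmethod
    def fetch(self, job_id: str) -> Optional[str]:
        """Return the result document, or None if the job has not finished."""


class InMemoryScheduler(SchedulerAdapter):
    """Stand-in for a real scheduler (assumption): runs the job when first polled."""

    def __init__(self, procedures):
        self._procedures = procedures  # procedure name -> callable
        self._jobs = {}                # job id -> request document

    def submit(self, request_doc: str) -> str:
        job_id = f"job-{len(self._jobs)}"
        self._jobs[job_id] = request_doc
        return job_id

    def fetch(self, job_id: str) -> Optional[str]:
        # The worker sees only the request document, never a live connection.
        request = json.loads(self._jobs[job_id])
        result = self._procedures[request["procedure"]](*request["args"])
        return json.dumps({"result": result})


def call_async(adapter: SchedulerAdapter, procedure: str, *args) -> str:
    """Encode a remote call as a document and submit it through the adapter."""
    return adapter.submit(json.dumps({"procedure": procedure, "args": list(args)}))


def wait(adapter: SchedulerAdapter, job_id: str, poll_interval: float = 0.01):
    """Poll until the result document is available, then decode it."""
    while True:
        doc = adapter.fetch(job_id)
        if doc is not None:
            return json.loads(doc)["result"]
        time.sleep(poll_interval)


scheduler = InMemoryScheduler({"add": lambda a, b: a + b})
job = call_async(scheduler, "add", 2, 3)
print(wait(scheduler, job))
```

Because the request is a self-contained document, a job whose worker is interrupted could in principle be resubmitted from the same document. How the actual system handles interruptions is not described in the abstract.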
Additional information
Yoshihiro Nakajima is a Research Fellow of the Japan Society for the Promotion of Science.
Cite this article
Nakajima, Y., Sato, M., Aida, Y. et al. Integrating Computing Resources on Multiple Grid-Enabled Job Scheduling Systems Through a Grid RPC System. J Grid Computing 6, 141–157 (2008). https://doi.org/10.1007/s10723-007-9082-8
DOI: https://doi.org/10.1007/s10723-007-9082-8