Skip to main content
Log in

Adaptive resource utilization and remote access capabilities in high‐performance distributed systems: The Open HPC++ approach

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

Development of high-performance distributed applications, called metaapplications, is extremely challenging because of their complex runtime environment coupled with their requirements of high-performance and Quality of Service (QoS). Such applications typically run on a set of heterogeneous machines with dynamically varying loads, connected by heterogeneous networks possibly supporting a wide variety of communication protocols. In spite of the size and complexity of such applications, they must provide the high-performance and QoS mandated by their users. In order to achieve the goal of high-performance, they need to adaptively utilize their computational and communication resources. Apart from the requirements of adaptive resource utilization, such applications have a third kind of requirement related to remote access QoS. Different clients, although accessing a single server resource, may have differing QoS requirements from their remote connections. A single server resource may also need to provide different QoS for different clients, depending on various issues such as the amount of trust between the server and a given client. These QoS requirements can be encapsulated under the abstraction of remote access capabilities. Metaapplications need to address all the above three requirements in order to achieve the goal of high-performance and satisfy user expectations of QoS. This paper presents Open HPC++, a programming environment for high-performance applications running in a complex and heterogeneous run-time environment. Open HPC++ provides application level tools and mechanisms to satisfy application requirements of adaptive resource utilization and remote access capabilities. Open HPC++ is designed on the lines of CORBA and uses an Object Request Broker (ORB) to support seamless communication between distributed application components. In order to provide adaptive utilization of communication resources, it uses the principle of open implementation to open up the communication mechanisms of its ORB. By virtue of its open architecture, the ORB supports multiple, possibly custom, communication protocols, along with automatic and user controlled protocol selection at run-time. An extension of the same mechanism is used to support the concept of remote access capabilities. In order to support adaptive utilization of computational resources, Open HPC++ also provides a flexible yet powerful set of load-balancing mechanisms that can be used to implement custom load-balancing strategies. The paper also presents performance evaluations of Open HPC++ adaptivity and load-balancing mechanisms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. J. Arabe, A. Beguelin, B. Lowekamp, E. Seligman, M. Starkey and P. Stephan, Dome: Parallel programming in a heterogeneous multi-user environment, Technical Report CMU-CS-95-13730786, Computer Science Department, Carnegie Mellon University (April 1995).

  2. R. Blumofe and D. Park, Scheduling large scale parallel computations on NOWs, in: Proceedings of the Third International Symposium on HPDC (August 1994) pp. 96-105.

  3. R. Bramley, D. Gannon, T. Stuckey, J. Villacis, E. Akman, J. Balasubramanian, F. Breg, S. Diwan and M. Govindaraju, The linear system analyzer, Technical Report TR-511, Department of Computer Science, Indiana University (June 1998).

  4. F. Breg and D. Gannon, Compiler support for an RMI implementation using NexusJava, Technical Report TR500, Computer Science Department, Indiana University (September 1997).

  5. K. Dincer and G. Fox, Building a world-wide virtual machine based on web and HPCC technologies, in: Proceedings of Supercomputing '96 (November 1996).

  6. S. Diwan and D. Gannon, Adaptive utilization of communication and computational resources in high-performance distributed systems: The EMOP approach, in: Proceedings of the IEEE International Symposium on High Performance Distributed Computing (HPDC) (July 1998) pp. 2-9.

  7. S. Diwan and D. Gannon, A capabilities based communication model for high-performance distributed applications: The Open HPC++ approach, in: Proceedings of IPPS/SPDP 1999 (April 1999).

  8. R. Filman, Injecting Ilities, in: Proc. of the Workshop on Aspect Oriented Programming, ICSE-20, Kyoto, Japan (April 1998).

  9. I. Foster and C. Kesselman, Globus: A metacomputing infrastructure toolkit, The International Journal of Supercomputing Applications and High Performance Computing 11(2) (1997) 115-128.

    Google Scholar 

  10. I. Foster, C. Kesselman and S. Tuecke, Multimethod communication for high-performance metacomputing applications, in: Proceedings of Supercomputing '96 (November 1996).

  11. D. Gannon, P. Beckman, E. Johnson and T. Green, HPC++ and the HPC++Lib toolkit, in: Compilation Issues on Distributed Memory Systems (Springer-Verlag, 1997) chapter 3.

  12. A. Grimshaw, W. Wulf, J. French, A. Weaver and P. Reynolds, Jr., A synopsis of the Legion project, Technical Report CS-94-20, Department of Computer Science, University of Virginia (1994).

  13. INPRISE Corporation, Inc., VisiBroker for C++ 3.3: Programmer's Guide, available via WWW at URL: http://www.inprise.com/techpubs/books/visibroker/vbcpp33/index.html (1998).

  14. K. Keahey and D. Gannon, PARDIS: CORBA-based architecture for application-level parallel distributed computation, in: Proceedings of Supercomputing '97 (1997).

  15. G. Kiczales, Open implementation home page, available via WWW at URL: http://www.parc.xerox.com/spl/projects/oi/ (May 1997).

  16. Microelectronics and Computer Technology Corporation, The Object Infrastructure Project, available via WWW at URL: http://www.mcc.com/projects/oip/.

  17. Object Management Group, CORBA/IIOP 2.2 Specification (February 1998).

  18. R. Sessions, COM and DCOM: Microsoft's Vision for Distributed Objects (Wiley, New York, 1997).

    Google Scholar 

  19. H. Topcuoglu, S. Hariri, D. Kim, Y. Kim, X. Bing, B. Ye, I. Ra and J. Valente, The design and evaluation of a virtual distributed computing environment, Cluster Computing 1(1) (1998) 81-93.

    Article  Google Scholar 

  20. M. Zaki, W. Li and S. Parthasarathy, Customized dynamic load balancing for NOW, Technical Report TR602, Computer Science Department, University of Rochester (December 1995).

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Diwan, S., Gannon, D. Adaptive resource utilization and remote access capabilities in high‐performance distributed systems: The Open HPC++ approach. Cluster Computing 3, 1–14 (2000). https://doi.org/10.1023/A:1019055531388

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1019055531388

Keywords

Navigation