Possibilities of Optimal Execution of Parallel Programs Containing Simple and Iterated Loops on Heterogeneous Parallel Computational Systems with Distributed Memory

Avetisyan, A. I.; Gaisaryan, S. S.; Samovarov, O. I.

doi:10.1023/A:1013707600643

Possibilities of Optimal Execution of Parallel Programs Containing Simple and Iterated Loops on Heterogeneous Parallel Computational Systems with Distributed Memory

Published: January 2002

Volume 28, pages 28–40, (2002)
Cite this article

Programming and Computer Software Aims and scope Submit manuscript

A. I. Avetisyan¹,
S. S. Gaisaryan¹ &
O. I. Samovarov¹

38 Accesses
2 Citations
Explore all metrics

Abstract

The problem of load balancing when executing parallel programs on computational systems with distributed memory is currently of great interest. The most general statement of this problem is that for one parallel loop: execution of a heterogeneous loop on a heterogeneous computational system. When stated in this way, the problem is NP-complete even in the case of two nodes, and no acceptable heuristics for solving it are found. Since the development of heuristics is a rather complicated task, we decided to examine the problem by elementary methods in order to refine (and, possibly, simplify) the original problem statement. The results of our studies are discussed in this paper. Estimates of efficiency of parallel loop execution as functions of the number of nodes of homogeneous and heterogeneous parallel computational systems are obtained. These estimates show that the use of heterogeneous parallel systems reduces the efficiency even in the case when their communication subsystems are scaleable (see the definition in Section 4). The use of local networks (heterogeneous parallel computational systems with nonscaleable communication subsystems) for parallel computations with heavy data exchange is not advantageous and is possible only for a small number of nodes (about five). An algorithm of optimal distribution of data between the nodes of a homogeneous or heterogeneous computational system is suggested. Results of numerical experiments substantiate the conclusions obtained.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Efficiency Analysis of the Parallel Implementation of the SIMPLE Algorithm on Multiprocessor Computers

Article 01 December 2017

Application of Methods for Optimizing Parallel Algorithms for Solving Problems of Distributed Computing Systems

Model and Method for Optimizing Computational Processes in Parallel Computing Systems

Article 01 December 2019

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

REFERENCES

Schikuta, E. and Stockinger, H., Parallel Input/Output for Clusters: Methodologies and Systems, High Performance Cluster Computing, Buyya, R., Ed., New Jersey: Prentice Hall, 1999, vol. 1, p. 442.
Google Scholar
Avetisyan, A.I., Arapov, I.V., Gaissaryan, S.S, and Padaryan, V.A., ParJava Environment for Development of SPMD Programs for Homogeneous and Heterogeneous Networks JavaVM, Trans. Inst. System Programming RAS, 2000, vol. 2, pp. 27-48.
Google Scholar
Lastovetsky, A.L., Kalinov, A.Ya., Ledovskikh, I.N., Arapov, D.M., and Posypkin, N.A., A Language and Programming System for High-Performance Parallel Computations on Heterogeneous Networks, Program-mirovanie, 2000, vol. 26, no. 4, pp. 55-80.
Google Scholar
Avetisyan, A.I., Arapov, I.V., Gaisaryan, S.S., and Padaryan, V.A., The Environment for Development of Parallel Java Programs for Homogeneous and Heterogeneous Networks JavaVM, Proc. of All-Russian Sci. Conf. “High-Performance Computations and Their Applications,” Chernogolovka, 2000, pp. 46-50.
Garey, M. and Johnson, D.S., Computers and Intractability, San Francisco: Freeman, 1979. Translated under the title Vychislitel'nye mashiny i trudno reshaemye zadachi, Moscow: Mir, 1982.
Google Scholar
Kwok, Y. and Ahmad, I., Parallel Program Scheduling Techniques, High Performance Cluster Computing, Buyya, R., Ed., New Jersey: Prentice Hall, 1999, vol. 1, pp. 553-578.
Google Scholar
Kwok, Y. and Ahmad, I., Dynamic Critical-Path Scheduling: An Effective Technique for Allocating Task Graphs onto Multiprocessors, IEEE Trans. Parallel Distributed Systems, 1996, vol. 7, no. 5, pp. 506-621.
Google Scholar
Gasavant, T.L. and Kuhl, J.G., A Taxonomy of Scheduling in General-Purpose Distributed Computing Systems, IEEE Trans. Software Eng., 1998, vol. 14, no. 2, pp. 141-154.
Google Scholar
Zaki, J., Parthasarathy, S., and Weu, Li, Customized Dynamic Load Balancing, High Performance Cluster Computing, Buyya, R., Ed., New Jersey: Prentice Hall, 1999, vol. 1, pp. 579-603.
Google Scholar
Cortes, A., Ripoli, A., Senar, M.A., Cedo, F., and Luque, E., On the Stability of a Distributed Dynamic Load Balancing Algorithm, Proc. of the 1998 Int. Conf. on Parallel and Distributed Systems, Tainan, Taiwan, 1998, pp. 435-446.
Orlando, S. and Perego, R., A Template for Non-uniform Parallel Loops Based on Dynamic Scgeduling and Prefetching Techniques, Proc. of the 1996 ACM Int. Conf. on Supercomputing, 1996, Philadelphia.
Calder, B., Grunwald, D., Lindsay, D., Martin, J., Mozer, M., and Zorn, B., Corpus-based Static Branch Prediction, SIGPLAN Notices, 1995, no. 5, pp. 79-92.
Ortega, J.M., Introduction to Parallel and Vector Solution of Linear Systems, New York: Plenum, 1988. Translated under the title Vvedenie v parallel'nye i vectornye metody resheniya lineinykh sistem, Moscow: Mir, 1991.
Google Scholar
SRCC MSU Server, http://www.parallel.ru

Download references

Author information

Authors and Affiliations

Institute of System Programming, Russian Academy of Sciences, ul. Bol'shaya Kommunisticheskaya 25, Moscow, 109004, Russia
A. I. Avetisyan, S. S. Gaisaryan & O. I. Samovarov

Authors

A. I. Avetisyan
View author publications
You can also search for this author in PubMed Google Scholar
S. S. Gaisaryan
View author publications
You can also search for this author in PubMed Google Scholar
O. I. Samovarov
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Avetisyan, A.I., Gaisaryan, S.S. & Samovarov, O.I. Possibilities of Optimal Execution of Parallel Programs Containing Simple and Iterated Loops on Heterogeneous Parallel Computational Systems with Distributed Memory. Programming and Computer Software 28, 28–40 (2002). https://doi.org/10.1023/A:1013707600643

Download citation

Issue Date: January 2002
DOI: https://doi.org/10.1023/A:1013707600643

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Possibilities of Optimal Execution of Parallel Programs Containing Simple and Iterated Loops on Heterogeneous Parallel Computational Systems with Distributed Memory

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Efficiency Analysis of the Parallel Implementation of the SIMPLE Algorithm on Multiprocessor Computers

Application of Methods for Optimizing Parallel Algorithms for Solving Problems of Distributed Computing Systems

Model and Method for Optimizing Computational Processes in Parallel Computing Systems

REFERENCES

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Possibilities of Optimal Execution of Parallel Programs Containing Simple and Iterated Loops on Heterogeneous Parallel Computational Systems with Distributed Memory

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Efficiency Analysis of the Parallel Implementation of the SIMPLE Algorithm on Multiprocessor Computers

Application of Methods for Optimizing Parallel Algorithms for Solving Problems of Distributed Computing Systems

Model and Method for Optimizing Computational Processes in Parallel Computing Systems

Explore related subjects

REFERENCES

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation