Abstract
This article is devoted to the redistribution of arrays that are distributed in a GEN_BLOCK fashion over a processor grid. While GEN_BLOCK redistribution is essential for load balancing, prior research about redistribution has been focused on block-cyclic redistribution. The proposed scheduling algorithm exploits a spatial locality in message passing from a seemingly irregular array redistribution. The algorithm attempts to obtain near optimal scheduling by trying to minimize communication step size and the number of steps. According to experiments on CRAY T3E and IBM SP2, the algorithm shows good performance in typical distributed memory machines.
Similar content being viewed by others
References
Y.-C. Chung, C.-H. Hsu, and S.-W. Bai. A basic-cyclic calculation technique for efficient dynamic data redistribution. IEEE Transaction on Parallel and Distributed Systems, 9(4), 1998.
E. G. Coffman. Computer and Job-Shop Scheduling Theory. John Wiley & Sons, NY, 1976.
F. Desprez, J. Dongarra, A. Petitet, C. Randriamaro, and Y. Robert. Scheduling block-cyclic array redistribution. IEEE Transactions on Parallel and Distributed Systems, 9(2):192–205, 1998, also presented in CRPC-TR97714–S, February 1997.
J. Duato, S. Yalamanchili, and L. Ni. Interconnection Networks, pp. 155. IEEE Computer Society Press, 1997.
M. Guo, I. Nakata, and Y. Yamashita. Contention-free communication scheduling for array redistribution. In Proceedings of the International Conference on Parallel and Distributed Systems, pp. 658–667, December 1998 and also presented at Parallel Computing, 26(10), 2000.
S. K. S. Gupta, S. D. Kaushik, C.-H. Huang, and P. Sadayappan. Compiling array expressions for ef.cient execution on distributed-memory machines. Technical report OSU-CISRC-4, 1994, revised March 1995.
High Performance Fortran Forum. High Performance Fortran Language Speci.cation version 2.0. Rice University, Houston, TX, October 1996.
High Performance Fortran Forum. HPF-2Scope of Activities and Motivating Applications, November 1994.
C.-H. Hsu, S.-W. Bai, and Y.-C. Chung. A generalized basic-cycle calculation method for efficient array redistribution. IEEE Transaction on Parallel and Distributed Systems, 11(12):1201–1215, 2000.
E. T. Kalns and L. M. Ni. Processor mapping techniques toward efficient data redistribution. In Proceedings of the 8th International Parallel Processing Symposium, Cancun, Mexico April 1994.
S. D. Kaushik, C.-H. Huang, J. Ramanujam, and P. Sadayappan. Multi-phase array redistribution: Modeling and evaluation. In Proceedings of the 9th International Parallel Processing Symposium, pp. 441–445, April 1995.
Y. W. Lim, P. B. Bhat, and V. K. Prasanna. Efficient algorithm for block-cyclic redistribution of arrays. In IEEE Symposium on Parallel and Distributed Process, October 1996, and will be published in Algorithmica.
Message Passing Interface Forum. MPI: A Message-Passing Interface Standard. University of Tennessee, Knoxville, TN, May 1994.
G. F. Pfister and V. A. Norton. Hot spot contention and combining in multistage interconnection networks. IEEE Transactions on Computers, 34:943–948, 1985.
J. M. Stichnoth, D. O'Hallaron, and T. R. Gross. Generating communication for array statements: Design, implementation, and evaluation. Journal of Parallel Distributed Computing, 21(1):150–159, 1994.
R. Thakur, A. Choudhary, and G. Fox. Runtime array redistribution in HPF programs. In Proceedings of SHPCC'94, pp. 309–316, 1994.
R. Thakur, A. Choudhary, and J. Ramanujam. Efficient algorithms for array redistribution. IEEE Transactions on Parallel and Distributed Systems, 7(6):587–594, 1996.
D. W. Walker and S. W. Otto. Redistribution of block-cyclic data distribution using MPI. Concurrency: Practice and Experience, 8(9):707–728, 1996.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Yook, HG., Park, MS. Scheduling GEN_BLOCK Array Redistribution. The Journal of Supercomputing 22, 251–267 (2002). https://doi.org/10.1023/A:1015313917431
Issue Date:
DOI: https://doi.org/10.1023/A:1015313917431