Abstract
Irregular array redistribution has been paid attention recently since it can distribute different size of data segment to heterogeneous processors according to their computational ability. It’s also the reason why it has been kept an eye on load balance. High Performance Fortran Version 2 (HPF2) provides GEN_BLOCK distribution format which facilitates generalized block distributions. In this paper, we present a two-phase degree-reduction (TPDR) method for scheduling HPF2 irregular array redistribution. Using a bipartite communication graph, the first phase of TPDR schedules communication links adjacent to processors that with degree greater than two. A communication step will be scheduled follow each degree-reduction iteration. The second phase of TPDR schedules remaining messages of all processors that with degree-2 and degree-1 using an adjustable coloring mechanism. An extended algorithm based on TPDR is also presented in this paper. Effectiveness of the proposed methods not only avoids node contention but also shortens the overall communication cost. The proposed methods are also practicable due to low algorithmic complexity. To evaluate the performance of our methods, we have implemented both algorithms along with the divide-and-conquer algorithm and two scheduling mechanism. The simulation results show improvement of total communication costs.
Similar content being viewed by others
References
Bandera G, Zapata EL (1999) Sparse matrix block-cyclic redistribution. In: Proceeding of IEEE int’l parallel processing symposium (IPPS’99), San Juan, Puerto Rico, April 1999
Desprez F, Dongarra J, Petitet A (1998) Scheduling block-cyclic data redistribution. IEEE Trans Parallel Distrib Syst 9(2):192–205
Hsu C-H, Bai S-W, Chung Y-C, Yang C-S (2000) A generalized basic-cycle calculation method for efficient array redistribution. IEEE Trans Parallel Distrib Syst 11(12):1201–1216
Hsu C-H, Yang D-L, Chung Y-C, Dow C-R (2001) A generalized processor mapping technique for array redistribution. IEEE Trans Parallel Distrib Syst 12(7):743–757
Guo M (2003) Communication generation for irregular codes. J Supercomput 25(3):199–214
Guo M, Nakata I (2001) A framework for efficient array redistribution on distributed memory multicomputers. J Supercomput 20(3):243–265
Guo M, Nakata I, Yamashita Y (2000) Contention-free communication scheduling for array redistribution. Parallel Comput 26(8):1325–1343
Guo M, Nakata I, Yamashita Y (1997) An efficient data distribution technique for distributed memory parallel computers. In: JSPP’97, 1997, pp 189–196
Guo M, Pan Y, Liu Z (2003) Symbolic communication set generation for irregular parallel applications. J Supercomput 25:199–214
Kalns ET, Ni LM (1995) Processor mapping technique toward efficient data redistribution. IEEE Trans Parallel Distrib Syst 6(12):469–476
Kaushik SD, Huang CH, Ramanujam J, Sadayappan P (1995) Multiphase data redistribution: modeling and evaluation. In: Proceeding of IPPS’95, 1995, pp 441–445
Lee S, Yook H, Koo M, Park M (2001) Processor reordering algorithms toward efficient GEN_BLOCK redistribution. In: Proceedings of the ACM symposium on applied computing, 2001
Lim YW, Bhat PB, Prasanna VK (1999) Efficient algorithms for block-cyclic redistribution of arrays. Algorithm 24(3-4):298–330
Park N, Prasanna VK, Raghavendra CS (1999) Efficient algorithms for block-cyclic data redistribution between processor sets. IEEE Trans Parallel Distrib Syst 10(12):1217–1240
Petitet AP, Dongarra JJ (1999) Algorithmic redistribution methods for block-cyclic decompositions. IEEE Trans Parallel Distrib Syst 10(12):1201–1216
Prylli L, Touranchean B (1997) Fast runtime block cyclic data redistribution on multiprocessors. J Parallel Distrib Comput 45:63–72
Ramaswamy S, Simons B, Banerjee P (1996) Optimization for efficient data redistribution on distributed memory multicomputers. J Parallel Distrib Comput 38:217–228
Wakatani A, Wolfe M (1995) Optimization of data redistribution for distributed memory multicomputers. short communication. Parallel Comput 21(9):1485–1490
Wang H, Guo M, Wei D (2004) Divide-and-conquer algorithm for irregular redistributions in parallelizing compilers. J Supercomput 29(2):157–170
Wang H, Guo M, Wei D (2006) Message scheduling for irregular data redistribution in parallelizing compilers. IEICE Trans Inf Syst E89-D(2):418–424
Yook H-G, Park M-S (1999) Scheduling GEN_BLOCK array redistribution. In: Proceedings of the IASTED international conference parallel and distributed computing and systems, November, 1999
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Hsu, CH., Chen, SC. & Lan, CY. Scheduling contention-free irregular redistributions in parallelizing compilers. J Supercomput 40, 229–247 (2007). https://doi.org/10.1007/s11227-006-0024-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-006-0024-1