Study on Parallel Computing

Chen, Guo-Liang; Sun, Guang-Zhong; Zhang, Yun-Quan; Mo, Ze-Yao

doi:10.1007/s11390-006-0665-9

Study on Parallel Computing

Architechture
Published: September 2006

Volume 21, pages 665–673, (2006)
Cite this article

Journal of Computer Science and Technology Aims and scope Submit manuscript

Guo-Liang Chen¹,
Guang-Zhong Sun¹,
Yun-Quan Zhang² &
…
Ze-Yao Mo³

489 Accesses
17 Citations
Explore all metrics

Abstract

In this paper, we present a general survey on parallel computing. The main contents include parallel computer system which is the hardware platform of parallel computing, parallel algorithm which is the theoretical base of parallel computing, parallel programming which is the software support of parallel computing. After that, we also introduce some parallel applications and enabling technologies. We argue that parallel computing research should form an integrated methodology of “architecture — algorithm — programming — application”. Only in this way, parallel computing research becomes continuous development and more realistic.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Parallel Environments

A Brief History of the Parallel Dawn in Karl-Marx-Stadt/Chemnitz

Tenth International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar’2012)

References

Chen G. Parallel Algorithm of Sorting and Selection. University of Science and Technology of China Press, 1990.
Chen G, Chen L. Computational Theory and Parallel Algorithms of VLSI. Univ. Science and Technology of China Press, 1991.
Tang C et al. Parallel Graph Algorithm. University of Science and Technology of China Press, 1991.
Chen G. Parallel Computing — Architecture, Algorithm, Programming. 2nd Edition, Higher Education Press, 2003.
Chen G, Wu J et al. Parallel Computer Architectures. Higher Education Press, 2002.
Chen G. Design and Analysis of Parallel Algorithms. 2nd Edition, Higher Education Press, 2002.
Chen G, An H et al. Parallel Algorithms Practice. Higher Education Press, 2003.
Blelloch G E, Maggs B M. Parallel algorithms. ACM Computing Surveys, 1996, 28(1): 51–54.
Article Google Scholar
Fortune S, Wyllie J C. Parallelism in random access machines. In Conference Record of the 10th Annual ACM Symp. Theory of Computing, San Diego, California, 1978, pp. 114–118.
Goldschlager L M. A universial interconnection pattern for parallel computers. J. the ACM, 1982, 29(4): 1073–1086.
Article MATH MathSciNet Google Scholar
Cole R, Zajicek O. APRAM: Incorporating asynchrony into the PRAM model. In Proc. 1st Annual ACM Symp. Parallel Algorithms and Architectures, Santa Fe, New Mexico, 1989, pp. 158–168.
Gibbons P, Matias Y, Ramachandran V. The QRQW PRAM: Accounting for contention in parallel algorithms. In Proc. the SPAA’94, Cape May, New Jersey, 1994, pp. 638–648.
Aggarwal A, Chandra A, Snir M. On communication latencies in PRAM computations. In Proc. SPAA’89, Santa Fe, New Mexico, 1989, pp. 11–21.
Valiant L. A bridging model for parallel computation. Communications of the ACM, 1990, 33: 103–111.
Article Google Scholar
Culler D, Karp R, Patterson D et al. LogP: Towards a realistic model of parallel computation. In Proc. ASPLOS IV, New York, 1993, pp. 1–12.
Aggarwal A, ALpern B, Chandra A, Snir M. A model for hierarchical memory. In Proc. the 19th Annual ACM Symp. Theory of Computing, Chicago, Illinois, USA, 1987, pp. 305–314.
Aggarwal A, ALpern B, Chandra A, Snir M. Hierarchical memory with block transfer. In Proc. of the 28th Annual IEEE Symp. Foundations of Computer Science, Los Angeles, CA, 1987, pp. 204–216.
Alpern B, Carter L, Feig E, Selker T. The uniform memory hierarchy model of computation. Algorithmica, 1993.
Vitter J, Shriver E. Algorithms for parallel memory II: Hierarchical multilevel memories. Technical Reports, CS–1993–02, Department of Computer Science, Duke University, 1993.
Li Z, Mills P H, Reif J H. Models and resource metrics for parallel and distributed computation. In the 28th Int. Conf. System Sciences (HICSS’95), Hawaii, USA, 1995, pp. 51–61.
Zhang Y. Performance optimizations on parallel numerical software package and study on memory complexity [Dissertation]. Institute of Software, CAS, 2000.
Zhang Y. DRAM(h): A parallel computation model for high performance numerical computing. Chinese Journal of Computers, 2003, 12(26): 1660–1670.
Google Scholar
Zhang Y, Sun J, Tang Z, Chi X. Memory complexity in high performance computing. In Proc. the 3rd Int. Conf. High Performance Computing in Asia-Pacific Region, Singapore, 1998, pp. 142–151.
Cameron K, Sun X H. Quantifying locality effect in data access delay: Memory log P. In Proc. the 2003 IEEE Int. Parallel and Distributed Processing Symp., Nice, France, 2003, pp. 212–219.
Gerasoulis A, Yang T. On the granularity and clustering of directed acyclic task graphs. IEEE Trans. Parallel and Distributed Systems, 1993, 4(6): 686–701.
Article Google Scholar
Shirazi B A, Hurson A, Kavi K. Scheduling and Load Balancing in Parallel and Distributed Systems. IEEE Computer Science Press, 1995.
Kwok Y, Ahmed I. Dynamic critical-path scheduling: An effective technique for allocating task graph to multiprocessors. IEEE Trans. Parallel and Distributed Systems, 1996, 7: 506–521.
Google Scholar
Topcuoglu H, Hariri S, Min-You W. Performance-effective and low-complexity task scheduling for heterogeneous computing. IEEE Trans. Parallel and Distributed Systems, 2002, 13(3): 260–274.
Article Google Scholar
Amdahl G M. Validity of the single-processor approach to achieving large scale computing capabilities. In AFIPS Conference Proc., Atlantic City, New Jersey, 1967, pp. 483–485.
Gustafson J L. Revaluating Amdahl’s law. Communications of the ACM, 1987, 31: 532–533.
Article Google Scholar
Grama A Y, Gupta A, Kumar V. Isoefficiency: Measuring the scalability of parallel algorithms and architectures. IEEE Parallel and Distributed Technology, 1993: 1(3), 12–21.
Article Google Scholar
Sun X, Rover D. Scalability of parallel algorithm-machine combinations. IEEE Trans. Parallel and Distributed System, 1994, 5(6): 599–613.
Article Google Scholar
Zhang X, Yan Y, He K. Latency metric: An experimental method for measuring and evaluating parallel program and architecture scalability. Journal of Parallel and Distributed Computing, 1994, 22(3): 392–410.
Article Google Scholar
Quinn M J. Parallel Programming in C with MPI and OpenMP. McGraw Hill, 2004.
http://www.llnl.gov/computing/tutorials/parallel_comp/
Yao Z, Zheng Q, Chen G. GOOMPI: A generic object oriented message passing interface. In Proc. NPC, 2004, pp. 261–271.
http://www.vcpc.univie.ac.at/information/mirror/HPFF/.
http://www-unix.mcs.anl.gov/mpi/.
http://www-unix.mcs.anl.gov/mpi/mpich/.
http://www.lam-mpi.org/.
http://www.co-array.org/.
http://upc.lbl.gov/.
http://www.mmm.ucar.edu/mm5/.
http://www.wrf-model.org/.
http://www.nas.nasa.gov/Software/NPB/.
http://www.netlib.org/linpack/.
http://www.samss.org.cn.
http://www.netlib.org/benchmark/hpl/.
http://icl.cs.utk.edu/hpcc/.
CFD, http://www.cfd-online.com/.
Ferziger J H, Peric M. Computational Methods for Fluid Dynamics. Springer-Verlag, 1999.
Thompson J F, Soni B K, Weaherill N P (eds.). Handbook of Grid Generation. CRC Press, Boca Raton, FL, 1999.
MATH Google Scholar
Rheinboldt W C. Methods for Solving Systems of Nonlinear Equations. Second Edition, SIAM, Philadelphia, 1998.
Saad Y. Iterative Methods for Sparse Linear Systems. Second Edition, SIAM, Philadelphia, 2003.
Teresco J D. Hierarchical partitioning and dynamic load balancing for scientific computation. In PARA’04 State-of-the-Art in Scientific Computing, Copenhagen, Denmark, 2004.
Schloegel K, Karypis G, Kumar V. Graph Partitioning for High Performance Scientific Simulations. Chapter 18, Sourcebook of Parallel Computing, Dongarra J, Foster I, Fox G et al. (eds.), New York: Morgan Kaufmann Publishers, 2003.
Meiron D, Deiterding R. Load balancing strategies for parallel SAMR algorithms. SURF 2005 technical report, Available at http://scdrm.caltech.edu/publications/cit-asci-tr, 2005.
Sagan H. Space-Filling Curves. New York: Springer-Verlag, 1994.
MATH Google Scholar
Mo Z, Zhang J, Cai Q. Dynamic load balancing for short-range parallel molecular dynamics simulations. Int. J. Computer Math., 2002, 79(2): 165–177.
Article MATH Google Scholar
Mo Z, Zhang B. Multilevel averaging weight method for dynamic load imbalance problems. Int. J. Computer Math., 2001, 76(4): 463–477.
MATH MathSciNet Google Scholar
Cao X, Mo Z. A new scalable parallel method for molecular dynamics based on Cell-Block data structure. In Proc. ISPA2004, Hong Kong, Cao J, Yang L T, Lau F (eds.), Lecture Notes in Computer Science, 2004, 3358: 757–764.
Bisseling R H. Parallel Scientific Computation: A Structured Approach Using BSP and MPI. Oxford University Press, 2004.
Knoll D A, Keyes D E. Jacobian-free NewtonKrylov methods: A survey of approaches and applications. Journal of Computational Physics (JCP), 2004, 193: 357–397.
Article MATH MathSciNet Google Scholar
Trottenberg U, Osterlee C W, Schuller A. Multigrid. Academic Press, 2001.
Mo Z, Shen L, Wittum G. Parallel adaptive multigrid algorithm for 2-D 3-T diffusion equations. Int. J. Computer Math., 2004, 81(3): 361–374.
Article MATH MathSciNet Google Scholar
Falgout R D, Jones J E, Yang U M. The Design and Implementation of Hypre, a Library of Parallel High Performance Preconditioners. Chapter in Numerical Solution of Partial Differential Equations on Parallel Computers, Bruaset A M, Bjørstad P, Tveito A (eds.), Springer-Verlag, to appear. Also available as LLNL Technical Report UCRL-JRNL-205459, 2004.
Balay S, Groppy W D, McInnes L C et al. PETSc 2.0 Users Manual. Technical Report ANL-95/11, Argonne National Laboratory, Argonne, IL, Mar 2000.
Bastian P, Birken K et al. UG—A flexible software toolbox for solving partial differential equations. Computation and Visualization in Science, 1997, 1: 27–40.
Article MATH Google Scholar
Wissink A M, Hornung R D, Kohn S R et al. Large scale parallel structured AMR calculations using the SAMRAI framework. In Proc. High-Performance Computing and Networking Conf. (SC’2001), Denver, 2001, pp. 22–28.
Lewis E E, Miller W F. Computational Methods of Neutron Transport. John Wiley & Sons Publisher, 1984.
Mo Z, Fu L, Parallel flux sweep algorithm for neutron transport on unstructured grid. J. Supercomputing, 2004, 30(1): 5–17.
Article MATH Google Scholar
Plimpton S, Hendrickson B, Burns S et al. Parallel algorithms for radiation transport on unstructured grids. In Proc. SuperComputing’2000, Dallas, Nov. 4–10, 2000, pp. 25–31.
Mo Z, Zhang A, Cao X. Towards a parallel framework of grid-based numerical algorithms on DAGs. In Proc. 18th Int. Symp. Parallel and Distributed Computing (IPDPS’06), Greece, April 25–29, 2006, pp. 416–424.
Dongarra J, Foster I, Fox G et al. (eds.). Sourcebook of Parallel Computing. Morgan Kaufmann Publishers, New York, 2003.
Bernholdt D E. Parallel computational chemistry: An overview of NWChem. Chapter 7 of Sourcebook of Parallel Computing, Dongarra J, Foster I, Fox G et al. (eds.), New York: Morgan Kaufmann Publishers, 2003.
Nieplocha J, Ju J, Krishnan M K et al. The global arrays user’s manual. Pacific Northwest National Laboratory Technical Report No.13130, October 1, 2002.
http://www.supercomputing.org/.
Jordan H F, Alaghband G, Jordan H E. Fundamentals of Parallel Computing. Prentice Hall. 2003.
Chakravorty S, Kale L V. A fault tolerant protocol for massively parallel systems. In Proc. 18th International Parallel and Distributed Processing Symposium (IPDPS), Santa Fe, New Mexico, 2004, pp. 212–219.
Stou Q F. Algorithms minimizing peak energy on mesh-connected systems. In Proc. 18th ACM Symp. Parallelism in Algorithms and Architectures (SPAA), Cambridge, MA, USA, 2006, pp. 331–334.
Shan J, Chen Y, Diao Q et al. Parallel information extraction on shared memory multi-processor system. In Proc. Int. Conf. Parallel Processing (ICPP), Columbus, Ohio, USA, 2006, pp. 215–224.
So B, Ghuloum A, Wu Y. Optimizing data parallel operations on many-core platforms. First Workshop on Software Tools for Multi-Core Systems (STMCS), Manhattan, NY, 2006, pp. 66–70.
Mattson T G, Sanders B A, Massingill B L. Patterns for Parallel Programming. Prentice Hall. 2005.

Download references

Author information

Authors and Affiliations

Anhui Province-MOST Key Co-Lab of High Performance Computing and Its Applications, Department of Computer Science and Technology, University of Science and Technology of China, Hefei, 230027, P.R. China
Guo-Liang Chen & Guang-Zhong Sun
Laboratory of Parallel Computing, Institute of Software, Chinese Academy of Sciences, Beijing, 100080, P.R. China
Yun-Quan Zhang
Laboratory of Computational Physics, Institute of Applied Physics and Computational Mathematics, Beijing, 100088, P.R. China
Ze-Yao Mo

Authors

Guo-Liang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Guang-Zhong Sun
View author publications
You can also search for this author in PubMed Google Scholar
Yun-Quan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Ze-Yao Mo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Guo-Liang Chen.

Additional information

Survey: Supported by the National Natural Science Foundation of China under Grant No.60533020.

Guo-Liang Chen is a professor and academician of the Chinese Academy of Sciences. He works with Dept. Computer Sci. & Tech., University of Science and Technology of China. His major research areas include parallel computing theory and algorithms.

Guang-Zhong Sun is a lecturer in the Dept. Computer Sci. & Tech., University of Science and Technology of China (USTC). His research interests include parallel algorithms and scheduling theory.

Yun-Quan Zhang is an associate professor and vice director of the Lab. of Parallel Computing, Institute of Software, CAS. His research interests include performance evaluation, parallel software design and parallel computational model.

Ze-Yao Mo is a professor. He has been doing researches on parallel algorithms and parallel application software for larger scale scientific and engineering numerical simulations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, GL., Sun, GZ., Zhang, YQ. et al. Study on Parallel Computing. J Comput Sci Technol 21, 665–673 (2006). https://doi.org/10.1007/s11390-006-0665-9

Download citation

Received: 18 April 2006
Revised: 14 June 2006
Issue Date: September 2006
DOI: https://doi.org/10.1007/s11390-006-0665-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Study on Parallel Computing

Abstract

Access this article

Similar content being viewed by others

Parallel Environments

A Brief History of the Parallel Dawn in Karl-Marx-Stadt/Chemnitz

Tenth International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar’2012)

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Study on Parallel Computing

Abstract

Access this article

Similar content being viewed by others

Parallel Environments

A Brief History of the Parallel Dawn in Karl-Marx-Stadt/Chemnitz

Tenth International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar’2012)

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation