Abstract
This paper describes several parallel algorithmic variations of the Neville elimination. This elimination solves a system of linear equations making zeros in a matrix column by adding to each row an adequate multiple of the preceding one. The parallel algorithms are run and compared on different multi- and many-core platforms using parallel programming techniques as MPI, OpenMP and CUDA.
Similar content being viewed by others
References
Intel (2005) Intel multi-core processor architecture developer backgrounder. White paper
Owens JD, Houston M, Luebke D, Green S, Stone JE, Phillips JC (2008) GPU computing. Proc IEEE 96(5):879–899
Gasca M, Peña JM (1992) Total positivity and Neville elimination. Linear Algebra Appl 165:25–44
Gasca M, Peña JM (1994) A matricial description of Neville elimination with applications to total positivity. Linear Algebra Appl 202:33–45
Demmel J, Koev P (2005) The accurate and efficient solution of a totally positive generalized Vandermonde linear system. SIAM J Matrix Anal Appl 27:142–152
Gemignani L (2008) Neville elimination for rank-structured matrices. Linear Algebra Appl 428(4):978–991
Alonso P, Cortina R, Díaz I, Ranilla J (2004) Neville elimination: a study of the efficiency using checkerboard partitioning. Linear Algebra Appl 393:3–14
Alonso P, Díaz I, Cortina R, Ranilla J (2008) Scalability of Neville elimination using checkerboard partitioning. Int J Comput Math 85(3–4):309–317
Alonso P, Cortina R, Díaz I, Ranilla J (2009) Blocking Neville elimination algorithm for exploiting cache memories. Appl Math Comput 209(1):2–9
Cortina R (2008) El método de Neville: un enfoque basado en Computación de Altas Prestaciones. Ph.D. thesis, Univ. of Oviedo, Spain
Chandra R et al (2001) Parallel programming in OpenMP. Morgan Kaufmann, San Mateo
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Alonso, P., Cortina, R., Martínez-Zaldívar, F.J. et al. Neville elimination on multi- and many-core systems: OpenMP, MPI and CUDA. J Supercomput 58, 215–225 (2011). https://doi.org/10.1007/s11227-009-0360-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-009-0360-z