IPM based sparse LP solver on a heterogeneous processor

Eleyat, Mujahed; Natvig, Lasse

doi:10.1007/s10287-012-0137-3


Original Paper
Published: 18 January 2012

Computational Management Science, Volume 9, pages 287–299 (2012)

Mujahed Eleyat¹,² & Lasse Natvig¹


Abstract

We present the parallelization of a linear programming (LP) solver that uses a primal-dual interior point method (IPM) on a heterogeneous processor, the Cell BE. We focus on Cholesky factorization, as it is the most computationally expensive kernel in interior point methods. To make the solver easier to develop and to port to other heterogeneous systems, we propose a two-phase implementation procedure: we first develop a shared-memory multithreaded application that executes only on the main processor, and then offload the compute-intensive tasks to the synergistic processors (the Cell accelerator cores). We used parent–child supernode amalgamation to increase the block sizes, but we noticed that the presence of many small blocks causes significant performance degradation. To reduce the overhead of small blocks, we extend the block fan-out algorithm so that small blocks are aggregated into large blocks without adding extra zeros. We also use another type of amalgamation, which can merge any two consecutive supernodes, to avoid having very small blocks in a composed block. The suggested block aggregation method speeds up the whole LP solver by a factor of up to 2.5 compared with using parent–child supernode amalgamation alone.
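To illustrate why Cholesky factorization dominates the cost, recall that a primal-dual IPM iteration typically reduces to solving normal equations of the form (A D Aᵀ) Δy = r, where D is a positive diagonal matrix that changes at every iteration. The following is a minimal dense sketch of that step, not the authors' sparse, block-oriented Cell BE implementation; the function names and the small example data are assumptions made for illustration only.

```python
import math

def normal_matrix(A, d):
    """Form M = A * diag(d) * A^T for a dense row-major matrix A (m x n)."""
    m, n = len(A), len(A[0])
    return [[sum(A[i][k] * d[k] * A[j][k] for k in range(n)) for j in range(m)]
            for i in range(m)]

def cholesky(M):
    """Return lower-triangular L with L * L^T = M (M symmetric positive definite)."""
    m = len(M)
    L = [[0.0] * m for _ in range(m)]
    for j in range(m):
        # Diagonal entry: subtract the squares already placed in row j.
        s = M[j][j] - sum(L[j][k] ** 2 for k in range(j))
        if s <= 0.0:
            raise ValueError("matrix is not positive definite")
        L[j][j] = math.sqrt(s)
        # Entries below the diagonal in column j.
        for i in range(j + 1, m):
            L[i][j] = (M[i][j] - sum(L[i][k] * L[j][k] for k in range(j))) / L[j][j]
    return L

def solve_cholesky(L, r):
    """Solve (L L^T) x = r by forward, then backward substitution."""
    m = len(L)
    y = [0.0] * m
    for i in range(m):
        y[i] = (r[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i]
    x = [0.0] * m
    for i in reversed(range(m)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, m))) / L[i][i]
    return x

# Hypothetical data: 2 constraints, 4 variables, one IPM scaling vector d.
A = [[1.0, 1.0, 1.0, 0.0],
     [2.0, 1.0, 0.0, 1.0]]
d = [0.5, 2.0, 1.0, 4.0]
r = [1.0, 3.0]

M = normal_matrix(A, d)      # rebuilt every iteration because d changes
L = cholesky(M)              # the expensive kernel
dy = solve_cholesky(L, r)    # cheap triangular solves
```

In a sparse solver the factor L is stored and updated in blocks derived from supernodes, which is where the block sizes discussed in the abstract become relevant; the dense loops here only show the arithmetic that those blocks carry out.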



Author information

Authors and Affiliations

Department of Computer and Information Science, Norwegian University of Science and Technology, Sem Sælands vei 7-9, 7034, Trondheim, Norway
Mujahed Eleyat & Lasse Natvig
Miriam AS, Storgata 7, 1771, Halden, Norway
Mujahed Eleyat


Corresponding author

Correspondence to Mujahed Eleyat.

Additional information

This research has been supported by Miriam AS, http://www.miriam.as, and the Norwegian Research Council. The authors are members of HiPEAC2 NoE.


About this article

Cite this article

Eleyat, M., Natvig, L. IPM based sparse LP solver on a heterogeneous processor. Comput Manag Sci 9, 287–299 (2012). https://doi.org/10.1007/s10287-012-0137-3


Received: 01 October 2010
Accepted: 04 January 2012
Published: 18 January 2012
Issue Date: May 2012
DOI: https://doi.org/10.1007/s10287-012-0137-3
