Skip to main content
Log in

IPM based sparse LP solver on a heterogeneous processor

  • Original Paper
  • Published:
Computational Management Science Aims and scope Submit manuscript

Abstract

We present the parallelization of a linear programming solver using a primal-dual interior point method on one of the heterogeneous processors, namely the Cell BE processor. Focus is given to Cholesky factorization as it is the most computationally expensive kernel in interior point methods. To make it easier to develop and port to other heterogeneous systems, we propose a two-phase implementation procedure where we first develop a shared-memory multithreaded application that executes only on the main processor, and then offload the compute-intensive tasks to execute on the synergistic processors (Cell accelerator cores). We used parent–child supernode amalgamation to increase sizes of the blocks, but we noticed that the existence of many small blocks cause significant performance degradation. To reduce the overhead of small blocks, we extend the block fan-out algorithm such that small blocks are aggregated into large blocks without adding extra zeros. We also use another type of amalgamation that can merge any two consecutive supernodes and use it to avoid having very small blocks in a composed block. The suggested block aggregation method is able to speedup the whole LP solver of up to 2.5 when compared to using parent–child supernode amalgamation alone.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ashcraft C, Grimes R (1989) The influence of relaxed supernode partitions on the multifrontal method. ACM Trans Math Softw 15: 291–309

    Article  Google Scholar 

  • Eleyat M, Natvig L (2010a) Implementation of a linear programming solver on the Cell BE processor. In: Procedia computer science, international conference on computational science, pp 1049–1058

  • Eleyat M, Natvig L (2010b) Mixed-precision parallel linear programming solver. In: SBAC-PAD 2010, 22nd int’l symposium on computer architecture and high performance computing, pp 41–46

  • Feng W, Cameron KW (2011) The Green500 List-June 2011. The Green 500. http://www.green500.org/lists/2011/06/top/list.php. Accessed July 2011

  • Gay D (1985) Electronic mail distribution of linear programming test problems. In: Mathematical Programming Society COAL newsletter, pp 10–12

  • Kahle JA, Day HP, Hofstee CR, Johns MN, Maeurer TR, Shippy D (2005) Introduction to the Cell multiprocessor. IBM J Res Develop 49: 589–604

    Article  Google Scholar 

  • Kumar S, Hughes C, Nguyen A (2007) Carbon: architectural support for fine-grained parallelism on chip multiprocessors. In: Proceedings intl. symp. on comp. arch (ISCA), pp 162–173. doi:10.1145/1250662.1250683

  • Kurzak J, Buttari A, Dongarra J (2008) Solving Systems of Linear Equations on the CELL Processor Using Cholesky Factorization. IEEE Trans Parallel Distrib Syst 19: 1175–1186

    Article  Google Scholar 

  • Lee H, Kim J, Hong SJ, Lee S (2003) Task scheduling using a block dependency DAG for block-oriented sparse Cholesky factorization. Parallel Comput 29: 135–159

    Article  Google Scholar 

  • Luenberger D (2007) Linear and nonlinear programming. Springer Science, New York

    Google Scholar 

  • Makhorin A (2008) GLPK (GNU Linear Programming Kit). Moscow Aviation Institute. http://www.gnu.org/software/glpk/ . Accessed 12 Sept 2009

  • Mehrotra S (1992) On the implementation of a primal-dual interior point method. SIAM J Optim 2: 575–601. doi:10.1137/0802028

    Article  Google Scholar 

  • Ng EG, Peyton BW (1993) Block sparse Cholesky algorithms on advanced uniprocessor computers. SIAM J Sci Comput 14: 1034–1056

    Article  Google Scholar 

  • Rothberg E, Gupta A (1994) An efficient block-orientated approach to parallel sparse Cholesky factorization. SIAM J Sci Comput 15: 1413–1439

    Article  Google Scholar 

  • Rothberg E, Schreiber (1999) Efficient methods for out-of-core sparse Cholesky factorization. SIAM J Sci Comput 21: 129–144

    Article  Google Scholar 

  • Rozin E, Toledo S (2005) Locality of reference in sparse Cholesky methods. Electron Trans Numer Anal 21: 81–106

    Google Scholar 

  • Shi G, Kindratenko V, Ufimtsev I, Martinez T, Phillips J, Gottlieb S (2009) Implementation of scientific computing applications on the Cell broadband engine. Sci Program 17: 135–151

    Google Scholar 

  • Smelyanskiy M, Lee VW, Kim D, Nguyen AD, Dubey P (2007) Scaling performance of interior-point method on large-scale chip multiprocessor system. In: ACM/IEEE Supercomputing SC’07

  • Vishwas B, Gadia A, Chaudhuri M (2009) Implementing a parallel matrix factorization library on the cell broadband engine. Sci Programm 17: 3–29

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mujahed Eleyat.

Additional information

This research has been supported by Miriam AS, http://www.miriam.as, and the Norwegian Research Council. The authors are members of HiPEAC2 NoE.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Eleyat, M., Natvig, L. IPM based sparse LP solver on a heterogeneous processor. Comput Manag Sci 9, 287–299 (2012). https://doi.org/10.1007/s10287-012-0137-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10287-012-0137-3

Keywords

Navigation