skip to main content
10.1145/1851476.1851512acmconferencesArticle/Chapter ViewAbstractPublication PageshpdcConference Proceedingsconference-collections
research-article

GPU-based parallel householder bidiagonalization

Authors Info & Claims
Published:21 June 2010Publication History

ABSTRACT

In this paper, we discuss the GPU-based implementation and optimization of Householder bidiagonalization, a matrix factorization method which is an integral part of full Singular Value Decomposition (SVD) - an important algorithm for many problems in the research domain of Multimedia Content Analysis (MMCA). On cluster computers, complex adaptive run-time techniques often must be implemented to overcome the growing negative performance impact of load imbalances and to ensure reasonable speedup. We show that the nature of the many-core platform can avoid the necessity of applying such complex run-time parallelization techniques in software while achieving a performance of 64 gigaflops/s on a single-GPU GTX 295 in double precision, 82% of the theoretical peak performance.

References

  1. }}M. M. Baskaran and R. Bordawekar. Optimizing sparse matrix-vector multiplication on gpus. Technical Report RC24704, IBM, 2008.Google ScholarGoogle Scholar
  2. }}D. Evans and M. Gusev. Systolic svd and qr decomposition by householder reflections. Int. J. Comp. Math., 79(4):417--439, Jan. 2002.Google ScholarGoogle ScholarCross RefCross Ref
  3. }}N. Galoppo, N. K. Govindaraju, M. Henson, and D. Manocha. Lu-gpu: Efficient algorithms for solving dense linear systems on graphics hardware. In SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing, Washington, DC, USA, 2005. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. }}G. Golub and C. Reinsch. Singular value decomposition and least squares solutions. Numerische Mathematik, 14(5):403--20, Apr. 1970.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. }}F. Liu and F. Seinstra. Adaptive parallel householder bidiagonalization. In Proceedings of the 15th International Euro-Par Conference (Euro-Par 2009), pages 821--833, Delft, The Netherlands, Aug. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. }}F. Seinstra, J. Geusebroek, D. Koelma, C. Snoek, M. Worring, and A. Smeulders. High-performance distributed video content analysis with parallel-horus. IEEE Multimedia, 14(4):64--75, Oct. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. }}R. V. van Nieuwpoort and J. W. Romein. Using many-core hardware to correlate radio astronomy signals. In Proceedings of the ACM International Conference on Supercomputing (ICS'09), pages 440--449, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. }}J. H. Wilkinson. Householder's method for the solution of the algebraic eigenproblem. The Computer Journal, 3(1):23--27, Apr. 1960.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. GPU-based parallel householder bidiagonalization

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        HPDC '10: Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
        June 2010
        911 pages
        ISBN:9781605589428
        DOI:10.1145/1851476

        Copyright © 2010 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 21 June 2010

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate166of966submissions,17%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader