Skip to main content
Log in

Kernel Polynomial Method on GPU

  • Published:
International Journal of Parallel Programming Aims and scope Submit manuscript

Abstract

The simulation of lattice model systems for quantum materials is one of the most important approaches to understand quantum properties of matter in condensed matter physics. The main task in the simulation is to diagonalize a Hamiltonian matrix for the system and evaluate the electronic density of energy states. Kernel polynomial method (KPM) is one of the promising simulation methods. Because KPM contains a fine-grain recursive part in the algorithm, it is hard to parallelize it under the thread level parallelism such as on a supercomputer or a cluster computer. This paper focuses on methods to parallelize KPM on a massively parallel environment of GPU, aiming to achieve high parallelism for more speedups than the recent CPUs. This paper proposes two implementation methods called the full map and the sliding window methods, and evaluates the performances in the recent GPU platform. To enlarge available simulation sizes and at the same time to enhance the performance, this paper also describes additional optimization techniques depending on the GPU architecture.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bednorz J.G., Müller K.A.: Possible high T c superconductivity in the Ba-La-Cu-O system. Z. Phys. B Condens. Matter 64(2), 189–193 (1986)

    Article  Google Scholar 

  2. Dagotto E.: Correlated electrons in high-temperature superconductors. Rev. Mod. Phys. 66(3), 763–840 (1994)

    Article  Google Scholar 

  3. Ferrario M., Ciccotti G., Binder K.: Computater Simulations in Condensed Matter: From Materials to Chemical Biology, vol. 1, 2. Springer, Berlin (2006)

    Book  Google Scholar 

  4. Foulkes W., Mitas L., Needs R., Rajagopal G.: Quantum monte carlo simulations of solids. Rev. Mod. Phys. 73(1), 33–83 (2001)

    Article  Google Scholar 

  5. Grimes, R., Kincaid, D., Young., D.: ITPACK 2.0 User’s Guide. Technical Report CNA-150, Center for Numerical Analysis, University of Texas (1979)

  6. Grotendorst, J., Mark, D., Muramatsu, A.: Quantum Simulations of Complex Many-Body Systems: From Theory to Algorithms. NIC-Directors (2002)

  7. McCalpin, J.D.: Memory Bandwidth and Machine Balance in Current High Performance Computers. IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter (1995)

  8. Nguyen H.: GPU Gems 3, 1st edn. Addison-Wesley Professional, Reading (2007)

    Google Scholar 

  9. NVIDIA Corporation: CUDA: Compute unified device architecture programming guide. http://developer.nvidia.com/cuda

  10. Ohno K., Esfarjani K., Kawazoe Y.: Computational Materials Science. Springer, Berlin (1999)

    Book  Google Scholar 

  11. Schollwöck U.: The density-matrix renormalization group. Rev. Mod. Phys. 77(1), 259–315 (2005)

    Article  Google Scholar 

  12. Varga R.: Geršgorin and His Circles. Springer Series in Computational Mathematics. Springer, Berlin (2004)

    Google Scholar 

  13. Weiße A., Wellein G., Alvermann A., Fehske H.: The kernel polynomial method. Rev. Mod. Phys. 78(1), 275–306 (2006)

    Article  MATH  Google Scholar 

  14. White S.: Density matrix formulation for quantum renormalization groups. Phys. Rev. Lett. 69(19), 2863–2866 (1992)

    Article  Google Scholar 

  15. White S.: Density-matrix algorithms for quantum renormalization groups. Phys. Rev. B 48(14), 10345–10356 (1993)

    Article  Google Scholar 

  16. Yamada S., Okumura M., Machida M.: Direct extension of density-matrix renormalization group to two-dimensional quantum lattice systems: studies of parallel algorithm, accuracy, and performance. J. Phys. Soc. Jpn. 78(9), 094004 (2009)

    Article  Google Scholar 

  17. Yamashita M., Nakata N., Senshu Y., Nagata M., Yamamoto H.M., Kato R., Shibauchi T., Matsuda Y.: Highly mobile gapless excitations in a two-dimensional candidate quantum spin liquid. Science 328(5983), 1246–1248 (2010)

    Article  Google Scholar 

  18. Zhang, S., Yamagiwa, S., Okumura, M., Yunoki, S.: Performance accelaration of kernel polynomial method applying graphics processing units. In: IPDPS/APDCM 2011, pp. 564–571. IEEE CS (2011)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shinichi Yamagiwa.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, S., Yamagiwa, S., Okumura, M. et al. Kernel Polynomial Method on GPU. Int J Parallel Prog 41, 59–88 (2013). https://doi.org/10.1007/s10766-012-0204-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10766-012-0204-y

Keywords

Navigation