Kernel Polynomial Method on GPU

Zhang, Shixun; Yamagiwa, Shinichi; Okumura, Masahiko; Yunoki, Seiji

doi:10.1007/s10766-012-0204-y

Shixun Zhang¹,
Shinichi Yamagiwa²,
Masahiko Okumura³ &
…
Seiji Yunoki^4,5,6

439 Accesses
1 Citation
Explore all metrics

Abstract

The simulation of lattice model systems for quantum materials is one of the most important approaches to understand quantum properties of matter in condensed matter physics. The main task in the simulation is to diagonalize a Hamiltonian matrix for the system and evaluate the electronic density of energy states. Kernel polynomial method (KPM) is one of the promising simulation methods. Because KPM contains a fine-grain recursive part in the algorithm, it is hard to parallelize it under the thread level parallelism such as on a supercomputer or a cluster computer. This paper focuses on methods to parallelize KPM on a massively parallel environment of GPU, aiming to achieve high parallelism for more speedups than the recent CPUs. This paper proposes two implementation methods called the full map and the sliding window methods, and evaluates the performances in the recent GPU platform. To enlarge available simulation sizes and at the same time to enhance the performance, this paper also describes additional optimization techniques depending on the GPU architecture.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Bednorz J.G., Müller K.A.: Possible high T _c superconductivity in the Ba-La-Cu-O system. Z. Phys. B Condens. Matter 64(2), 189–193 (1986)
Article Google Scholar
Dagotto E.: Correlated electrons in high-temperature superconductors. Rev. Mod. Phys. 66(3), 763–840 (1994)
Article Google Scholar
Ferrario M., Ciccotti G., Binder K.: Computater Simulations in Condensed Matter: From Materials to Chemical Biology, vol. 1, 2. Springer, Berlin (2006)
Book Google Scholar
Foulkes W., Mitas L., Needs R., Rajagopal G.: Quantum monte carlo simulations of solids. Rev. Mod. Phys. 73(1), 33–83 (2001)
Article Google Scholar
Grimes, R., Kincaid, D., Young., D.: ITPACK 2.0 User’s Guide. Technical Report CNA-150, Center for Numerical Analysis, University of Texas (1979)
Grotendorst, J., Mark, D., Muramatsu, A.: Quantum Simulations of Complex Many-Body Systems: From Theory to Algorithms. NIC-Directors (2002)
McCalpin, J.D.: Memory Bandwidth and Machine Balance in Current High Performance Computers. IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter (1995)
Nguyen H.: GPU Gems 3, 1st edn. Addison-Wesley Professional, Reading (2007)
Google Scholar
NVIDIA Corporation: CUDA: Compute unified device architecture programming guide. http://developer.nvidia.com/cuda
Ohno K., Esfarjani K., Kawazoe Y.: Computational Materials Science. Springer, Berlin (1999)
Book Google Scholar
Schollwöck U.: The density-matrix renormalization group. Rev. Mod. Phys. 77(1), 259–315 (2005)
Article Google Scholar
Varga R.: Geršgorin and His Circles. Springer Series in Computational Mathematics. Springer, Berlin (2004)
Google Scholar
Weiße A., Wellein G., Alvermann A., Fehske H.: The kernel polynomial method. Rev. Mod. Phys. 78(1), 275–306 (2006)
Article MATH Google Scholar
White S.: Density matrix formulation for quantum renormalization groups. Phys. Rev. Lett. 69(19), 2863–2866 (1992)
Article Google Scholar
White S.: Density-matrix algorithms for quantum renormalization groups. Phys. Rev. B 48(14), 10345–10356 (1993)
Article Google Scholar
Yamada S., Okumura M., Machida M.: Direct extension of density-matrix renormalization group to two-dimensional quantum lattice systems: studies of parallel algorithm, accuracy, and performance. J. Phys. Soc. Jpn. 78(9), 094004 (2009)
Article Google Scholar
Yamashita M., Nakata N., Senshu Y., Nagata M., Yamamoto H.M., Kato R., Shibauchi T., Matsuda Y.: Highly mobile gapless excitations in a two-dimensional candidate quantum spin liquid. Science 328(5983), 1246–1248 (2010)
Article Google Scholar
Zhang, S., Yamagiwa, S., Okumura, M., Yunoki, S.: Performance accelaration of kernel polynomial method applying graphics processing units. In: IPDPS/APDCM 2011, pp. 564–571. IEEE CS (2011)

Download references

Author information

Authors and Affiliations

Department of Computer Science, Graduate School of System and Information Science, University of Tsukuba, Tsukuba, Ibaraki, 305-8573, Japan
Shixun Zhang
Faculty of Engineering, Information and Systems, University of Tsukuba/JST PRESTO, Tsukuba, Ibaraki, 305-8573, Japan
Shinichi Yamagiwa
CCSE, Japan Atomic Energy Agency, 5-1-5 Kashiwanoha, Kashiwa, Chiba, 277-8587, Japan
Masahiko Okumura
Computational Condensed Matter Physics Laboratory, RIKEN ASI, Wako, Saitama, 351-0198, Japan
Seiji Yunoki
JST CREST, Kawaguchi, Saitama, 332-0012, Japan
Seiji Yunoki
Computational Materials Science Research Team, RIKEN AICS, Kobe, Hyogo, 650-0047, Japan
Seiji Yunoki

Authors

Shixun Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shinichi Yamagiwa
View author publications
You can also search for this author in PubMed Google Scholar
Masahiko Okumura
View author publications
You can also search for this author in PubMed Google Scholar
Seiji Yunoki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shinichi Yamagiwa.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, S., Yamagiwa, S., Okumura, M. et al. Kernel Polynomial Method on GPU. Int J Parallel Prog 41, 59–88 (2013). https://doi.org/10.1007/s10766-012-0204-y

Download citation

Received: 10 April 2012
Accepted: 21 June 2012
Published: 04 July 2012
Issue Date: February 2013
DOI: https://doi.org/10.1007/s10766-012-0204-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Kernel Polynomial Method on GPU

Abstract

Access this article

Similar content being viewed by others

NVIDIA SimNet™: An AI-Accelerated Multi-Physics Simulation Framework

The Peridigm Meshfree Peridynamics Code

A State of the Art Review of the Particle Finite Element Method (PFEM)

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Kernel Polynomial Method on GPU

Abstract

Access this article

Similar content being viewed by others

NVIDIA SimNet™: An AI-Accelerated Multi-Physics Simulation Framework

The Peridigm Meshfree Peridynamics Code

A State of the Art Review of the Particle Finite Element Method (PFEM)

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation