Abstract
First-principles electronic structure calculations based on density functional theory (DFT) are well known to have a high computational cost that scales as O(N³), where N is the number of electrons. Reducing that cost is a key goal of the computational materials physics community, and machine learning (ML) is viewed as an essential tool for that task. However, ML model training requires an appropriate match between the input descriptors and the target property, as well as copious quantities of training data. Therefore, we present a computer program designed to automate the generation of local atomic environment descriptors for single-element systems that may be used for training neural networks to predict electronic potential function coefficients, {Ai}, which are used within the DFT-based orthogonalized linear combination of atomic orbitals (OLCAO) method [2]. In our approach, the total electronic potential function of a periodic crystal is expressed as a sum of localized atom-centered Gaussian functions. Each Gaussian function i has a fixed exponential coefficient αi. The set of {Ai} coefficients is updated in each cycle of the self-consistent field (SCF) iterations in accordance with the charge density computed in the previous SCF step. However, if the coefficients {Ai} can be accurately predicted for a given system, then the SCF process can be skipped entirely, satisfying an important requirement of our goal to reduce the computational cost. The prediction method uses suitable neural networks (NNs) whose inputs are a set of local atomic environment descriptors and whose outputs are the {Ai} coefficients for a targeted system. The descriptors we opted to use are the bispectrum components, but other descriptors may be incorporated.
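To make the potential representation concrete, the following minimal sketch evaluates one atom-centered potential term as a sum of Gaussians with fixed exponents αi and adjustable amplitudes Ai. The coefficient values are hypothetical illustrations, not fitted OLCAO parameters.

```python
import math

def site_potential(r_values, A, alpha):
    """Evaluate V(r) = sum_i A_i * exp(-alpha_i * r^2) at each distance r.

    In the scheme described above, the alpha_i exponents are fixed while
    the A_i amplitudes are updated each SCF cycle (or predicted by a NN).
    """
    return [sum(a * math.exp(-al * r * r) for a, al in zip(A, alpha))
            for r in r_values]

# Hypothetical coefficients for a single site (illustrative only)
alpha = [0.5, 2.0, 8.0]   # fixed exponential coefficients
A = [1.2, -0.4, 0.05]     # amplitudes a trained NN would predict

V = site_potential([0.0, 1.0, 2.0], A, alpha)
```

Because the αi are fixed, predicting the {Ai} amplitudes alone fully determines the potential, which is what makes the NN output layer a simple fixed-length vector.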
Bispectrum components are geometric quantities that smoothly capture subtle variations in the local atomic environment and that are invariant under translation, rotation, and permutation of neighboring atoms. The bispectrum components can also easily incorporate different types and numbers of elements, and they have been used by others for a similar purpose [3, 4]. Those requirements are difficult to achieve with other methods, such as a list of bond angles and bond lengths to nearest-neighbor atoms, while maintaining a fixed number of NN input features. Each bispectrum component is defined as
$$B_{j_1,j_2,j} = \sum_{m,m'} \left(u^{j}_{m,m'}\right)^{*} \sum_{m_1,m_1'} \sum_{m_2,m_2'} H^{j,m,m'}_{j_1,m_1,m_1';\,j_2,m_2,m_2'}\, u^{j_1}_{m_1,m_1'}\, u^{j_2}_{m_2,m_2'},$$
where the $u^{j}_{m,m'}$ are expansion coefficients of the local neighbor density in four-dimensional spherical harmonics and $H^{j,m,m'}_{j_1,m_1,m_1';\,j_2,m_2,m_2'}$ is the coupling coefficient for four-dimensional spherical harmonics, analogous to the Clebsch-Gordan coefficients for rotations in three-dimensional space. One challenge in this research is defining a suitable cut-off radius for evaluating the bispectrum components so that interactions between a targeted atom and its neighbors are not neglected. The cut-off radius is weighted as a function of the elements involved to accommodate different types of bonding (e.g., ionic, covalent, metallic). Additionally, for properly defining and training a neural network (see below), it is vital to establish a clear correlation between the physical (geometric) features captured by the bispectrum components and the electronic features that may simultaneously be present, to avoid excessive redundancy in the input data. At present this correlation is not well understood, which can limit the development of methods to predict electronic structure properties from bispectrum components and underscores the need for further research in this area. A supervised training framework for a proposed neural network is demonstrated using a data set of pure Si models that includes amorphous Si, crystalline Si, Si with a passive defect, and Si with self-interstitials. Other models will be implemented to compare efficiency. For each model, the input/target-output training pairs consist of local environment descriptors (bispectrum components) that encode the structure of neighboring atoms relative to the central atom at a specific point in real space as the input, along with the converged potential functions obtained by the SCF process as the target output. The data set must be partitioned into training, validation, and test sets so that the model's performance can be evaluated and optimized during the training process. In OLCAO, the total electronic potential function of a crystal is expressed as a sum of atom-centered potential functions.
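The cut-off handling described above can be illustrated with a smooth cosine cutoff, a common choice in bispectrum-style descriptors; the element-dependent weighting actually used in this work may differ.

```python
import math

def cosine_cutoff(r, r_cut):
    """Smooth weight: 1 at r = 0, decaying to 0 at r = r_cut, zero beyond.

    One common choice for bispectrum-style descriptors; an element-pair
    dependent r_cut can accommodate different bonding types.
    """
    if r >= r_cut:
        return 0.0
    return 0.5 * (math.cos(math.pi * r / r_cut) + 1.0)
```

Because the weight goes smoothly to zero at the boundary, atoms entering or leaving the cutoff sphere do not introduce discontinuities into the descriptors.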
Each atom-centered potential function is represented as a sum of Gaussian functions. However, it is vital to recognize that although the potential function is an assembly of site-centered functions, the potential function from a given site cannot be said to be the potential function "of" the atom at that site. Rather, the potential function at a given site is determined by the influence of all nearby atoms. Therefore, it is intuitive to seek an ML model that follows a similar structure. In this case, it is important to find a way for the input data structure to incorporate that feature of the potential function, which consists of a mixture of influences derived from the neighboring atoms. Each component of this mixture represents a cluster or subpopulation within the local region. To capture this structure, we propose a neural network framework based on the Mixture Density Network (MDN) [1] for the training process. This approach involves encoding the local, medium-range, and long-range (global) influences for each atom. In many cases, electron interactions are considered "short-sighted," meaning that they are mainly affected by nearby atoms only. However, our proposed method overcomes this limitation and can address long-range electronic structure properties such as those found in metallic or certain magnetic materials. Results regarding the optimization of the run time for calculating the bispectrum components are discussed, including a comparison against reference implementations of key functions that use third-party libraries such as SymPy. A computer program is developed to automatically generate bispectrum components for a single-element system in a periodic unit cell. We investigated the symmetry properties of the bispectrum components, which align with the proof established in [4]. However, further development and testing of the program are necessary before it can be applied to multiple-element systems.
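To make the MDN idea concrete, the sketch below computes the mixture-density loss for a scalar target: the network would emit mixing weights, means, and standard deviations, and training minimizes the negative log-likelihood [1]. The function name and argument shapes are illustrative, not the actual program interface.

```python
import math

def mdn_nll(y, weights, means, sigmas):
    """Negative log-likelihood of scalar target y under a Gaussian mixture.

    weights must be positive and sum to 1 (e.g., produced by a softmax
    layer); each mixture component can represent one cluster of
    neighboring-atom influence on the site potential.
    """
    likelihood = sum(
        w * math.exp(-0.5 * ((y - mu) / s) ** 2) / (s * math.sqrt(2 * math.pi))
        for w, mu, s in zip(weights, means, sigmas)
    )
    return -math.log(likelihood)

# Two hypothetical components; the loss falls as a component mean
# approaches the target value
loss = mdn_nll(0.9, weights=[0.7, 0.3], means=[1.0, -1.0], sigmas=[0.2, 0.5])
```

A plain regression network would output the {Ai} directly; the MDN instead models a distribution over them, which matches the picture of the site potential as a mixture of distinct neighbor influences.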
Overall, this research contributes to the ongoing effort to develop new and improved neural network frameworks for predicting the electronic structure properties of materials with desirable features. When combined with other unique aspects of the OLCAO method, this approach is expected to enable us to overcome the O(N³) algorithmic complexity scaling problem and thereby address multi-scale physics problems that require both direct access to the electronic wave function and a realistically large number of atoms.