OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Predicting Energetics Materials’ Crystalline Density from Chemical Structure by Machine Learning

Journal Article · Journal of Chemical Information and Modeling

To expedite new molecular compound development, a long-sought goal within the chemistry community has been to predict molecules’ bulk properties of interest from chemical structure alone, prior to synthesis. In this work, we demonstrate that machine learning methods can indeed be used to directly learn the relationship between chemical structures and bulk crystalline properties of molecules, even in the absence of any crystal structure information or quantum mechanical calculations. We focus specifically on high explosives (HE), a class of organic compounds categorized as energetic materials, and on predicting their crystalline density. An ongoing challenge within the chemistry machine learning community is deciding how best to featurize molecules as inputs to machine learning models: whether expert handcrafted features or molecular representations learned by graph-based neural network models yield better results, and why. We evaluate both types of representations in combination with a number of machine learning models to predict the crystalline densities of HE-like molecules curated from the Cambridge Structural Database, and we report the performance and the pros and cons of our methods. Our message passing neural network (MPNN) based models with learned molecular representations generally perform best, outperforming current state-of-the-art methods at predicting crystalline density and performing well even when tested on a data set not representative of the training data. However, these models are traditionally considered black boxes and are less easily interpretable. To address this common challenge, we also provide a comparative analysis of our MPNN-based model and models with fixed feature representations, which offers insight into the features the MPNN learns in order to predict density accurately.

Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
AC52-07NA27344
OSTI ID:
1784602
Report Number(s):
LLNL-JRNL-816318; 1025411
Journal Information:
Journal of Chemical Information and Modeling, Vol. 61, Issue 5; ISSN 1549-9596
Publisher:
American Chemical Society
Country of Publication:
United States
Language:
English
