skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Rapid Identification of X-ray Diffraction Patterns Based on Very Limited Data by Interpretable Convolutional Neural Networks

Journal Article · · Journal of Chemical Information and Modeling

Large volumes of data from material characterizations call for rapid and automatic data analysis to accelerate materials discovery. Herein, we report a convolutional neural network (CNN) that was trained based on theoretical data and very limited experimental data for fast identification of experimental X-ray diffraction (XRD) patterns of metal–organic frameworks (MOFs). To augment the data for training the model, noise was extracted from experimental data and shuffled; then it was merged with the main peaks that were extracted from theoretical spectra to synthesize new spectra. For the first time, one-to-one material identification was achieved. Theoretical MOFs patterns (1012) were augmented to a whole data set of 72 864 samples. It was then randomly shuffled and split into training (58 292 samples) and validation (14 572 samples) data sets at a ratio of 4:1. For the task of discriminating, the optimized model showed the highest identification accuracy of 96.7% for the top 5 ranking on a test data set of 30 hold-out samples. Neighborhood component analysis (NCA) on the experimental XRD samples shows that the samples from the same material are clustered in groups in the NCA map. Analysis on the class activation maps of the last CNN layer further discloses the mechanism by which the CNN model successfully identifies individual MOFs from the XRD patterns. Furthermore, this CNN model trained by the data augmentation technique would not only open numerous potential applications for identifying XRD patterns for different materials, but also pave avenues to autonomously analyze data by other characterization tools such as FTIR, Raman, and NMR spectroscopies.

Research Organization:
Univ. of Missouri, Columbia, MO (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
FE0031645
OSTI ID:
1853798
Journal Information:
Journal of Chemical Information and Modeling, Vol. 60, Issue 4; ISSN 1549-9596
Publisher:
American Chemical SocietyCopyright Statement
Country of Publication:
United States
Language:
English

References (30)

Accelerating Materials Development via Automation, Machine Learning, and High-Performance Computing journal August 2018
Fast and interpretable classification of small X-ray diffraction datasets using data augmentation and deep neural networks journal May 2019
Neural network based classification of crystal symmetries from x-ray diffraction patterns journal June 2019
A robotic platform for flow synthesis of organic compounds informed by AI planning journal August 2019
Organic synthesis in a modular robotic system driven by a chemical programming language journal November 2018
Controlling an organic synthesis robot with machine learning to search for new reactivity journal July 2018
Universal fragment descriptors for predicting properties of inorganic crystals journal June 2017
New frontiers for the materials genome initiative journal April 2019
Machine learning for molecular and materials science journal July 2018
Discovery of Intermetallic Compounds from Traditional to Machine-Learning Approaches journal December 2017
Design of self-assembly dipeptide hydrogels and machine learning via their chemical features journal May 2019
A general-purpose machine learning framework for predicting properties of inorganic materials journal August 2016
Machine learning-assisted directed protein evolution with combinatorial libraries journal April 2019
Insightful classification of crystal structures using deep learning journal July 2018
Decoding crystallography from high-resolution electron imaging and diffraction datasets with deep learning journal October 2019
Comparison of dissimilarity measures for cluster analysis of X-ray diffraction data from combinatorial libraries journal February 2017
High-throughput determination of structural phase diagram and constituent phases using GRENDEL journal October 2015
Unsupervised phase mapping of X-ray diffraction data by nonnegative matrix factorization integrated with custom clustering journal August 2018
Crystallographic prediction from diffraction and chemistry data for higher throughput classification using machine learning journal February 2020
Unsupervised classification of single-particle X-ray diffraction snapshots by spectral clustering journal January 2011
Deep Learning of Atomically Resolved Scanning Transmission Electron Microscopy Images: Chemical Identification and Tracking Local Transformations journal October 2016
A deep-learning technique for phase identification in multiphase inorganic compounds using synthetic XRD powder patterns journal January 2020
Classification of crystal structure using a convolutional neural network journal June 2017
Automated estimation of materials parameter from X-ray absorption and electron energy-loss spectra with similarity measures journal March 2019
Development of a Cambridge Structural Database Subset: A Collection of Metal–Organic Frameworks for Past, Present, and Future journal March 2017
Gradient-based learning applied to document recognition journal January 1998
Deep learning journal May 2015
Deep learning for sensor-based activity recognition: A survey journal March 2019
MultiCAM: Multiple Class Activation Mapping for Aircraft Recognition in Remote Sensing Images journal March 2019
Learning Deep Features for Discriminative Localization conference June 2016