Inverse design of crystals using generalized invertible crystallographic representation

Deep learning has fostered many novel applications in materials informatics. However, the inverse design of inorganic crystals, $\textit{i.e.}$ generating new crystal structure with targeted properties, remains a grand challenge. An important ingredient for such generative models is an invertible representation that accesses the full periodic table. This is challenging due to limited data availability and the complexity of 3D periodic crystal structures. In this paper, we present a generalized invertible representation that encodes the crystallographic information into the descriptors in both real space and reciprocal space. Combining with a generative variational autoencoder (VAE), a wide range of crystallographic structures and chemistries with desired properties can be inverse-designed. We show that our VAE model predicts novel crystal structures that do not exist in the training and test database (Materials Project) with targeted formation energies and band gaps. We validate those predicted crystals by first-principles calculations. Finally, to design solids with practical applications, we address the sparse label problem by building a semi-supervised VAE and demonstrate its successful prediction of unique thermoelectric materials

[1]  Rex A. Palmer,et al.  Structure determination by X-ray crystallography , 1977 .

[2]  Scott Kirkpatrick,et al.  Optimization by simulated annealing: Quantitative studies , 1984 .

[3]  L. T. Wille Searching potential energy surfaces by simulated annealing , 1987, Nature.

[4]  J. C. Schön,et al.  First Step Towards Planning of Syntheses in Solid‐State Chemistry: Determination of Promising Structure Candidates by Global Optimization , 1996 .

[5]  S. Goedecker Minima hopping: an efficient search method for the global minimum of the potential energy surface of complex molecular systems. , 2004, The Journal of chemical physics.

[6]  David J. Singh,et al.  BoltzTraP. A code for calculating band-structure dependent quantities , 2006, Comput. Phys. Commun..

[7]  Raúl J. Martín-Palma,et al.  Nanotechnology for Microelectronics and Optoelectronics , 2006 .

[8]  J. Behler Atom-centered symmetry functions for constructing high-dimensional neural network potentials. , 2011, The Journal of chemical physics.

[9]  Li Zhu,et al.  CALYPSO: A method for crystal structure prediction , 2012, Comput. Phys. Commun..

[10]  K. Müller,et al.  Fast and accurate modeling of molecular atomization energies with machine learning. , 2011, Physical review letters.

[11]  Kristin A. Persson,et al.  Commentary: The Materials Project: A materials genome approach to accelerating materials innovation , 2013 .

[12]  R. Kondor,et al.  On representing chemical environments , 2012, 1209.3140.

[13]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[14]  Felix A Faber,et al.  Crystal structure representations for machine learning models of formation energies , 2015, 1503.07406.

[15]  Muratahan Aykol,et al.  The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies , 2015 .

[16]  Tonio Buonassisi,et al.  Identifying defect-tolerant semiconductors with high minority-carrier lifetimes: beyond hybrid lead halide perovskites , 2015, 1504.02144.

[17]  C. Qi Deep Learning on Point Sets for 3 D Classification and Segmentation , 2016 .

[18]  Wei Chen,et al.  An ab initio electronic transport database for inorganic materials , 2017, Scientific Data.

[19]  Matt J. Kusner,et al.  Grammar Variational Autoencoder , 2017, ICML.

[20]  Matthias Rupp,et al.  Unified representation of molecules and crystals for machine learning , 2017, Mach. Learn. Sci. Technol..

[21]  Haoyan Huo,et al.  Unified Representation for Machine Learning of Molecules and Crystals , 2017 .

[22]  Su-Huai Wei,et al.  Design of Lead-Free Inorganic Halide Perovskites for Solar Cells via Cation-Transmutation. , 2017, Journal of the American Chemical Society.

[23]  Christopher Burgess,et al.  beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.

[24]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Dmitry Vetrov,et al.  Entangled Conditional Adversarial Autoencoder for de Novo Drug Discovery. , 2018, Molecular pharmaceutics.

[26]  Ting Liu,et al.  Recent advances in convolutional neural networks , 2015, Pattern Recognit..

[27]  Alán Aspuru-Guzik,et al.  Inverse molecular design using machine learning: Generative models for matter engineering , 2018, Science.

[28]  A. Zunger Inverse design in search of materials with target functionalities , 2018 .

[29]  Tonio Buonassisi,et al.  Accelerating Materials Development via Automation, Machine Learning, and High-Performance Computing , 2018, Joule.

[30]  Alán Aspuru-Guzik,et al.  Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules , 2016, ACS central science.

[31]  Qi Liu,et al.  Constrained Graph Variational Autoencoders for Molecule Design , 2018, NeurIPS.

[32]  Jeffrey C Grossman,et al.  Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material Properties. , 2017, Physical review letters.

[33]  K-R Müller,et al.  SchNet - A deep learning architecture for molecules and materials. , 2017, The Journal of chemical physics.

[34]  Jianjun Hu,et al.  First-principle-based computational doping of SrTiO$$_{3}$$3 using combinatorial genetic algorithms , 2018 .

[35]  Yoshua Bengio,et al.  DEFactor: Differentiable Edge Factorization-based Probabilistic Graph Generation , 2018, ArXiv.

[36]  M. Scheffler,et al.  Insightful classification of crystal structures using deep learning , 2017, Nature Communications.

[37]  Regina Barzilay,et al.  Junction Tree Variational Autoencoder for Molecular Graph Generation , 2018, ICML.

[38]  Alán Aspuru-Guzik,et al.  Inverse Design of Solid-State Materials via a Continuous Representation , 2019, Matter.

[39]  Niloy Ganguly,et al.  NeVAE: A Deep Generative Model for Molecular Graphs , 2018, AAAI.

[40]  Brian L. DeCost,et al.  Accelerated Development of Perovskite-Inspired Materials via High-Throughput Synthesis and Machine-Learning Diagnosis , 2018, Joule.

[41]  Nataliya Sokolovska,et al.  CrystalGAN: Learning to Discover Crystallographic Structures with Generative Adversarial Networks , 2018, AAAI Spring Symposium: Combining Machine Learning with Knowledge Engineering.

[42]  Koji Morikawa,et al.  Study of Deep Generative Models for Inorganic Chemical Compositions , 2019, ArXiv.

[43]  Yoshua Bengio,et al.  Data-Driven Approach to Encoding and Decoding 3-D Crystal Structures , 2019, ArXiv.

[44]  Aniketa Shinde,et al.  Unveiling new stable manganese based photoanode materials via theoretical high-throughput screening and experiments. , 2019, Chemical communications.

[45]  Artem R. Oganov,et al.  Structure prediction drives materials discovery , 2019, Nature Reviews Materials.

[46]  Ting Hu,et al.  Deep Learning on Point Clouds and Its Application: A Survey , 2019, Sensors.

[47]  Xiang Li,et al.  Generative adversarial networks (GAN) based efficient sampling of chemical composition space for inverse design of inorganic materials , 2019, npj Computational Materials.

[48]  Chi Chen,et al.  Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals , 2018, Chemistry of Materials.

[49]  Anubhav Jain,et al.  Local structure order parameters and site fingerprints for quantification of coordination environment and crystal structure similarity , 2020 .

[50]  Sungwon Kim,et al.  Generative Adversarial Networks for Crystal Structure Prediction , 2020, ACS central science.

[51]  Jianjun Hu,et al.  Generative adversarial networks (GAN) based efficient sampling of chemical composition space for inverse design of inorganic materials , 2019, npj Computational Materials.

[52]  Sungwon Kim,et al.  Machine-enabled inverse design of inorganic solid materials: promises and challenges , 2020, Chemical science.