Machine learning with force-field inspired descriptors for materials: fast screening and mapping energy landscape.

We present a complete set of chemo-structural descriptors to significantly extend the applicability of machine-learning (ML) in material screening and mapping energy landscape for multicomponent systems. These new descriptors allow differentiating between structural prototypes, which is not possible using the commonly used chemical-only descriptors. Specifically, we demonstrate that the combination of pairwise radial, nearest neighbor, bond-angle, dihedral-angle and core-charge distributions plays an important role in predicting formation energies, bandgaps, static refractive indices, magnetic properties, and modulus of elasticity for three-dimensional (3D) materials as well as exfoliation energies of two-dimensional (2D) layered materials. The training data consists of 24549 bulk and 616 monolayer materials taken from JARVIS-DFT database. We obtained very accurate ML models using gradient boosting algorithm. Then we use the trained models to discover exfoliable 2D-layered materials satisfying specific property requirements. Additionally, we integrate our formation energy ML model with a genetic algorithm for structure search to verify if the ML model reproduces the DFT convex hull. This verification establishes a more stringent evaluation metric for the ML model than what commonly used in data sciences. Our learnt model is publicly available on the JARVIS-ML website (https://www.ctcms.nist.gov/jarvisml) property predictions of generalized materials.

[1]  E. Benavente,et al.  Intercalation chemistry of molybdenum disulfide , 2002 .

[2]  Louis G. Birta,et al.  Modelling and Simulation , 2013, Simulation Foundations, Methods and Applications.

[3]  Gowoon Cheon,et al.  Data Mining for New Two- and One-Dimensional Weakly Bonded Solids and Lattice-Commensurate Heterostructures. , 2017, Nano letters.

[4]  Cormac Toher,et al.  Charting the complete elastic properties of inorganic crystalline compounds , 2015, Scientific Data.

[5]  Kamal Choudhary,et al.  Elastic properties of bulk and low-dimensional materials using Van der Waals density functional. , 2018, Physical review. B.

[6]  Svetlozar Nestorov,et al.  The Computational Materials Repository , 2012, Computing in Science & Engineering.

[7]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[8]  Jeffrey C Grossman,et al.  Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material Properties. , 2017, Physical review letters.

[9]  Tzu-Ray Shan,et al.  Classical atomistic simulations of surfaces and heterogeneous interfaces with the charge-optimized many body (COMB) potentials , 2013 .

[10]  Kamal Choudhary,et al.  Computational screening of high-performance optoelectronic materials using OptB88vdW and TB-mBJ formalisms , 2018, Scientific data.

[11]  O. Borodin,et al.  ReaxFF molecular dynamics simulations on lithiated sulfur cathode materials. , 2015, Physical chemistry chemical physics : PCCP.

[12]  R. Sarpong,et al.  Bio-inspired synthesis of xishacorenes A, B, and C, and a new congener from fuscol† †Electronic supplementary information (ESI) available. See DOI: 10.1039/c9sc02572c , 2019, Chemical science.

[13]  Kamal Choudhary,et al.  High-throughput Identification and Characterization of Two-dimensional Materials using Density functional theory , 2017, Scientific Reports.

[14]  Alok Choudhary,et al.  Including crystal structure attributes in machine learning models of formation energies via Voronoi tessellations , 2017 .

[15]  Kristin A. Persson,et al.  Commentary: The Materials Project: A materials genome approach to accelerating materials innovation , 2013 .

[16]  Richard G Hennig,et al.  A grand canonical genetic algorithm for the prediction of multi-component phase diagrams and testing of empirical potentials , 2013, Journal of physics. Condensed matter : an Institute of Physics journal.

[17]  Alexander Lindmaa,et al.  Theoretical prediction of properties of atomistic systems : Density functional theory and machine learning , 2017 .

[18]  G. Pilania,et al.  Machine learning bandgaps of double perovskites , 2016, Scientific Reports.

[19]  S. Curtarolo,et al.  AFLOW: An automatic framework for high-throughput materials discovery , 2012, 1308.5715.

[20]  A. V. Duin,et al.  ReaxFF: A Reactive Force Field for Hydrocarbons , 2001 .

[21]  Alok Choudhary,et al.  A General-Purpose Machine Learning Framework for Predicting Properties of Inorganic Materials , 2016 .

[22]  Atsuto Seko,et al.  Representation of compounds for machine-learning prediction of physical properties , 2016, 1611.08645.

[23]  Muratahan Aykol,et al.  Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD) , 2013 .

[24]  J. Banavar,et al.  Computer Simulation of Liquids , 1988 .

[25]  Cormac Toher,et al.  Universal fragment descriptors for predicting properties of inorganic crystals , 2016, Nature Communications.

[26]  F. Jona,et al.  Structural properties of copper , 2001 .

[27]  Nicola Nosengo,et al.  Can artificial intelligence create the next wonder material? , 2016, Nature.

[28]  A. van de Walle,et al.  Institute of Physics Publishing Modelling and Simulation in Materials Science and Engineering Self-driven Lattice-model Monte Carlo Simulations of Alloy Thermodynamic Properties and Phase Diagrams , 2002 .

[29]  R. Kondor,et al.  On representing chemical environments , 2012, 1209.3140.

[30]  G. Ozin,et al.  Colloidal synthesis of 1T-WS2 and 2H-WS2 nanosheets: applications for photocatalytic hydrogen evolution. , 2014, Journal of the American Chemical Society.

[31]  Paolo Ruggerone,et al.  Computational Materials Science X , 2002 .

[32]  Kristof T. Schütt,et al.  How to represent crystal structures for machine learning: Towards fast prediction of electronic properties , 2013, 1307.1266.

[33]  A. D. Corso Pseudopotentials periodic table: From H to Pu , 2014 .

[34]  O. A. von Lilienfeld,et al.  Communication: Understanding molecular representations in machine learning: The role of uniqueness and target similarity. , 2016, The Journal of chemical physics.

[35]  Anders S. Christensen,et al.  Alchemical and structural distribution based representation for universal quantum machine learning. , 2017, The Journal of chemical physics.

[36]  Felix A Faber,et al.  Crystal structure representations for machine learning models of formation energies , 2015, 1503.07406.

[37]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[38]  Qiang Sun,et al.  Structures and Phase Transition of a MoS2 Monolayer , 2014 .

[39]  Kiyoyuki Terakura,et al.  Machine learning reveals orbital interaction in materials , 2017, Science and technology of advanced materials.

[40]  K. Müller,et al.  Fast and accurate modeling of molecular atomization energies with machine learning. , 2011, Physical review letters.

[41]  Wei Chen,et al.  A Statistical Learning Framework for Materials Science: Application to Elastic Moduli of k-nary Inorganic Polycrystalline Compounds , 2016, Scientific Reports.

[42]  Kamal Choudhary,et al.  Dynamical properties of AlN nanostructures and heterogeneous interfaces predicted using COMB potentials , 2016 .

[43]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[44]  Junmei Wang,et al.  Development and testing of a general amber force field , 2004, J. Comput. Chem..

[45]  E. Reed,et al.  Elastic properties of bulk and low-dimensional materials using OptB88vdW functional in density functional theory , 2018 .