Simulation and design of energy materials accelerated by machine learning

In the light of mature mathematical algorithms and material database construction, a basic research framework of machine learning (ML) method integrated with computational chemistry toolkits exhibits great potentials and advantages in the field of material researches. In this review, we introduce a work flow of ML in energy materials and demonstrate its recent applications in accelerating the material exploration, especially significant progresses in designing novel catalysts, organic and inorganic battery materials and metal–organic framework materials. As a rising research direction, we also identify the prospects and challenges of ML. More automated and intelligent workflows will be widely used in energy material design with the development of ML. Our review provides a guideline to study and design energy materials in the framework of ML.

[1]  Thuc‐Quyen Nguyen,et al.  Molecular solubility and hansen solubility parameters for the analysis of phase separation in bulk heterojunctions , 2012 .

[2]  A. Helmy,et al.  Multilayer Black Phosphorus as a Versatile Mid-Infrared Electro-optic Material. , 2015, Nano letters.

[3]  S. Jang,et al.  High-Density Lithium-Ion Energy Storage Utilizing the Surface Redox Reactions in Folded Graphene Films , 2015 .

[4]  Kristin A. Persson,et al.  Commentary: The Materials Project: A materials genome approach to accelerating materials innovation , 2013 .

[5]  Luke E K Achenie,et al.  Machine-Learning-Augmented Chemisorption Model for CO2 Electroreduction Catalyst Screening. , 2015, The journal of physical chemistry letters.

[6]  Zachary W. Ulissi,et al.  Active learning across intermetallics to guide discovery of electrocatalysts for CO2 reduction and H2 evolution , 2018, Nature Catalysis.

[7]  Youyong Li,et al.  Narrow bandgap conjugated polymers based on a high-mobility polymer template for visibly transparent photovoltaic devices , 2016 .

[8]  C. Brodley,et al.  Decision tree classification of land cover from remotely sensed data , 1997 .

[9]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[10]  M. Saidaminov,et al.  Making and Breaking of Lead Halide Perovskites. , 2016, Accounts of chemical research.

[11]  Ryther Anderson,et al.  Role of Pore Chemistry and Topology in the CO2 Capture Capabilities of MOFs: From Molecular Simulation to Machine Learning , 2018, Chemistry of Materials.

[12]  Anubhav Jain,et al.  The Materials Application Programming Interface (API): A simple, flexible and efficient API for materials data based on REpresentational State Transfer (REST) principles , 2015 .

[13]  Vladan Stevanović,et al.  TE Design Lab: A virtual laboratory for thermoelectric material design , 2016 .

[14]  Xiaogang Wang,et al.  Deep Learning Face Representation from Predicting 10,000 Classes , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Sadayuki Tsugawa,et al.  Vision-based vehicles in Japan: machine vision systems and driving control systems , 1994, IEEE Trans. Ind. Electron..

[16]  Youyong Li,et al.  Adsorption and catalytic decomposition of hydrazine on metal-free SiC3 siligraphene , 2019, Applied Surface Science.

[17]  M. Ratner,et al.  Bithiopheneimide-dithienosilole/dithienogermole copolymers for efficient solar cells: information from structure-property-device performance correlations and comparison to thieno[3,4-c]pyrrole-4,6-dione analogues. , 2012, Journal of the American Chemical Society.

[18]  George Eastman House,et al.  Sparse Bayesian Learning and the Relevan e Ve tor Ma hine , 2001 .

[19]  Youyong Li,et al.  Efficient Polymer Solar Cells with a High Open Circuit Voltage of 1 Volt , 2013 .

[20]  D. Zhao,et al.  Design and generation of extended zeolitic metal-organic frameworks (ZMOFs): synthesis and crystal structures of zinc(II) imidazolate polymers with zeolitic topologies. , 2007, Chemistry.

[21]  Shyue Ping Ong,et al.  Deep neural networks for accurate predictions of crystal stability , 2017, Nature Communications.

[22]  Youyong Li,et al.  Two-Dimensional MnO2 as a Better Cathode Material for Lithium Ion Batteries , 2015 .

[23]  S. Jang,et al.  Thermodynamic and redox properties of graphene oxides for lithium-ion battery applications: a first principles density functional theory modeling approach. , 2016, Physical chemistry chemical physics : PCCP.

[24]  Hieu Chi Dam,et al.  Novel mixture model for the representation of potential energy surfaces. , 2016, The Journal of chemical physics.

[25]  Ekin D Cubuk,et al.  Metallic Metal-Organic Frameworks Predicted by the Combination of Machine Learning Methods and Ab Initio Calculations. , 2018, The journal of physical chemistry letters.

[26]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[27]  D. Aurbach Review of selected electrode–solution interactions which determine the performance of Li and Li ion batteries , 2000 .

[28]  Søren Dahl,et al.  The Brønsted-Evans-Polanyi relation and the volcano plot for ammonia synthesis over transition metal catalysts , 2001 .

[29]  Miguel A. L. Marques,et al.  Predicting the Thermodynamic Stability of Solids Combining Density Functional Theory and Machine Learning , 2017 .

[30]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[31]  O. A. von Lilienfeld,et al.  Communication: Understanding molecular representations in machine learning: The role of uniqueness and target similarity. , 2016, The Journal of chemical physics.

[32]  Meng-Che Tsai,et al.  Organometal halide perovskite solar cells: degradation and stability , 2016 .

[33]  Jean-Louis Reymond,et al.  Enumeration of 166 Billion Organic Small Molecules in the Chemical Universe Database GDB-17 , 2012, J. Chem. Inf. Model..

[34]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[35]  Charlie Tsai,et al.  Scaling Relations for Adsorption Energies on Doped Molybdenum Phosphide Surfaces , 2017 .

[36]  Luke E K Achenie,et al.  High-throughput screening of bimetallic catalysts enabled by machine learning , 2017 .

[37]  Qing-You Zhang,et al.  Machine Learning Methods to Predict Density Functional Theory B3LYP Energies of HOMO and LUMO Orbitals , 2017, J. Chem. Inf. Model..

[38]  Omar M Yaghi,et al.  Metal-organic frameworks with exceptionally high capacity for storage of carbon dioxide at room temperature. , 2005, Journal of the American Chemical Society.

[39]  Boris Kozinsky,et al.  AiiDA: Automated Interactive Infrastructure and Database for Computational Science , 2015, ArXiv.

[40]  K. Lu,et al.  Semiconductor Metal–Organic Frameworks: Future Low‐Bandgap Materials , 2017, Advanced materials.

[41]  S. Frick,et al.  Compressed Sensing , 2014, Computer Vision, A Reference Guide.

[42]  Andrew A Peterson,et al.  Acceleration of saddle-point searches with machine learning. , 2016, The Journal of chemical physics.

[43]  Itsuki Miyazato,et al.  Rapid estimation of activation energy in heterogeneous catalytic reactions via machine learning , 2018, J. Comput. Chem..

[44]  Shinji Nagasawa,et al.  Computer-Aided Screening of Conjugated Polymers for Organic Solar Cell: Classification by Random Forest. , 2018, The journal of physical chemistry letters.

[45]  G. Henkelman,et al.  A climbing image nudged elastic band method for finding saddle points and minimum energy paths , 2000 .

[46]  Yang Yang,et al.  Low-Bandgap Near-IR Conjugated Polymers/Molecules for Organic Electronics. , 2015, Chemical reviews.

[47]  H. Jónsson,et al.  Nudged elastic band method for finding minimum energy paths of transitions , 1998 .

[48]  Alán Aspuru-Guzik,et al.  The Harvard Clean Energy Project: Large-Scale Computational Screening and Design of Organic Photovoltaics on the World Community Grid , 2011 .

[49]  George E. Dahl,et al.  Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error. , 2017, Journal of chemical theory and computation.

[50]  Vanchiappan Aravindan,et al.  Lithium-ion conducting electrolyte salts for lithium batteries. , 2011, Chemistry.

[51]  Maxim Ziatdinov,et al.  Learning surface molecular structures via machine vision , 2017, npj Computational Materials.

[52]  Thomas Bligaard,et al.  Trends in the exchange current for hydrogen evolution , 2005 .

[53]  P. Sautet,et al.  Fast prediction of selectivity in heterogeneous catalysis from extended Brønsted-Evans-Polanyi relations: a theoretical insight. , 2009, Angewandte Chemie.

[54]  G. Henkelman,et al.  Improved tangent estimate in the nudged elastic band method for finding minimum energy paths and saddle points , 2000 .

[55]  S. Jang,et al.  Application of DFT-based machine learning for developing molecular electrode materials in Li-ion batteries , 2018, RSC advances.

[56]  J. Nørskov,et al.  Scaling relationships for adsorption energies on transition metal oxide, sulfide, and nitride surfaces. , 2008, Angewandte Chemie.

[57]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[58]  Christoph J. Brabec,et al.  Introducing a New Potential Figure of Merit for Evaluating Microstructure Stability in Photovoltaic Polymer-Fullerene Blends , 2017 .

[59]  J. E. Lee,et al.  Self-Assembled Dendritic Pt Nanostructure with High-Index Facets as Highly Active and Durable Electrocatalyst for Oxygen Reduction. , 2017, ChemSusChem.

[60]  Xunjin Zhu,et al.  Molecular engineering of simple phenothiazine-based dyes to modulate dye aggregation, charge recombination, and dye regeneration in highly efficient dye-sensitized solar cells. , 2014, Chemistry.

[61]  Vladan Stevanović,et al.  Correcting Density Functional Theory for Accurate Predictions of Compound Enthalpies of Formation:Fitted elemental-phase Reference Energies (FERE) , 2012 .

[62]  Lei Wang,et al.  Li−Fe−P−O2 Phase Diagram from First Principles Calculations , 2008 .

[63]  J. Friedman Stochastic gradient boosting , 2002 .

[64]  Alessandro Troisi,et al.  Combining electronic and structural features in machine learning models to predict organic solar cells properties , 2019, Materials Horizons.

[65]  Youyong Li,et al.  Two-dimensional π-conjugated metal-organic nanosheets as single-atom catalysts for the hydrogen evolution reaction. , 2019, Nanoscale.

[66]  Adalberto Fazzio,et al.  Switching a normal insulator into a topological insulator via electric field with application to phosphorene. , 2015, Nano letters.

[67]  Clay B. Holroyd,et al.  The neural basis of human error processing: reinforcement learning, dopamine, and the error-related negativity. , 2002, Psychological review.

[68]  John J. Irwin,et al.  ZINC 15 – Ligand Discovery for Everyone , 2015, J. Chem. Inf. Model..

[69]  Tom K Woo,et al.  Rapid and Accurate Machine Learning Recognition of High Performing Metal Organic Frameworks for CO2 Capture. , 2014, The journal of physical chemistry letters.

[70]  A. Choudhary,et al.  Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science , 2016 .

[71]  Itsuki Miyazato,et al.  Searching for Hidden Perovskite Materials for Photovoltaic Systems by Combining Data Science and First Principle Calculations , 2018 .

[72]  Ichigaku Takigawa,et al.  Toward Effective Utilization of Methane: Machine Learning Prediction of Adsorption Energies on Metal Alloys , 2018 .

[73]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[74]  Jinsang Kim,et al.  Energy Level Modulation of HOMO, LUMO, and Band‐Gap in Conjugated Polymers for Organic Photovoltaic Applications , 2013 .

[75]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[76]  S. Curtarolo,et al.  AFLOW: An automatic framework for high-throughput materials discovery , 2012, 1308.5715.

[77]  Shubin Liu,et al.  Highly porous and stable metal–organic frameworks for uranium extraction , 2013 .

[78]  W. Kim,et al.  Feasibility of Activation Energy Prediction of Gas-Phase Reactions by Machine Learning. , 2018, Chemistry.

[79]  Bin Liu,et al.  A p-type Ti(IV)-based metal-organic framework with visible-light photo-response. , 2014, Chemical communications.

[80]  Ana M. R. Senos,et al.  Digital tools for material selection in product design , 2010 .

[81]  P. Luksch,et al.  New developments in the Inorganic Crystal Structure Database (ICSD): accessibility in support of materials research and design. , 2002, Acta crystallographica. Section B, Structural science.

[82]  Cormac Toher,et al.  Universal fragment descriptors for predicting properties of inorganic crystals , 2016, Nature Communications.

[83]  Beibei Xu,et al.  Photoactive layer based on T-shaped benzimidazole dyes used for solar cell: from photoelectric properties to molecular design , 2017, Scientific Reports.

[84]  Jörg Ackermann,et al.  Design of organic semiconductors: tuning the electronic properties of pi-conjugated oligothiophenes with the 3,4-ethylenedioxythiophene (EDOT) building block. , 2005, Chemistry.

[85]  M. Wohlfahrt‐Mehrens,et al.  Ageing mechanisms in lithium-ion batteries , 2005 .

[86]  A. Troisi,et al.  Toward Predicting Efficiency of Organic Solar Cells via Machine Learning and Improved Descriptors , 2018, Advanced Energy Materials.

[87]  Shou-Cheng Zhang,et al.  Learning atoms for materials discovery , 2018, Proceedings of the National Academy of Sciences.

[88]  Anubhav Jain,et al.  Python Materials Genomics (pymatgen): A robust, open-source python library for materials analysis , 2012 .

[89]  Michael O’Keeffe,et al.  The Chemistry and Applications of Metal-Organic Frameworks , 2013, Science.

[90]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[91]  Mike Preuss,et al.  Planning chemical syntheses with deep neural networks and symbolic AI , 2017, Nature.

[92]  H. Jónsson,et al.  Origin of the Overpotential for Oxygen Reduction at a Fuel-Cell Cathode. , 2004, The journal of physical chemistry. B.

[93]  Paul Raccuglia,et al.  Machine-learning-assisted materials discovery using failed experiments , 2016, Nature.

[94]  Mikkel N. Schmidt,et al.  Machine learning-based screening of complex molecules for polymer solar cells. , 2018, The Journal of chemical physics.

[95]  Thomas Bligaard,et al.  Trends in the catalytic CO oxidation activity of nanoparticles. , 2008, Angewandte Chemie.

[96]  Yao Liu,et al.  NDI‐Based Small Molecule as Promising Nonfullerene Acceptor for Solution‐Processed Organic Photovoltaics , 2015 .

[97]  Youyong Li,et al.  First-Principles Insight into Electrocatalytic Reduction of CO2 to CH4 on a Copper Nanoparticle , 2018 .

[98]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[99]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[100]  Ryosuke Jinnouchi,et al.  Predicting Catalytic Activity of Nanoparticles by a DFT-Aided Machine-Learning Algorithm. , 2017, The journal of physical chemistry letters.

[101]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[102]  Joshua H. Carpenter,et al.  High‐Efficiency Nonfullerene Organic Solar Cells: Critical Factors that Affect Complex Multi‐Length Scale Morphology and Device Performance , 2017 .

[103]  Yanhui Yang,et al.  Non-Fullerene Acceptor-Based Solar Cells: From Structural Design to Interface Charge Separation and Charge Transport , 2017, Polymers.

[104]  K. Müller,et al.  Machine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical Space , 2015, The journal of physical chemistry letters.

[105]  Alok Choudhary,et al.  Including crystal structure attributes in machine learning models of formation energies via Voronoi tessellations , 2017 .

[106]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[107]  M. Sigman,et al.  The human Turing machine: a neural framework for mental programs , 2011, Trends in Cognitive Sciences.

[108]  Jinlan Wang,et al.  Accelerated discovery of stable lead-free hybrid organic-inorganic perovskites via machine learning , 2018, Nature Communications.

[109]  S. Jang,et al.  Self-polymerized dopamine as an organic cathode for Li- and Na-ion batteries , 2017 .

[110]  Maciej Haranczyk,et al.  In silico design of porous polymer networks: high-throughput screening for methane storage materials. , 2014, Journal of the American Chemical Society.

[111]  R. Dennis Cook,et al.  Cross-Validation of Regression Models , 1984 .

[112]  Youyong Li,et al.  MoS2 supported single platinum atoms and their superior catalytic activity for CO oxidation: a density functional theory study , 2015 .

[113]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[114]  Shengbo Zhang A review on electrolyte additives for lithium-ion batteries , 2006 .

[115]  Aki Vehtari,et al.  Nudged elastic band calculations accelerated with Gaussian process regression. , 2017, The Journal of chemical physics.

[116]  Youyong Li,et al.  Understanding the Effect of Ligands on C2H2 Storage and C2H2/CH4, C2H2/CO2 Separation in Metal–Organic Frameworks with Open Cu(II) Sites , 2017 .

[117]  Dawei Qi,et al.  Fullerene-Based Photoactive Layers for Heterojunction Solar Cells: Structure, Absorption Spectra and Charge Transfer Process , 2014, Materials.

[118]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[119]  Michael Walter,et al.  The atomic simulation environment-a Python library for working with atoms. , 2017, Journal of physics. Condensed matter : an Institute of Physics journal.

[120]  Nathan S. Lewis,et al.  Machine-Learning Methods Enable Exhaustive Searches for Active Bimetallic Facets and Reveal Active Site Motifs for CO2 Reduction , 2017 .

[121]  C. Wilmer,et al.  Large-scale screening of hypothetical metal-organic frameworks. , 2012, Nature chemistry.

[122]  Jeff Dahn,et al.  Comparative thermal stability of carbon intercalation anodes and lithium metal anodes for rechargeable lithium batteries , 1994 .

[123]  Thomas Bligaard,et al.  From the Sabatier principle to a predictive theory of transition-metal heterogeneous catalysis , 2015 .

[124]  Youyong Li,et al.  Structure, band gap and energy level modulations for obtaining efficient materials in inverted polymer solar cells , 2013 .

[125]  Ekin D. Cubuk,et al.  Holistic computational structure screening of more than 12 000 candidates for solid lithium-ion conductor materials , 2017 .

[126]  Ashraf Uddin,et al.  Stability of perovskite solar cells , 2016 .

[127]  Alireza Khorshidi,et al.  Amp: A modular approach to machine learning in atomistic simulations , 2016, Comput. Phys. Commun..