Machine learning in experimental materials chemistry

Abstract The development of advanced materials is an important aspect of modern life. However, the discovery of novel materials involves searching the vast chemical space to find materials with desired properties. Recent developments in the applications of Machine Learning (ML) in materials chemistry show promise to accelerate the material discovery process. In this perspective article, we highlight the importance of ML in materials chemistry. We discuss few examples of ML applications in synthesis, characterization, and predicting activities of materials. Finally, we discuss the challenges in this field and how the progress in ML in chemistry is leveraged together with advanced robotics to perform automated optimization of material discovery.

[1]  F. Arnaud,et al.  From core referencing to data re-use: two French national initiatives to reinforce paleodata stewardship (National Cyber Core Repository and LTER France Retro-Observatory) , 2017 .

[2]  P. Rinke,et al.  Data‐Driven Materials Science: Status, Challenges, and Perspectives , 2019, Advanced science.

[3]  Stefan Palkovits A Primer about Machine Learning in Catalysis – A Tutorial with Code , 2020 .

[4]  W. Park,et al.  A deep-learning technique for phase identification in multiphase inorganic compounds using synthetic XRD powder patterns , 2020, Nature Communications.

[5]  Tadashi Hattori,et al.  Neural network as a tool for catalyst development , 1995 .

[6]  Hoo-Chang Hoo-Chang Shin Shin,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, Ieee Transactions on Medical Imaging.

[7]  S. Nishimura,et al.  Data-Driven Identification of Reaction Network in Oxidative Coupling of Methane Reaction via Experimental Data. , 2020, The journal of physical chemistry letters.

[8]  Andrew J. Medford,et al.  Extracting Knowledge from Data through Catalysis Informatics , 2018, ACS Catalysis.

[9]  A. McCallum,et al.  Materials Synthesis Insights from Scientific Literature via Text Extraction and Machine Learning , 2017 .

[10]  Volker L. Deringer,et al.  Machine Learning Interatomic Potentials as Emerging Tools for Materials Science , 2019, Advanced materials.

[11]  J. E. Gubernatis,et al.  Machine learning in materials design and discovery: Examples from the present and suggestions for the future , 2018, Physical Review Materials.

[12]  Christopher J. Bartel,et al.  Machine learning for heterogeneous catalyst design and discovery , 2018 .

[13]  Randal S. Olson,et al.  TPOT: A Tree-based Pipeline Optimization Tool for Automating Machine Learning , 2016, AutoML@ICML.

[14]  Qianxiao Li,et al.  Challenges and opportunities of polymer design with machine learning and high throughput experimentation , 2019, MRS Communications.

[15]  R. B. Sunoj,et al.  A unified machine-learning protocol for asymmetric catalysis as a proof of concept demonstration using asymmetric hydrogenation , 2020, Proceedings of the National Academy of Sciences.

[16]  Zachary W. Ulissi,et al.  Active learning across intermetallics to guide discovery of electrocatalysts for CO2 reduction and H2 evolution , 2018, Nature Catalysis.

[17]  Connor W. Coley,et al.  Machine Learning in Computer-Aided Synthesis Planning. , 2018, Accounts of chemical research.

[18]  Klavs F. Jensen,et al.  Autonomous discovery in the chemical sciences part I: Progress , 2020, Angewandte Chemie.

[19]  Michael J. Keiser,et al.  Adversarial Controls for Scientific Machine Learning. , 2018, ACS chemical biology.

[20]  Alán Aspuru-Guzik,et al.  ChemOS: Orchestrating autonomous experimentation , 2018, Science Robotics.

[21]  Huiyu Zhou,et al.  Using deep neural network with small dataset to predict material defects , 2019, Materials & Design.

[22]  M. Marques,et al.  Recent advances and applications of machine learning in solid-state materials science , 2019, npj Computational Materials.

[23]  David G Lidzey,et al.  Mapping Polymer Molecular Order in the SEM with Secondary Electron Hyperspectral Imaging , 2019, Advanced science.

[24]  A. Bahadori,et al.  Evolving an Accurate Decision Tree‐Based Model for Predicting Carbon Dioxide Solubility in Polymers , 2020 .

[25]  Vishwesh Venkatraman,et al.  Predicting ionic liquid melting points using machine learning , 2018, Journal of Molecular Liquids.

[26]  Jakoah Brgoch,et al.  Disentangling Structural Confusion through Machine Learning: Structure Prediction and Polymorphism of Equiatomic Ternary Phases ABC. , 2017, Journal of the American Chemical Society.

[27]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[28]  S. Nishimura,et al.  Data Driven Determination of Reaction Conditions in Oxidative Coupling of Methane via Machine Learning , 2019, ChemCatChem.

[29]  Maciej Haranczyk,et al.  Capturing chemical intuition in synthesis of metal-organic frameworks , 2019, Nature Communications.

[30]  Alán Aspuru-Guzik,et al.  ChemOS: An orchestration software to democratize autonomous discovery , 2020, PloS one.

[31]  A. Choudhary,et al.  Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science , 2016 .

[32]  Frank R Burden,et al.  Quantitative structure-property relationship modeling of diverse materials properties. , 2012, Chemical reviews.

[33]  Hagar I. Labouta,et al.  Meta-Analysis of Nanoparticle Cytotoxicity via Data-Mining the Literature. , 2019, ACS nano.

[34]  R. Asahi,et al.  Phosphorescent Material Search Using a Combination of High-Throughput Evaluation and Machine Learning. , 2019, Inorganic chemistry.

[35]  G. Schneider,et al.  Concepts of Artificial Intelligence for Computer-Assisted Drug Discovery. , 2019, Chemical reviews.

[36]  Takashi Kamachi,et al.  Machine Learning for Catalysis Informatics: Recent Applications and Prospects , 2020 .

[37]  D. Winkler,et al.  Discovery and Optimization of Materials Using Evolutionary Approaches. , 2016, Chemical reviews.

[38]  Paul Raccuglia,et al.  Machine-learning-assisted materials discovery using failed experiments , 2016, Nature.

[39]  J. Aires-de-Sousa,et al.  Machine learning to predict the specific optical rotations of chiral fluorinated molecules. , 2019, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[40]  Itsuki Miyazato,et al.  High-Throughput Experimentation and Catalyst Informatics for Oxidative Coupling of Methane , 2020, ACS Catalysis.

[41]  Watcharop Chaikittisilp,et al.  Linking synthesis and structure descriptors from a large collection of synthetic records of zeolite materials , 2019, Nature Communications.

[42]  Christopher Wolverton,et al.  Accelerated discovery of metallic glasses through iteration of machine learning and high-throughput experiments , 2018, Science Advances.

[43]  Michael A. Webb,et al.  Recent advances in machine learning towards multiscale soft materials design , 2019, Current Opinion in Chemical Engineering.

[44]  Travis Williams,et al.  Enabling Catalyst Discovery through Machine Learning and High-Throughput Experimentation , 2020 .

[45]  Anton O Oliynyk,et al.  Discovery of Intermetallic Compounds from Traditional to Machine-Learning Approaches. , 2018, Accounts of chemical research.

[46]  Z. Deng,et al.  A Critical Review of Machine Learning of Energy Materials , 2020, Advanced Energy Materials.

[47]  H. Nakai,et al.  Virtual Reaction Condition Optimization based on Machine Learning for a Small Number of Experiments in High-dimensional Continuous and Discrete Variables , 2019, Chemistry Letters.

[48]  Gadi Rothenberg,et al.  Data mining in catalysis: Separating knowledge from garbage , 2008 .

[49]  Seokho Kang,et al.  Deep-learning-based inverse design model for intelligent discovery of organic molecules , 2018, npj Computational Materials.

[50]  A. Frenkel,et al.  “Inverting” X-ray Absorption Spectra of Catalysts by Machine Learning in Search for Activity Descriptors , 2019, ACS Catalysis.

[51]  Yuzuru Tanaka,et al.  The Rise of Catalyst Informatics: Towards Catalyst Genomics , 2019, ChemCatChem.

[52]  A. Frenkel,et al.  Subnanometer Substructures in Nanoassemblies Formed from Clusters under a Reactive Atmosphere Revealed Using Machine Learning , 2018, The Journal of Physical Chemistry C.

[53]  Jakoah Brgoch,et al.  Predicting the Band Gaps of Inorganic Solids by Machine Learning. , 2018, The journal of physical chemistry letters.

[54]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[55]  Lawrence A. Adutwum,et al.  How To Optimize Materials and Devices via Design of Experiments and Machine Learning: Demonstration Using Organic Photovoltaics. , 2018, ACS nano.

[56]  Yue Liu,et al.  Materials discovery and design using machine learning , 2017 .

[57]  Tim Mueller,et al.  Machine learning for interatomic potential models. , 2020, The Journal of chemical physics.

[58]  Reiner Sebastian Sprick,et al.  A mobile robotic chemist , 2020, Nature.

[59]  Andreas Heyden,et al.  A Multiple Filter Based Neural Network Approach to the Extrapolation of Adsorption Energies on Metal Surfaces for Catalysis Applications , 2020, Journal of chemical theory and computation.