Data-centric science for materials innovation

© 2018 Materials Research Society MRS BULLETIN • VOLUME 43 • SEPTEMBER 2018 • www.mrs.org/bulletin Data-intensive scientifi c discovery The challenges of dealing with the rapid growth of data in materials science-related fi elds has long been recognized. 1 – 3 With more recent advances in computer science, the tools for advancing data-intensive scientifi c discovery have opened the door for more engagement from the scientifi c community. As suggested by Gray, this has created “The Fourth Paradigm: Data-Intensive Scientifi c Discovery.” 4 He pointed out that experimental, theoretical, and computational science were all being affected by the data deluge, and a fourth “data-intensive” science paradigm was emerging. Indeed, we are witnessing materials science being greatly affected in the new era of “datacentric” materials science, which will likely become the new paradigm for materials research and education. For more than a decade, MRS Bulletin has published issues related to the nexus of data science and materials science, including materials informatics 5 and microstructural informatics. 6 In this issue, we continue to expand on those themes by focusing on the numerous efforts in developing and utilizing databases of electronic structure calculations, and their impact on addressing different classes of problems in materials science.

[1]  Liming Chen,et al.  Predicting the stability of ternary intermetallics with density functional theory and machine learning. , 2018, The Journal of chemical physics.

[2]  John D. Perkins,et al.  An open experimental database for exploring inorganic materials , 2018, Scientific Data.

[3]  Yanchao Wang,et al.  Crystal structure prediction via particle-swarm optimization , 2010 .

[4]  Marco Buongiorno Nardelli,et al.  The high-throughput highway to computational materials design. , 2013, Nature materials.

[5]  Ryan O'Hayre,et al.  Predicting density functional theory total energies and enthalpies of formation of metal-nonmetal compounds by linear regression , 2016 .

[6]  Cormac Toher,et al.  Universal fragment descriptors for predicting properties of inorganic crystals , 2016, Nature Communications.

[7]  Special quasirandom structure in heterovalent ionic systems , 2014, 1408.6875.

[8]  Alok Choudhary,et al.  Combinatorial screening for new materials in unconstrained composition space with machine learning , 2014 .

[9]  Krishna Rajan,et al.  Informatics derived materials databases for multifunctional properties , 2015, Science and technology of advanced materials.

[10]  Ian Foster,et al.  Strategies for accelerating the adoption of materials informatics , 2018, MRS Bulletin.

[11]  Muratahan Aykol,et al.  The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies , 2015 .

[12]  Hisashi Kashima,et al.  Matrix- and tensor-based recommender systems for the discovery of currently unknown inorganic compounds , 2017, 1710.00659.

[13]  Krishna Rajan,et al.  Informatics for Materials Science and Engineering: Data-Driven Discovery for Accelerated Experimentation and Application , 2013 .

[14]  Shunsuke Muto,et al.  High Spatial Resolution Hyperspectral Imaging with Machine-Learning Techniques , 2018 .

[15]  Krishna Rajan,et al.  Combinatorial Materials Sciences: Experimental Strategies for Accelerated Knowledge Discovery , 2008 .

[16]  Bryce Meredig,et al.  A hybrid computational-experimental approach for automated crystal structure solution. , 2013, Nature materials.

[17]  Ferreira,et al.  Special quasirandom structures. , 1990, Physical review letters.

[18]  Chiho Kim,et al.  Machine learning in materials informatics: recent applications and prospects , 2017, npj Computational Materials.

[19]  Atsuto Seko,et al.  Compositional descriptor-based recommender system for the materials discovery. , 2017, The Journal of chemical physics.

[20]  G. Pizzi,et al.  Provenance, workflows, and crystallographic tools in materials science: AiiDA, spglib, and seekpath , 2018, MRS Bulletin.

[21]  Krishna Rajan,et al.  Mapping Chemical Selection Pathways for Designing Multicomponent Alloys: an informatics framework for materials design , 2015, Scientific Reports.

[22]  K. Rajan,et al.  A fast hybrid methodology based on machine learning, quantum methods, and experimental measurements for evaluating material properties , 2017 .

[23]  Christopher M Wolverton,et al.  Dissolving the Periodic Table in Cubic Zirconia: Data Mining to Discover Chemical Trends , 2014 .

[24]  Klaus-Robert Müller,et al.  Machine learning of accurate energy-conserving molecular force fields , 2016, Science Advances.

[25]  Logan T. Ward,et al.  Automated crystal structure solution from powder diffraction data: Validation of the first-principles-assisted structure solution method , 2017 .

[26]  S. Broderick,et al.  Topological Data Analysis for the Characterization of Atomic Scale Morphology from Atom Probe Tomography Images , 2018 .

[27]  Alok Choudhary,et al.  Including crystal structure attributes in machine learning models of formation energies via Voronoi tessellations , 2017 .

[28]  Claudia Draxl,et al.  NOMAD: The FAIR concept for big data-driven materials science , 2018, MRS Bulletin.

[29]  Chris J Pickard,et al.  Ab initio random structure searching , 2011, Journal of physics. Condensed matter : an Institute of Physics journal.

[30]  Atsuto Seko,et al.  Representation of compounds for machine-learning prediction of physical properties , 2016, 1611.08645.

[31]  Christopher Wolverton,et al.  Accelerated discovery of metallic glasses through iteration of machine learning and high-throughput experiments , 2018, Science Advances.

[32]  David Feller The role of databases in support of computational chemistry calculations , 1996 .

[33]  Pavlo O. Dral,et al.  Quantum chemistry structures and properties of 134 kilo molecules , 2014, Scientific Data.

[34]  Anubhav Jain,et al.  Harnessing the Materials Project for machine-learning and accelerated discovery , 2018, MRS Bulletin.

[35]  Alexandre Tkatchenko,et al.  Quantum-chemical insights from deep tensor neural networks , 2016, Nature Communications.

[36]  Cormac Toher,et al.  Data-driven design of inorganic materials with the Automatic Flow Framework for Materials Discovery , 2018, MRS Bulletin.

[37]  Stefan Goedecker,et al.  Crystal structure prediction using the minima hopping method. , 2010, The Journal of chemical physics.

[38]  Atsuto Seko,et al.  Progress in nanoinformatics and informational materials science , 2018, MRS Bulletin.

[39]  Diego A. Gómez-Gualdrón,et al.  The materials genome in action: identifying the performance limits for methane storage , 2015 .

[40]  A. Oganov,et al.  Crystal structure prediction using ab initio evolutionary techniques: principles and applications. , 2006, The Journal of chemical physics.

[41]  Alok Choudhary,et al.  A General-Purpose Machine Learning Framework for Predicting Properties of Inorganic Materials , 2016 .

[42]  George E. Dahl,et al.  Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error. , 2017, Journal of chemical theory and computation.

[43]  Anubhav Jain,et al.  Finding Nature’s Missing Ternary Oxide Compounds Using Machine Learning and Density Functional Theory , 2010 .

[44]  Arthur Dalby,et al.  Description of several chemical structure file formats used by computer programs developed at Molecular Design Limited , 1992, J. Chem. Inf. Comput. Sci..