The joint automated repository for various integrated simulations (JARVIS) for data-driven materials design

The Joint Automated Repository for Various Integrated Simulations (JARVIS) is an integrated infrastructure to accelerate materials discovery and design using density functional theory (DFT), classical force-fields (FF), and machine learning (ML) techniques. JARVIS is motivated by the Materials Genome Initiative (MGI) principles of developing open-access databases and tools to reduce the cost and development time of materials discovery, optimization, and deployment. The major features of JARVIS are: JARVIS-DFT, JARVIS-FF, JARVIS-ML, and JARVIS-tools. To date, JARVIS consists of ≈40,000 materials and ≈1 million calculated properties in JARVIS-DFT, ≈500 materials and ≈110 force-fields in JARVIS-FF, and ≈25 ML models for material-property predictions in JARVIS-ML, all of which are continuously expanding. JARVIS-tools provides scripts and workflows for running and analyzing various simulations. We compare our computational data to experiments or high-fidelity computational methods wherever applicable to evaluate error/uncertainty in predictions. In addition to the existing workflows, the infrastructure can support a wide variety of other technologically important applications as part of the data-driven materials design paradigm. The JARVIS datasets and tools are publicly available at the website: https://jarvis.nist.gov .

[1]  C. Brooks Computer simulation of liquids , 1989 .

[2]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[3]  Steve Plimpton,et al.  Fast parallel algorithms for short-range molecular dynamics , 1993 .

[4]  Burke,et al.  Generalized Gradient Approximation Made Simple. , 1996, Physical review letters.

[5]  G. Kresse,et al.  Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set , 1996 .

[6]  Kresse,et al.  Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. , 1996, Physical review. B, Condensed matter.

[7]  Bill Kennedy,et al.  HTML: The Definitive Guide , 1996 .

[8]  K. Burke,et al.  Generalized Gradient Approximation Made Simple [Phys. Rev. Lett. 77, 3865 (1996)] , 1997 .

[9]  T. Saito,et al.  Computational materials design , 1999 .

[10]  P. Luksch,et al.  New developments in the Inorganic Crystal Structure Database (ICSD): accessibility in support of materials research and design. , 2002, Acta crystallographica. Section B, Structural science.

[11]  U. Kattner Phase diagrams for lead-free solder alloys , 2002 .

[12]  David J. Singh,et al.  BoltzTraP. A code for calculating band-structure dependent quantities , 2006, Comput. Phys. Commun..

[13]  C. Marianetti,et al.  Electronic structure calculations with dynamical mean-field theory , 2005, cond-mat/0511085.

[14]  Arash A. Mostofi,et al.  A ug 2 00 7 wannier 90 : A Tool for Obtaining Maximally-Localised Wannier Functions , 2007 .

[15]  N. Marzari,et al.  wannier90: A tool for obtaining maximally-localised Wannier functions , 2007, Comput. Phys. Commun..

[16]  P. Blaha,et al.  Accurate band gaps of semiconductors and insulators with a semilocal exchange-correlation potential. , 2009, Physical review letters.

[17]  D. Sholl,et al.  Density Functional Theory: A Practical Introduction , 2009 .

[18]  Stefano de Gironcoli,et al.  QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials , 2009, Journal of physics. Condensed matter : an Institute of Physics journal.

[19]  D. Bowler,et al.  Chemical accuracy for the van der Waals density functional , 2009, Journal of physics. Condensed matter : an Institute of Physics journal.

[20]  Gerbrand Ceder,et al.  Opportunities and challenges for first-principles materials design and applications to Li battery materials , 2010 .

[21]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[22]  James P. Sethna,et al.  The potential of atomistic simulations and the knowledgebase of interatomic models , 2011 .

[23]  Stefano Curtarolo,et al.  High-throughput combinatorial database of electronic band structures for inorganic scintillator materials. , 2011, ACS combinatorial science.

[24]  Marco Buongiorno Nardelli,et al.  AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations , 2012 .

[25]  Liping Yu,et al.  Identification of potential photovoltaic absorbers based on first-principles spectroscopic screening of materials. , 2012, Physical review letters.

[26]  Anubhav Jain,et al.  Python Materials Genomics (pymatgen): A robust, open-source python library for materials analysis , 2012 .

[27]  Vladan Stevanović,et al.  Correcting Density Functional Theory for Accurate Predictions of Compound Enthalpies of Formation:Fitted elemental-phase Reference Energies (FERE) , 2012 .

[28]  Kristin A. Persson,et al.  Commentary: The Materials Project: A materials genome approach to accelerating materials innovation , 2013 .

[29]  Gregory B Olson,et al.  Materials genomics: From CALPHAD to flight , 2014 .

[30]  Jianpeng Liu,et al.  Spin-orbit spillage as a measure of band inversion in insulators , 2014, 1407.1244.

[31]  Maciej Haranczyk,et al.  Computation-Ready, Experimental Metal–Organic Frameworks: A Tool To Enable High-Throughput Screening of Nanoporous Crystals , 2014 .

[32]  J. Pablo,et al.  The Materials Genome Initiative, the interplay of experiment, theory and computation , 2014 .

[33]  Anubhav Jain,et al.  New Light‐Harvesting Materials Using Accurate and Efficient Bandgap Calculations , 2015 .

[34]  I. Tanaka,et al.  First principles phonon calculations in materials science , 2015, 1506.08498.

[35]  Boris Kozinsky,et al.  AiiDA: Automated Interactive Infrastructure and Database for Computational Science , 2015, ArXiv.

[36]  Aron Walsh,et al.  Inorganic materials: The quest for new functionality. , 2015, Nature chemistry.

[37]  Muratahan Aykol,et al.  The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies , 2015 .

[38]  A. Choudhary,et al.  Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science , 2016 .

[39]  John M. Gregoire,et al.  Perspective: Composition–structure–property mapping in high-throughput experiments: Turning data into knowledge , 2016 .

[40]  M J D Rushton,et al.  Development of Xe and Kr empirical potentials for CeO2, ThO2, UO2 and PuO2, combining DFT with high temperature MD , 2016, Journal of physics. Condensed matter : an Institute of Physics journal.

[41]  Albert V. Davydov,et al.  MPInterfaces: A Materials Project based Python tool for high-throughput computational screening of interfacial systems , 2016, 1602.07784.

[42]  Chiho Kim,et al.  Machine learning in materials informatics: recent applications and prospects , 2017, npj Computational Materials.

[43]  Michael Walter,et al.  The atomic simulation environment-a Python library for working with atoms. , 2017, Journal of physics. Condensed matter : an Institute of Physics journal.

[44]  Kamal Choudhary,et al.  High-throughput Identification and Characterization of Two-dimensional Materials using Density functional theory , 2017, Scientific Reports.

[45]  Pinar Acar,et al.  Crystal Plasticity Modeling and Experimental Validation with an Orientation Distribution Function for Ti-7Al Alloy , 2017 .

[46]  Garth J. Williams,et al.  Corrigendum: Diffraction data of core-shell nanoparticles from an X-ray free electron laser , 2017, Scientific Data.

[47]  Kamal Choudhary,et al.  Evaluation and comparison of classical interatomic potentials through a user-friendly interactive web-interface , 2017, Scientific Data.

[48]  Ichiro Takeuchi,et al.  Fulfilling the promise of the materials genome initiative with high-throughput experimental methodologies , 2017 .

[49]  Anubhav Jain,et al.  Effective mass and Fermi surface complexity factor from ab initio band structure calculations , 2017, npj Computational Materials.

[50]  Christopher. Simons,et al.  Machine learning with Python , 2017 .

[51]  Kyle Chard,et al.  Matminer: An open source toolkit for materials data mining , 2018, Computational Materials Science.

[52]  J. E. Allison,et al.  PRISMS: An Integrated, Open-Source Framework for Accelerating Predictive Structural Materials Science , 2018, JOM.

[53]  Richard Dronskowski,et al.  Discovery of High-Performance Thermoelectric Chalcogenides through Reliable High-Throughput Material Screening. , 2018, Journal of the American Chemical Society.

[54]  John D. Perkins,et al.  An open experimental database for exploring inorganic materials , 2018, Scientific Data.

[55]  Kamal Choudhary,et al.  Machine learning with force-field inspired descriptors for materials: fast screening and mapping energy landscape. , 2018, Physical review materials.

[56]  Matthias Troyer,et al.  WannierTools: An open-source software package for novel topological materials , 2017, Comput. Phys. Commun..

[57]  Kamal Choudhary,et al.  High-throughput assessment of vacancy formation and surface energies of materials using classical force-fields , 2018, Journal of physics. Condensed matter : an Institute of Physics journal.

[58]  Kamal Choudhary,et al.  Elastic properties of bulk and low-dimensional materials using Van der Waals density functional. , 2018, Physical review. B.

[59]  Kamal Choudhary,et al.  Computational screening of high-performance optoelectronic materials using OptB88vdW and TB-mBJ formalisms , 2018, Scientific data.

[60]  M. Marques,et al.  Recent advances and applications of machine learning in solid-state materials science , 2019, npj Computational Materials.

[61]  Wei-keng Liao,et al.  Enhancing materials property prediction by leveraging computational and experimental data using deep transfer learning , 2019, Nature Communications.

[62]  Aldenor G. Santos,et al.  Occurrence of the potent mutagens 2- nitrobenzanthrone and 3-nitrobenzanthrone in fine airborne particles , 2019, Scientific Reports.

[63]  Sergei V. Kalinin,et al.  Materials science in the artificial intelligence age: high-throughput library generation, machine learning, and a pathway from correlations to the underpinning physics , 2019, MRS communications.

[64]  Karin M. Rabe,et al.  Systematic beyond-DFT study of binary transition metal oxides , 2019, npj Computational Materials.

[65]  Aftab Ahmed,et al.  Purification and Characterization of a Nonspecific Lipid Transfer Protein 1 (nsLTP1) from Ajwain (Trachyspermum ammi) Seeds , 2019, Scientific Reports.

[66]  Kamal Choudhary,et al.  Convergence and machine learning predictions of Monkhorst-Pack k-points and plane-wave cut-off in high-throughput DFT calculations. , 2018, Computational materials science.

[67]  Andreas Willfahrt,et al.  Polymer gels with tunable ionic Seebeck coefficient for ultra-sensitive printed thermopiles , 2019, Nature Communications.

[68]  Kamal Choudhary,et al.  High-throughput Discovery of Topologically Non-trivial Materials using Spin-orbit Spillage , 2018, Scientific Reports.

[69]  Claudia Draxl,et al.  The NOMAD laboratory: from data sharing to artificial intelligence , 2019, Journal of Physics: Materials.

[70]  Jie Jiang,et al.  Accelerated Discovery of Efficient Solar-cell Materials using Quantum and Machine-learning Methods. , 2019, Chemistry of materials : a publication of the American Chemical Society.

[71]  Rama Vasudevan,et al.  Density Functional Theory and Deep-learning to Accelerate Data Analytics in Scanning Tunneling Microscopy , 2019, 1912.09027.

[72]  Daniel Schwen,et al.  PFHub: The Phase-Field Community Hub , 2019, Journal of Open Research Software.

[73]  G. R. Schleder,et al.  From DFT to machine learning: recent approaches to materials science–a review , 2019, Journal of Physics: Materials.

[74]  Jay F. Whitacre,et al.  The Materials Research Platform: Defining the Requirements from User Stories , 2019, Matter.

[75]  Badri Narayanan,et al.  Machine learning enabled autonomous microstructural characterization in 3D samples , 2020, npj Computational Materials.

[76]  K. Garrity,et al.  Computational search for magnetic and non-magnetic 2D topological materials using unified spin–orbit spillage screening , 2020, npj Computational Materials.

[77]  Aliaksandr V. Yakutovich,et al.  Materials Cloud, a platform for open computational science , 2020, Scientific Data.

[78]  K. Garrity,et al.  Data-driven discovery of 3D and 2D thermoelectric materials , 2019, Journal of physics. Condensed matter : an Institute of Physics journal.

[79]  Francesca Tavazza,et al.  High-throughput density functional perturbation theory and machine learning predictions of infrared, piezoelectric, and dielectric responses , 2019, npj Computational Materials.

[80]  Francesca Tavazza,et al.  Efficient Computational Design of 2D van der Waals Heterostructures: Band-Alignment, Lattice-Mismatch, Web-app Generation and Machine-learning , 2020 .