Efficient search of compositional space for hybrid organic–inorganic perovskites via Bayesian optimization

Accelerated searches, made possible by machine learning techniques, are of growing interest in materials discovery. A suitable case involves the solution processing of components that ultimately form thin films of solar cell materials known as hybrid organic–inorganic perovskites (HOIPs). The number of molecular species that combine in solution to form these films constitutes an overwhelmingly large “compositional” space (at times, exceeding 500,000 possible combinations). Selecting a HOIP with desirable characteristics involves choosing different cations, halides, and solvent blends from a diverse palette of options. An unguided search by experimental investigations or molecular simulations is prohibitively expensive. In this work, we propose a Bayesian optimization method that uses an application-specific kernel to overcome challenges where data is scarce, and in which the search space is given by binary variables indicating whether a constituent is present or not. We demonstrate that the proposed approach identifies HOIPs with the targeted maximum intermolecular binding energy between HOIP salt and solvent at considerably lower cost than previous state-of-the-art Bayesian optimization methodology and at a fraction of the time (less than 10%) needed to complete an exhaustive search. We find an optimal composition within 15 ± 10 iterations in a HOIP compositional space containing 72 combinations, and within 31 ± 9 iterations when considering mixed halides (240 combinations). Exhaustive quantum mechanical simulations of all possible combinations were used to validate the optimal prediction from a Bayesian optimization approach. This paper demonstrates the potential of the Bayesian optimization methodology reported here for new materials discovery.Bayesian optimization: accelerated perovskites and solvent screeningA cheaper and more effective optimization method has been developed to screen composition and solvent interactions in organic–inorganic perovskites. A collaborative team led by Matthias Poloczek from University of Arizona, USA, propose a modified Bayesian optimization method to predict the optimal combination of various hybrid organic–inorganic perovskites and solvents. They screen the intermolecular binding energy of 240 possible perovskite-solvent combinations, finding that the maximum occurs for FAPbI2Cl and tetrahydrothiophene 1-oxide. The use of an application-specific kernel overcomes challenges such as data scarcity and reduces the computational cost substantially compared to previous state-of-the-art Bayesian optimization methods. The described method can be extended to study mixed ions and mixed halide/cation systems, and could be applicable to a wide range of materials design and optimization problems.

[1]  Mohammad Khaja Nazeeruddin,et al.  One-Year stable perovskite solar cells by 2D/3D interface engineering , 2017, Nature Communications.

[2]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Stefan Grimme,et al.  Effect of the damping function in dispersion corrected density functional theory , 2011, J. Comput. Chem..

[4]  Dong Uk Lee,et al.  Iodide management in formamidinium-lead-halide–based perovskite layers for efficient solar cells , 2017, Science.

[5]  Aron Walsh,et al.  Ionic transport in hybrid lead iodide perovskite solar cells , 2015, Nature Communications.

[6]  Michael J. Todd,et al.  On Khachiyan's algorithm for the computation of minimum-volume enclosing ellipsoids , 2007, Discret. Appl. Math..

[7]  Zhongqiang Zhang,et al.  Recent advances in perovskite solar cells: efficiency, stability and lead-free perovskite , 2017 .

[8]  S. Granick,et al.  Giant capsids from lattice self-assembly of cyclodextrin complexes , 2017, Nature Communications.

[9]  Thomas Wågberg,et al.  Toward a Low‐Cost Artificial Leaf: Driving Carbon‐Based and Bifunctional Catalyst Electrodes with Solution‐Processed Perovskite Photovoltaics , 2016 .

[10]  Angela F. Harper,et al.  Controlling nucleation, growth, and orientation of metal halide perovskite thin films with rationally selected additives , 2017 .

[11]  Nripan Mathews,et al.  Effect of Cation Composition on the Mechanical Stability of Perovskite Solar Cells , 2018 .

[12]  Jinsong Huang,et al.  Manipulating Crystallization of Organolead Mixed-Halide Thin Films in Antisolvent Baths for Wide-Bandgap Perovskite Solar Cells. , 2016, ACS applied materials & interfaces.

[13]  Paul Heremans,et al.  Nonhazardous Solvent Systems for Processing Perovskite Photovoltaics , 2016 .

[14]  Nando de Freitas,et al.  A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning , 2010, ArXiv.

[15]  Kenneth David Strang,et al.  Big Data Paradigm: What is the Status of Privacy and Security? , 2017, Annals of Data Science.

[16]  Chiho Kim,et al.  A hybrid organic-inorganic perovskite dataset , 2017, Scientific Data.

[17]  Kristian Kersting,et al.  pyGPs: a Python library for Gaussian process regression and classification , 2015, J. Mach. Learn. Res..

[18]  T. Bein,et al.  Single-crystal-like optoelectronic-properties of MAPbI3 perovskite polycrystalline thin films , 2018 .

[19]  P. Delugas,et al.  Methylammonium Rotational Dynamics in Lead Halide Perovskite by Classical Molecular Dynamics: The Role of Temperature , 2015 .

[20]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[21]  C. Cramer,et al.  Universal solvation model based on solute electron density and on a continuum model of the solvent defined by the bulk dielectric constant and atomic surface tensions. , 2009, The journal of physical chemistry. B.

[22]  David Cahen,et al.  Hybrid organic—inorganic perovskites: low-cost semiconductors with intriguing charge-transport properties , 2016 .

[23]  Kevin Leyton-Brown,et al.  Sequential Model-Based Optimization for General Algorithm Configuration , 2011, LION.

[24]  Fabio C Gozzo,et al.  Correction: Corrigendum: αB-crystallin interacts with and prevents stress-activated proteolysis of focal adhesion kinase by calpain in cardiomyocytes , 2015, Nature Communications.

[25]  Shahjadi Hisan Farjana,et al.  Recent Advances in Nanogenerator‐Driven Self‐Powered Implantable Biomedical Devices , 2018 .

[26]  Stefan Grimme,et al.  Semiempirical GGA‐type density functional constructed with a long‐range dispersion correction , 2006, J. Comput. Chem..

[27]  Y. Loo,et al.  Influence of Solvent Coordination on Hybrid Organic–Inorganic Perovskite Formation , 2018 .

[28]  Steve Plimpton,et al.  Fast parallel algorithms for short-range molecular dynamics , 1993 .

[29]  José Mario Martínez,et al.  PACKMOL: A package for building initial configurations for molecular dynamics simulations , 2009, J. Comput. Chem..

[30]  Donald R. Jones,et al.  Efficient Global Optimization of Expensive Black-Box Functions , 1998, J. Glob. Optim..

[31]  Shahzad Ahmad,et al.  Molecular dynamics simulations of organohalide perovskite precursors: solvent effects in the formation of perovskite solar cells. , 2015, Physical chemistry chemical physics : PCCP.

[32]  Frank Neese,et al.  The ORCA program system , 2012 .

[33]  W. L. Jorgensen,et al.  Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids , 1996 .

[34]  Nando de Freitas,et al.  Taking the Human Out of the Loop: A Review of Bayesian Optimization , 2016, Proceedings of the IEEE.

[35]  F. Weigend,et al.  Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for H to Rn: Design and assessment of accuracy. , 2005, Physical chemistry chemical physics : PCCP.

[36]  Yixin Zhao,et al.  Mixed cation hybrid lead halide perovskites with enhanced performance and stability , 2017 .

[37]  Donald G. Truhlar,et al.  Implicit Solvation Models: Equilibria, Structure, Spectra, and Dynamics , 1999 .

[38]  Frank Hutter,et al.  Automated configuration of algorithms for solving hard computational problems , 2009 .

[39]  B. Sumpter,et al.  Density-functional approaches to noncovalent interactions: a comparison of dispersion corrections (DFT-D), exchange-hole dipole moment (XDM) theory, and specialized functionals. , 2011, The Journal of chemical physics.

[40]  Dong-Won Kang,et al.  PCBM-blended chlorobenzene hybrid anti-solvent engineering for efficient planar perovskite solar cells , 2017 .

[41]  Daniel S. Falster,et al.  Corrigendum: The Coral Trait Database, a curated database of trait information for coral species from the global oceans , 2017, Scientific Data.

[42]  Warren B. Powell,et al.  The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery , 2011, INFORMS J. Comput..

[43]  A. Tsipis DFT flavor of coordination chemistry , 2014 .

[44]  James M. Stevenson,et al.  Mayer Bond Order as a Metric of Complexation Effectiveness in Lead Halide Perovskite Solutions , 2017 .

[45]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[46]  D. Sculley,et al.  Google Vizier: A Service for Black-Box Optimization , 2017, KDD.