Expensive optimization with production-graph resource constraints: a first look at a new problem class

We consider a new class of expensive, resource-constrained optimization problems (here arising from molecular discovery) where costs are associated with the experiments (or evaluations) to be carried out during the optimization process. In the molecular discovery problem, candidate compounds to be optimized must be synthesized in an iterative process that starts from a set of purchasable items and builds up to larger molecules. To produce target molecules, their required resources are either used from already-synthesized items in storage or produced themselves on-demand at an additional cost. Any remaining resources from the production process are stored for reuse for the next evaluations. We model these resource dependencies with a directed acyclic production graph describing the development process from granular purchasable items to evaluable target compounds. Moreover, we develop several resource-eficient algorithms to address this problem. In particular, we develop resource-aware variants of Random Search heuristics and of Bayesian Optimization and analyze their performance in terms of anytime behavior. The experimental results were obtained from a real-world molecular optimization problem. Our results suggest that algorithms that encourage exploitation by reusing existing resources achieve satisfactory results while using fewer resources overall.

[1]  Carla P. Gomes,et al.  Computational sustainability meets materials science , 2021, Nature Reviews Materials.

[2]  K. Tsuda,et al.  Black-Box Optimization for Automated Discovery. , 2021, Accounts of chemical research.

[3]  Huanlai Xing,et al.  Expected improvement for expensive optimization: a review , 2020, Journal of Global Optimization.

[4]  Cedric Archambeau,et al.  Cost-aware Bayesian Optimization , 2020, ArXiv.

[5]  Ryan-Rhys Griffiths,et al.  Constrained Bayesian optimization for automatic chemical design using variational autoencoders , 2019, Chemical science.

[6]  Kirthevasan Kandasamy,et al.  ChemBO: Bayesian Optimization of Small Organic Molecules with Synthesizable Recommendations , 2019, AISTATS.

[7]  Joseph R. Kasprzyk,et al.  Computational design optimization of concrete mixtures: A review , 2018, Cement and Concrete Research.

[8]  Constantine Bekas,et al.  “Found in Translation”: predicting outcomes of complex organic chemistry reactions using neural sequence-to-sequence models† †Electronic supplementary information (ESI) available: Time-split test set and example predictions, together with attention weights, confidence and token probabilities. See DO , 2017, Chemical science.

[9]  Alán Aspuru-Guzik,et al.  Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules , 2016, ACS central science.

[10]  Matt J. Kusner,et al.  Grammar Variational Autoencoder , 2017, ICML.

[11]  Thomas Stützle,et al.  Automatically improving the anytime behaviour of optimisation algorithms , 2014, Eur. J. Oper. Res..

[12]  Thomas G. Dietterich,et al.  Computational sustainability , 2011, AI Mag..

[13]  Joshua D. Knowles,et al.  On Handling Ephemeral Resource Constraints in Evolutionary Search , 2013, Evolutionary Computation.

[14]  Jeremy N. Burrows,et al.  The Open Access Malaria Box: A Drug Discovery Catalyst for Neglected Diseases , 2013, PloS one.

[15]  Andreas Krause,et al.  Navigating the protein fitness landscape with Gaussian processes , 2012, Proceedings of the National Academy of Sciences.

[16]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[17]  Richard Allmendinger,et al.  Tuning evolutionary search for closed-loop optimization , 2012 .

[18]  G. V. Paolini,et al.  Quantifying the chemical beauty of drugs. , 2012, Nature chemistry.

[19]  Douglas H. Fisher,et al.  Recent Advances in AI for Computational Sustainability , 2012, IEEE Intelligent Systems.

[20]  Joshua D. Knowles,et al.  Policy learning in resource-constrained optimization , 2011, GECCO '11.

[21]  Yaochu Jin,et al.  Surrogate-assisted evolutionary computation: Recent advances and future challenges , 2011, Swarm Evol. Comput..

[22]  Joshua D. Knowles,et al.  On-Line Purchasing Strategies for an Evolutionary Algorithm Performing Resource-Constrained Optimization , 2010, PPSN.

[23]  Anne Auger,et al.  Comparing results of 31 algorithms from the black-box optimization benchmarking BBOB-2009 , 2010, GECCO '10.

[24]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[25]  Joshua D. Knowles Closed-loop evolutionary multiobjective optimization , 2009, IEEE Computational Intelligence Magazine.

[26]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[27]  C. Gomes Computational Sustainability: Computational methods for a sustainable environment, economy, and society , 2009 .

[28]  Jürgen Branke,et al.  Faster convergence by means of fitness estimation , 2005, Soft Comput..

[29]  Christopher H. Bryant,et al.  Functional genomic hypothesis generation and experimentation by a robot scientist , 2004, Nature.

[30]  A. O'Hagan,et al.  Predicting the output from a complex computer code when fast approximations are available , 2000 .

[31]  Sam Kwong,et al.  Genetic algorithms: concepts and applications [in engineering design] , 1996, IEEE Trans. Ind. Electron..

[32]  Shlomo Zilberstein,et al.  Using Anytime Algorithms in Intelligent Systems , 1996, AI Mag..

[33]  Terry Jones,et al.  Fitness Distance Correlation as a Measure of Problem Difficulty for Genetic Algorithms , 1995, ICGA.

[34]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[35]  H. L. Morgan The Generation of a Unique Machine Description for Chemical Structures-A Technique Developed at Chemical Abstracts Service. , 1965 .