Efficient Black-Box Combinatorial Optimization

We consider the problem of black-box function optimization over combinatorial domains. Despite the vast literature on black-box function optimization over continuous domains, not much attention has been paid to learning models for optimization over combinatorial domains until recently. Nonetheless, optimization of such functions using state-of-the-art Bayesian optimization techniques, specifically designed for combinatorial domains, still remains challenging due to the associated computational complexity. To address this problem, we propose a computationally efficient model learning algorithm based on exponential weight updates. In particular, we use multilinear polynomials as surrogate model for functions over the Boolean hypercube. We further extend this representation via a group-theoretic Fourier expansion to address functions over categorical variables. Numerical experiments over synthetic benchmarks as well as real-world biological sequence design problems demonstrate the competitive or superior performance of the proposed algorithms versus a state-of-the-art Bayesian optimization algorithm while reducing the computational cost by multiple orders of magnitude.

[1]  Michael Zuker,et al.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information , 1981, Nucleic Acids Res..

[2]  Manfred K. Warmuth,et al.  Exponentiated Gradient Versus Gradient Descent for Linear Predictors , 1997, Inf. Comput..

[3]  A. Terras Fourier Analysis on Finite Groups and Applications: Index , 1999 .

[4]  P. Rowlinson FOURIER ANALYSIS ON FINITE GROUPS AND APPLICATIONS (London Mathematical Society Student Texts 43) , 2000 .

[5]  Charles J. Colbourn,et al.  Handbook of Combinatorial Designs, Second Edition (Discrete Mathematics and Its Applications) , 2006 .

[6]  I. Stansfield,et al.  Halting a cellular production line: responses to ribosomal pausing during translation , 2007, Biology of the cell.

[7]  P. Simmonds,et al.  Bioinformatic and Physical Characterizations of Genome-Scale Ordered RNA Structure in Mammalian RNA Viruses , 2008, Journal of Virology.

[8]  J. Micklefield,et al.  Reengineering orthogonally selective riboswitches , 2010, Proceedings of the National Academy of Sciences.

[9]  Jia Yuan Yu,et al.  Adaptive and Optimal Online Linear Regression on ℓ1-Balls , 2011, ALT.

[10]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[11]  Sanjeev Arora,et al.  The Multiplicative Weights Update Method: a Meta-Algorithm and Applications , 2012, Theory Comput..

[12]  T. Sanders,et al.  Analysis of Boolean Functions , 2012, ArXiv.

[13]  Dimitris Papamichail,et al.  Computational Tools and Algorithms for Designing Customized Synthetic Genes , 2014, Front. Bioeng. Biotechnol..

[14]  Xing-jie Liang,et al.  RNA as a stable polymer to build controllable and defined nanostructures for material and biomedical applications , 2015, Nano today.

[15]  M. Schatz,et al.  Big Data: Astronomical or Genomical? , 2015, PLoS biology.

[16]  Nando de Freitas,et al.  Taking the Human Out of the Loop: A Review of Bayesian Optimization , 2016, Proceedings of the IEEE.

[17]  D. Sculley,et al.  Google Vizier: A Service for Black-Box Optimization , 2017, KDD.

[18]  Alán Aspuru-Guzik,et al.  Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules , 2016, ACS central science.

[19]  Matthias Poloczek,et al.  Bayesian Optimization of Combinatorial Structures , 2018, ICML.

[20]  David Baker,et al.  Modular and tunable biological feedback control using a de novo protein switch , 2019, Nature.

[21]  D. Mathews,et al.  Design of highly active double-pseudoknotted ribozymes: a combined computational and experimental study , 2018, Nucleic acids research.

[22]  Jakub M. Tomczak,et al.  Combinatorial Bayesian Optimization using the Graph Cartesian Product , 2019, NeurIPS.

[23]  Adam J. Meyer,et al.  Hachimoji DNA and RNA: A genetic system with eight building blocks , 2019, Science.

[24]  Samuel C. Hoffman,et al.  Combinatorial Black-Box Optimization with Expert Advice , 2020, KDD.

[25]  Eduardo C. Garrido-Merchán,et al.  Dealing with Categorical and Integer-valued Variables in Bayesian Optimization with Gaussian Processes , 2017, Neurocomputing.