Robust Optimization for Non-Convex Objectives

We consider robust optimization problems, where the goal is to optimize in the worst case over a class of objective functions. We develop a reduction from robust improper optimization to Bayesian optimization: given an oracle that returns $\alpha$-approximate solutions for distributions over objectives, we compute a distribution over solutions that is $\alpha$-approximate in the worst case. We show that de-randomizing this solution is NP-hard in general, but that it can be done for a broad class of statistical learning tasks. We apply our results to robust neural network training and submodular optimization, and we evaluate our approach experimentally on corrupted character classification and robust influence maximization in networks.
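To make the shape of the reduction concrete, here is a minimal sketch under stated assumptions. The abstract does not spell out the algorithm; the sketch assumes a multiplicative-weights loop over the objectives, which is one standard way to turn an oracle for distributions over objectives into a distribution over solutions. The names `robust_mixture`, `exact_oracle`, `candidates`, and the toy losses are hypothetical and chosen only for illustration, and the brute-force oracle stands in for an $\alpha$-approximate one with $\alpha = 1$.

```python
import math


def robust_mixture(losses, oracle, num_rounds, eta):
    """Return a list of solutions; the uniform distribution over it is the output.

    losses: list of callables, losses[i](x) -> float in [0, 1] (hypothetical interface).
    oracle: callable mapping a weight vector over objectives to a solution that
            approximately minimizes the weighted loss (hypothetical interface).
    """
    cumulative = [0.0] * len(losses)
    solutions = []
    for _ in range(num_rounds):
        # Multiplicative weights: objectives with high loss so far get more weight.
        shift = max(cumulative)  # subtract the max for numerical stability
        raw = [math.exp(eta * (c - shift)) for c in cumulative]
        total = sum(raw)
        weights = [r / total for r in raw]

        # Ask the oracle for a solution that is good for this distribution.
        x = oracle(weights)
        solutions.append(x)
        for i, loss in enumerate(losses):
            cumulative[i] += loss(x)
    return solutions


# Toy instance (made up): two solutions, two objectives. Each objective
# penalizes exactly one solution, so every single solution has worst-case loss 1.
candidates = [0, 1]
losses = [lambda x: 1.0 if x == 0 else 0.0,
          lambda x: 1.0 if x == 1 else 0.0]


def exact_oracle(weights):
    # Brute-force stand-in for the alpha-approximate oracle (alpha = 1 here).
    return min(candidates,
               key=lambda x: sum(w * f(x) for w, f in zip(weights, losses)))


solutions = robust_mixture(losses, exact_oracle, num_rounds=100, eta=0.5)
# Worst-case expected loss of the uniform mixture over the returned solutions.
worst_case = max(sum(f(x) for x in solutions) / len(solutions) for f in losses)
print(f"worst-case expected loss of the mixture: {worst_case:.3f}")
```

On this toy instance any single solution has worst-case loss 1, while a mixture that spreads its mass over both solutions has a lower worst-case expected loss. This illustrates why the reduction returns a distribution over solutions rather than a single one.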
