Automatic testing and minimax optimization of system parameters for best worst-case performance

Robotic systems typically have numerous parameters, e.g. the choice of planning algorithm, real-valued parameters of motion and vision modules, and control parameters. We consider the problem of optimizing these parameters for best worst-case performance over a range of environments. To this end we first propose to evaluate system parameters by adversarially optimizing over environment parameters to find particularly hard environments. This is then nested in a game-theoretic minimax optimization setting, where an outerloop aims to find best worst-case system parameters. For both optimization levels we use Bayesian global optimization (GP-UCB) which provides the necessary confidence bounds to handle the stochasticity of the performance. We compare our method (Nested Minimax) with an existing relaxation method we adapted to become applicable in our setting. By construction our approach provides more robustness to performance stochasticity. We demonstrate the method for planning algorithm selection on a pick'n'place application and for control parameter optimization on a triple inverted pendulum for robustness to adversarial perturbations.

[1]  Shin'ichi Yuta,et al.  Control parameter design for robot vehicle based on numerical simulation and heuristic optimization - Feed-back controller design for trajectory tracking under strict physical constraints in wide speed range - , 2011, Computational Intelligence in Control and Automation (CICA).

[2]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[3]  M. Zeitz,et al.  Feedforward control design for finite-time transition problems of non-linear MIMO systems under input constraints , 2008, Int. J. Control.

[4]  Julien Marzat,et al.  Worst-case global optimization of black-box functions through Kriging and relaxation , 2012, Journal of Global Optimization.

[5]  Andreas Krause,et al.  Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting , 2009, IEEE Transactions on Information Theory.

[6]  Septimiu E. Salcudean,et al.  Fast constrained global minimax optimization of robot parameters , 1998, Robotica.

[7]  Julien Marzat,et al.  Robust minimax design from costly simulations , 2014 .

[8]  Qingfu Zhang,et al.  A surrogate-assisted evolutionary algorithm for minimax optimization , 2010, IEEE Congress on Evolutionary Computation.

[9]  Scott D. Sudhoff,et al.  Evolutionary Algorithms for Minimax Problems in Robust Design , 2009, IEEE Transactions on Evolutionary Computation.

[10]  Dumitru Dumitrescu,et al.  A new evolutionary approach to minimax problems , 2011, 2011 IEEE Congress of Evolutionary Computation (CEC).

[11]  Kai-Yew Lum,et al.  Max-min surrogate-assisted evolutionary algorithm for robust design , 2006, IEEE Transactions on Evolutionary Computation.

[12]  Sachin Chitta,et al.  A generic infrastructure for benchmarking motion planners , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Michael Zeitz Differenzielle Flachheit: Eine nützliche Methodik auch für lineare SISO-SystemeDifferential Flatness: A Useful Method also for Linear SISO Systems , 2010, Autom..

[14]  Jan Peters,et al.  An experimental comparison of Bayesian optimization for bipedal locomotion , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Knut Graichen,et al.  Feedforward Control Design for Finite-Time Transition Problems of Nonlinear Systems With Input and Output Constraints , 2008, IEEE Transactions on Automatic Control.