论文信息 - Meta Online Learning: Experiments on a Unit Commitment Problem

Meta Online Learning: Experiments on a Unit Commitment Problem

Online learning is machine learning, in real time from suc- cessive data samples. Meta online learning consists in combining several online learning algorithms from a given set (termed portfolio) of algo- rithms. The goal can be (i) mitigating the effect of a bad choice of online learning algorithms (ii) parallelization (iii) combining the strengths of dif- ferent algorithms. Basically, meta online learning boils down to combining noisy optimization algorithms. Whereas many tools exist for combining combinatorial optimization tools, little is known about combining noisy optimization algorithms. Recently, a methodology termed lag has been proposed for that. We test experimentally the lag methodology for online learning, for a stock management problem and a cartpole problem.

Olivier Teytaud | Jialin Liu

[1] Lars Kottho,et al. Algorithm Selection for Combinatorial Search Problems: A survey , 2012 .

[2] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[3] Yoav Shoham,et al. Understanding Random SAT: Beyond the Clauses-to-Variables Ratio , 2004, CP.

[4] Alex M. Andrew,et al. Reinforcement Learning: : An Introduction , 1998 .

[5] Lars Kotthoff,et al. Algorithm Selection for Combinatorial Search Problems: A Survey , 2012, AI Mag..

[6] V. Fabian. Stochastic Approximation of Minima with Improved Asymptotic Speed , 1967 .

[7] Olivier Teytaud,et al. Algorithm Portfolios for Noisy Optimization: Compare Solvers Early , 2014, LION.