论文信息 - A Reinforcement Learning Approach to Interval Constraint Propagation

A Reinforcement Learning Approach to Interval Constraint Propagation

When solving systems of nonlinear equations with interval constraint methods, it has often been observed that many calls to contracting operators do not participate actively to the reduction of the search space. Attempts to statically select a subset of efficient contracting operators fail to offer reliable performance speed-ups. By embedding the recency-weighted average Reinforcement Learning method into a constraint propagation algorithm to dynamically learn the best operators, we show that it is possible to obtain robust algorithms with reliable performances on a range of sparse problems. Using a simple heuristic to compute initial weights, we also achieve significant performance speed-ups for dense problems.

Frédéric Goualard | Christophe Jermann | F. Goualard | C. Jermann

[1] Pascal Van Hentenryck,et al. CLP(Intervals) Revisited , 1994, ILPS.

[2] Nicolò Cesa-Bianchi,et al. Gambling in a rigged casino: The adversarial multi-armed bandit problem , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[3] Yahia Lebbah,et al. Accelerating filtering techniques for numeric CSPs , 2002, Artif. Intell..

[4] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[5] Guido D. Salvucci,et al. Ieee standard for binary floating-point arithmetic , 1985 .

[6] Frédéric Goualard,et al. On the Selection of a Transversal to Solve Nonlinear Systems with Interval Arithmetic , 2006, International Conference on Computational Science.

[7] Y. Freund,et al. The non-stochastic multi-armed bandit problem , 2001 .

[8] Richard S. Sutton,et al. Reinforcement Learning , 1992, Handbook of Machine Learning.

[9] A. Neumaier. Interval methods for systems of equations , 1990 .

[10] M. H. van Emden,et al. Interval arithmetic: From principles to implementation , 2001, JACM.

[11] Alan K. Mackworth. Consistency in Networks of Relations , 1977, Artif. Intell..

[12] D. J. Hand,et al. Artificial intelligence , 1981, Psychological Medicine.

[13] James M. Ortega,et al. Iterative solution of nonlinear equations in several variables , 2014, Computer science and applied mathematics.

[14] E. Hansen,et al. Bounding solutions of systems of equations using interval analysis , 1981 .

[15] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..

[16] Iain S. Duff,et al. On Algorithms for Obtaining a Maximum Transversal , 1981, TOMS.

[17] Frédéric Goualard. On considering an interval constraint solving algorithm as a free-steering nonlinear Gauss-Seidel procedure , 2005, SAC '05.

[18] IMPROVING THE EFFICIENCY OF A POLYNOMIAL SYSTEM SOLVER VIA A REORDERING TECHNIQUE , 2002 .

[19] Ansi Ieee,et al. IEEE Standard for Binary Floating Point Arithmetic , 1985 .

[20] Gaétan Hains,et al. A conservative scheme for parallel interval narrowing , 2000, Inf. Process. Lett..

[21] Michel Cosnard,et al. Numerical Solution of Nonlinear Equations , 1979, TOMS.

[22] R. B. Kearfott,et al. A Review of Preconditioners for the Interval Gauss–Seidel Method , 2000 .

[23] Panos M. Pardalos,et al. Encyclopedia of Optimization , 2006 .

[24] Stefan Herbort,et al. Improving the Efficiency of a Nonlinear-System-Solver Using a Componentwise Newton Method , 1997 .