Stochastic optimization for adapting behaviors of exploration agents

We propose an explorer that uses a flexible problem-solver with a significant capacity to adapt its behavior.

[1]  Eric Horvitz,et al.  Reasoning under Varying and Uncertain Resource Constraints , 1988, AAAI.

[2]  Jonathan Gratch,et al.  On the Efficient Allocation of Resources for Hypothesis Evaluation: A Statistical Approach , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Monte Zweben,et al.  Scheduling and rescheduling with iterative repair , 1993, IEEE Trans. Syst. Man Cybern..

[4]  M. Talagrand A new isoperimetric inequality and the concentration of measure phenomenon , 1991 .

[5]  Andrew W. Moore,et al.  Hoeffding Races: Accelerating Model Selection Search for Classification and Function Approximation , 1993, NIPS.

[6]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[7]  A. Nádas An Extension of a Theorem of Chow and Robbins on Sequential Confidence Intervals for the Mean , 1969 .

[8]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[9]  Steve A. Chien,et al.  Efficient Heuristic Hypothesis Ranking , 1999, J. Artif. Intell. Res..

[10]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[11]  A E Bostwick,et al.  THE THEORY OF PROBABILITIES. , 1896, Science.

[12]  Alex Fukunaga,et al.  Using ASPEN to automate EO-1 activity planning , 1998, 1998 IEEE Aerospace Conference Proceedings (Cat. No.98TH8339).

[13]  Forbes AvenuePittsburgh Memory Based Stochastic Optimization for Validation and Tuning of Function Approximators , 1997 .

[14]  Gerald DeJong,et al.  COMPOSER: A Probabilistic Solution to the Utility Problem in Speed-Up Learning , 1992, AAAI.

[15]  Devika Subramanian,et al.  Provably Bounded Optimal Agents , 1993, IJCAI.

[16]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .