Obtaining the best value for money in adaptive sequential estimation

Abstract In [Kujala, J. V., Richardson, U., & Lyytinen, H. (2010). A Bayesian-optimal principle for learner-friendly adaptation in learning games. Journal of Mathematical Psychology , 54(2), 247–255], we considered an extension of the conventional Bayesian adaptive estimation framework to situations where each observable variable is associated with a certain random cost of observation. We proposed an algorithm that chooses each placement by maximizing the expected gain in utility divided by the expected cost. In this paper, we formally justify this placement rule as an asymptotically optimal solution to the problem of maximizing the expected utility of an experiment that terminates when the total cost overruns a given budget. For example, the cost could be defined as the random time taken by each trial in an experiment, and one might wish to maximize the expected total information gain over as many trials as can be completed in 15 min. A simple, analytically tractable example is considered.

[1]  A. Watson,et al.  Quest: A Bayesian adaptive psychometric method , 1983, Perception & psychophysics.

[2]  Jay I. Myung,et al.  Optimal experimental design for model discrimination. , 2009, Psychological review.

[3]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[4]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[5]  Heikki Lyytinen,et al.  A Bayesian-optimal principle for learner-friendly adaptation in learning games , 2010 .

[6]  Y. Chow On a Strong Law of Large Numbers for Martingales , 1967 .

[7]  D. Lindley On a Measure of the Information Provided by an Experiment , 1956 .

[8]  Peter Müller,et al.  A Bayesian Decision-Theoretic Dose-Finding Trial , 2006, Decis. Anal..

[9]  David J. C. MacKay,et al.  Information-Based Objective Functions for Active Data Selection , 1992, Neural Computation.

[10]  David S. Johnson,et al.  Computers and In stractability: A Guide to the Theory of NP-Completeness. W. H Freeman, San Fran , 1979 .

[11]  Tuomas J. Lukka,et al.  Bayesian adaptive estimation: The next dimension , 2006 .

[12]  Mark A. Pitt,et al.  Adaptive Design Optimization: A Mutual Information-Based Approach to Model Discrimination in Cognitive Science , 2010, Neural Computation.

[13]  M. Degroot Optimal Statistical Decisions , 1970 .

[14]  P. King-Smith,et al.  Efficient and unbiased modifications of the QUEST threshold method: Theory, simulations, experimental evaluation and practical implementation , 1994, Vision Research.

[15]  Liam Paninski,et al.  Asymptotic Theory of Information-Theoretic Experimental Design , 2005, Neural Computation.

[16]  Joseph B. Kadane,et al.  A Gridding Method for Bayesian Sequential Decision Problems , 2003 .