Asymptotic optimality results for controlled sequential estimation

We consider the problem of sequential estimation of a random parameter under a controlled setting. Unlike traditional estimation problems, the collected observations depend on the used actions, which control the quality of the sensing process. At each time step, the decision maker chooses a control from a finite set of controls or decides to stop collecting measurements. The goal is to design an efficient causal control policy and a stopping rule and the efficiency is captured using the notion of asymptotic pointwise optimality (APO). This setup, in the context of sequential estimation for controlled parameter estimation was first considered in [1] for a special case where the distributions corresponding to different controls depend on uncommon parameters. In this paper, we extend the results in [1] to a more general case wherein the observation models under different controls could depend on common parameters. For this general setting, we propose a procedure consisting of a control policy and stopping rule, which is shown to be APO. In the process we identify and point out several applications, particularly in the area of active learning.

[1]  P. Bickel,et al.  Asymptotically pointwise optimal procedures in sequential analysis , 1967 .

[2]  Venugopal V. Veeravalli,et al.  Multihypothesis sequential probability ratio tests - Part I: Asymptotic optimality , 1999, IEEE Trans. Inf. Theory.

[3]  P. Bickel,et al.  Some contributions to the asymptotic theory of Bayes solutions , 1969 .

[4]  Victor J. Yohai,et al.  Asymptotically Optimal Bayes Sequential Design of Experiments for Estimation , 1973 .

[5]  Tara Javidi,et al.  Active M-ary sequential hypothesis testing , 2010, 2010 IEEE International Symposium on Information Theory.

[6]  Peter J. Bickel,et al.  Asymptotically Optimal Bayes and Minimax Procedures in Sequential Estimation , 1968 .

[7]  J. Andel Sequential Analysis , 2022, The SAGE Encyclopedia of Research Design.

[8]  George Atia,et al.  Controlled Sensing for Multihypothesis Testing , 2012, IEEE Transactions on Automatic Control.

[9]  A. Dasgupta Asymptotic Theory of Statistics and Probability , 2008 .

[10]  Koby Crammer,et al.  Advice-Efficient Prediction with Expert Advice , 2013, ArXiv.

[11]  Christos Dimitrakakis,et al.  Cost-Minimising Strategies for Data Labelling: Optimal Stopping and Active Learning , 2007, FoIKS.

[12]  Ohad Shamir,et al.  Efficient Learning with Partially Observed Attributes , 2010, ICML.

[13]  J. Walrand,et al.  Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards , 1987 .

[14]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[15]  George Atia,et al.  Controlled sensing for sequential multihypothesis testing , 2012, 2012 IEEE International Symposium on Information Theory Proceedings.

[16]  P. Bickel,et al.  Mathematical Statistics: Basic Ideas and Selected Topics , 1977 .