Ergodicity, Decisions, and Partial Information

In the simplest sequential decision problem for an ergodic stochastic process X, at each time n a decision u n is made as a function of past observations \(X_{0},\ldots,X_{n-1}\), and a loss l(u n , X n ) is incurred. In this setting, it is known that one may choose (under a mild integrability assumption) a decision strategy whose pathwise time-average loss is asymptotically smaller than that of any other strategy. The corresponding problem in the case of partial information proves to be much more delicate, however: if the process X is not observable, but decisions must be based on the observation of a different process Y, the existence of pathwise optimal strategies is not guaranteed. The aim of this paper is to exhibit connections between pathwise optimal strategies and notions from ergodic theory. The sequential decision problem is developed in the general setting of an ergodic dynamical system \((\varOmega,\mathcal{B},\mathbf{P},T)\) with partial information \(\mathcal{Y}\subseteq \mathcal{B}\). The existence of pathwise optimal strategies grounded in two basic properties: the conditional ergodic theory of the dynamical system, and the complexity of the loss function. When the loss function is not too complex, a general sufficient condition for the existence of pathwise optimal strategies is that the dynamical system is a conditional K-automorphism relative to the past observations \(\bigvee _{n\geq 0}T^{n}\mathcal{Y}\). If the conditional ergodicity assumption is strengthened, the complexity assumption can be weakened. Several examples demonstrate the interplay between complexity and ergodicity, which does not arise in the case of full information. Our results also yield a decision-theoretic characterization of weak mixing in ergodic theory, and establish pathwise optimality of ergodic nonlinear filters.

[1]  Andrew B. Nobel,et al.  On optimal sequential prediction for general processes , 2003, IEEE Trans. Inf. Theory.

[2]  J. Doob Stochastic processes , 1953 .

[3]  Paul R. Halmos,et al.  In General a Measure Preserving Transformation is Mixing , 1944 .

[4]  H. Weizsäcker Exchanging the order of taking suprema and countable intersections of $\sigma $-algebras , 1983 .

[5]  Haruo Totoki On a class of special flows , 1970 .

[6]  V. Losert,et al.  The weighted pointwise ergodic theorem and the individual ergodic theorem along subsequences , 1985 .

[7]  D. Rudolph Pointwise and $L\sp 1$ mixing relative to a sub-sigma algebra , 2004 .

[8]  J. Stoyanov The Oxford Handbook of Nonlinear Filtering , 2012 .

[9]  Paul H. Algoet,et al.  The strong law of large numbers for sequential decisions under uncertainty , 1994, IEEE Trans. Inf. Theory.

[10]  J. Neveu,et al.  Discrete Parameter Martingales , 1975 .

[11]  Ramon van Handel,et al.  The stability of conditional Markov processes and Markov chains in random environments , 2008, 0801.4366.

[12]  Uniform Convergence of Martingales , 1990 .

[13]  M. Ledoux,et al.  Convergence of Empirical Processes for Interacting Particle Systems with Applications to Nonlinear Filtering , 2000 .

[14]  Richard L. Tweedie,et al.  Markov Chains and Stochastic Stability , 1993, Communications and Control Engineering Series.

[15]  M. Mirzakhani,et al.  Introduction to Ergodic theory , 2010 .

[16]  N. Etemadi An elementary proof of the strong law of large numbers , 1981 .

[17]  P. Walters Introduction to Ergodic Theory , 1977 .

[18]  V. Volkonskii,et al.  Some Limit Theorems for Random Functions. II , 1959 .

[19]  J. Conze Convergence des moyennes ergodiques pour des sous-suites , 1973 .

[20]  Ramon van Handel,et al.  The universal Glivenko–Cantelli property , 2010, 1009.4434.

[21]  R. Handel Uniform time average consistency of Monte Carlo particle filters , 2008, 0812.0350.

[22]  Ramon van Handel,et al.  A complete solution to Blackwell's unique ergodicity problem for hidden Markov chains , 2009, ArXiv.

[23]  Tsachy Weissman,et al.  Universal prediction of random binary sequences in a noisy environment , 2004 .

[24]  D. Pollard A User's Guide to Measure Theoretic Probability by David Pollard , 2001 .

[25]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[26]  R. Handel On the exchange of intersection and supremum of σ-fields in filtering theory , 2010, 1009.0507.

[27]  A. W. van der Vaart,et al.  Uniform Central Limit Theorems , 2001 .

[28]  Eric Moulines,et al.  Inference in hidden Markov models , 2010, Springer series in statistics.

[29]  P. Gänssler Weak Convergence and Empirical Processes - A. W. van der Vaart; J. A. Wellner. , 1997 .

[30]  T. Lindvall Lectures on the Coupling Method , 1992 .

[31]  O. Kallenberg Foundations of Modern Probability , 2021, Probability Theory and Stochastic Modelling.

[32]  H. Kunita Asymptotic behavior of the nonlinear filtering errors of Markov processes , 1971 .

[33]  P. Meyer,et al.  Probabilities and potential C , 1978 .

[34]  R. Handel,et al.  CONDITIONAL ERGODICITY IN INFINITE DIMENSION , 2012, 1208.3182.

[35]  A. Grothendieck,et al.  Produits Tensoriels Topologiques Et Espaces Nucleaires , 1966 .