CONSISTENCY OF THE MAXIMUM LIKELIHOOD ESTIMATOR FOR GENERAL HIDDEN MARKOV MODELS

Consider a parametrized family of general hidden Markov models, where both the observed and unobserved components take values in a complete separable metric space. We prove that the maximum likelihood estimator (MLE) of the parameter is strongly consistent under a rather minimal set of assumptions. As special cases of our main result, we obtain consistency in a large class of nonlinear state space models, as well as general results on linear Gaussian state space models and finite state models. A novel aspect of our approach is an information-theoretic technique for proving identifiability, which does not require an explicit representation for the relative entropy rate. Our method of proof could therefore form a foundation for the investigation of MLE consistency in more general dependent and non-Markovian time series. Also of independent interest is a general concentration inequality for V-uniformly ergodic Markov chains.

[1]  J. Doob Stochastic processes , 1953 .

[2]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[3]  T Petrie,et al.  Probabilistic functions of finite-state markov chains. , 1967, Proceedings of the National Academy of Sciences of the United States of America.

[4]  A. Barron THE STRONG ERGODIC THEOREM FOR DENSITIES: GENERALIZED SHANNON-MCMILLAN-BREIMAN THEOREM' , 1985 .

[5]  Patrick Billingsley,et al.  Probability and Measure. , 1986 .

[6]  John Rice,et al.  Correlation functions of a function of a finite-state Markov process with application to channel kinetics , 1987 .

[7]  Alan G. White,et al.  The Pricing of Options on Assets with Stochastic Volatilities , 1987 .

[8]  David Williams,et al.  Probability with Martingales , 1991, Cambridge mathematical textbooks.

[9]  Biing-Hwang Juang,et al.  Hidden Markov Models for Speech Recognition , 1991 .

[10]  B. Leroux Maximum-likelihood estimation for hidden Markov models , 1992 .

[11]  Gary A. Churchill,et al.  Hidden Markov Chains and the Analysis of Genome Structure , 1992, Comput. Chem..

[12]  Richard L. Tweedie,et al.  Markov Chains and Stochastic Stability , 1993, Communications and Control Engineering Series.

[13]  Paul C. Shields,et al.  The positive-divergence and blowing-up properties , 1994 .

[14]  V. Kalashnikov,et al.  Regeneration and general Markov chains , 1994 .

[15]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[16]  Sean P. Meyn,et al.  A Liapounov bound for solutions of the Poisson equation , 1996 .

[17]  P. Gänssler Weak Convergence and Empirical Processes - A. W. van der Vaart; J. A. Wellner. , 1997 .

[18]  J. Lynch,et al.  A weak convergence approach to the theory of large deviations , 1997 .

[19]  Laurent Mevel,et al.  Exponential Forgetting and Geometric Ergodicity in Hidden Markov Models , 2000, Math. Control. Signals Syst..

[20]  Laurent Mevel,et al.  Basic Properties of the Projective Product with Application to Products of Column-Allowable Nonnegative Matrices , 2000, Math. Control. Signals Syst..

[21]  R. Douc,et al.  Asymptotics of the maximum likelihood estimator for general hidden Markov models , 2001 .

[22]  Gareth O. Roberts,et al.  Corrigendum to : Bounds on regeneration times and convergence rates for Markov chains , 2001 .

[23]  P. Glynn,et al.  Hoeffding's inequality for uniformly ergodic Markov chains , 2002 .

[24]  R. Douc,et al.  Asymptotic properties of the maximum likelihood estimator in autoregressive models with Markov regime , 2004, math/0503681.

[25]  Eric Moulines,et al.  Inference in hidden Markov models , 2010, Springer series in statistics.

[26]  E. Liebscher Towards a Unified Approach for Proving Geometric Ergodicity and Mixing Properties of Nonlinear Autoregressive Processes , 2005 .

[27]  Catherine Laredo,et al.  Leroux's method for general hidden Markov models , 2006 .

[28]  Cheng-Der Fuh,et al.  Efficient likelihood estimation in state space models , 2006 .

[29]  Rogemar S. Mamon,et al.  Hidden Markov Models In Finance , 2007 .

[30]  Dimitri P. Bertsekas,et al.  Stochastic optimal control : the discrete time case , 2007 .

[31]  Journal Url,et al.  A tail inequality for suprema of unbounded empirical processes with applications to Markov chains , 2008 .

[32]  Ramon van Handel,et al.  The stability of conditional Markov processes and Markov chains in random environments , 2008, 0801.4366.

[33]  Reply to “On some problems in the article Efficient Likelihood Estimation in State Space Models” by Cheng-Der Fuh [Ann. Statist. 34 (2006) 2026–2068] , 2010, 1911.00813.

[34]  On some problems in the article Efficient Likelihood Estimation in State Space Models by Cheng-Der Fuh [Ann. Statist. 34 (2006) 2026–2068] , 2010, 1002.4959.