Learning Parametric-Output HMMs with Two Aliased States

In various applications involving hidden Markov models (HMMs), some of the hidden states are aliased, having identical output distributions. The minimality, identifiability and learnability of such aliased HMMs have been long standing problems, with only partial solutions provided thus far. In this paper we focus on parametric-output HMMs, whose output distributions come from a parametric family, and that have exactly two aliased states. For this class, we present a complete characterization of their minimality and identifiability. Furthermore, for a large family of parametric output distributions, we derive computationally efficient and statistically consistent algorithms to detect the presence of aliasing and learn the aliased HMM transition and emission parameters. We illustrate our theoretical analysis by several simulations.

[1]  D. Blackwell,et al.  On the Identifiability Problem for Functions of Finite Markov Chains , 1957 .

[2]  E. J. Gilbert On the Identifiability Problem for Functions of Finite Markov Chains , 1959 .

[3]  T Petrie,et al.  Probabilistic functions of finite-state markov chains. , 1967, Proceedings of the National Academy of Sciences of the United States of America.

[4]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[5]  J. Rice,et al.  On aggregated Markov processes , 1986 .

[6]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[7]  F. Sigworth,et al.  Yet another approach to the dwell-time omission problem of single-channel analysis. , 1990, Biophysical journal.

[8]  W. Newey,et al.  Uniform Convergence in Probability and Stochastic Equicontinuity , 1991 .

[9]  V. N. Bogaevski,et al.  Matrix Perturbation Theory , 1991 .

[10]  Shun-ichi Amari,et al.  Identifiability of hidden Markov information sources and their minimum degrees of freedom , 1992, IEEE Trans. Inf. Theory.

[11]  Lonnie Chrisman,et al.  Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach , 1992, AAAI.

[12]  B. Leroux Maximum-likelihood estimation for hidden Markov models , 1992 .

[13]  J. Rice,et al.  Maximum likelihood estimation and identification directly from single-channel recordings , 1992, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[14]  L. Finesso Consistent estimation of the order for Markov and hidden Markov chains , 1992 .

[15]  Philip C. Woodland,et al.  Speaker adaptation of continuous density HMMs using multivariate linear regression , 1994, ICSLP.

[16]  Andrew McCallum,et al.  Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State , 1995, ICML.

[17]  Sanjoy Dasgupta,et al.  Learning mixtures of Gaussians , 1999, 40th Annual Symposium on Foundations of Computer Science (Cat. No.99CB37039).

[18]  Herbert Jaeger,et al.  Observable Operator Models for Discrete Stochastic Time Series , 2000, Neural Computation.

[19]  Robert E. Mahony,et al.  Lumpable hidden Markov models-model reduction and reduced complexity filtering , 2000, IEEE Trans. Autom. Control..

[20]  R Rosales,et al.  Bayesian restoration of ion channel records using hidden Markov models. , 2001, Biophysical journal.

[21]  Mario Stanke,et al.  Gene prediction with a hidden Markov model and a new intron submodel , 2003, ECCB.

[22]  James Witkoskie,et al.  Single molecule kinetics. I. Theoretical analysis of indicators. , 2004, The Journal of chemical physics.

[23]  Guy Shani,et al.  Resolving Perceptual Aliasing In The Presence Of Noisy Sensors , 2004, NIPS.

[24]  Dimitris Achlioptas,et al.  On Spectral Learning of Mixtures of Distributions , 2005, COLT.

[25]  Eric Moulines,et al.  Inference in hidden Markov models , 2010, Springer series in statistics.

[26]  R. C. Bradley Basic properties of strong mixing conditions. A survey and some open questions , 2005, math/0511078.

[27]  Eric Moulines,et al.  Inference in Hidden Markov Models (Springer Series in Statistics) , 2005 .

[28]  Guy Shani,et al.  Model-Based Online Learning of POMDPs , 2005, ECML.

[29]  J. Feldman,et al.  Learning mixtures of product distributions over discrete domains , 2005, 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS'05).

[30]  Haikady N. Nagaraja,et al.  Inference in Hidden Markov Models , 2006, Technometrics.

[31]  Bart De Moor,et al.  Equivalence of state representations for hidden Markov models , 2007, 2007 European Control Conference (ECC).

[32]  Daniel G. Brown,et al.  The most probable annotation problem in HMMs and its application to bioinformatics , 2007, J. Comput. Syst. Sci..

[33]  Wai-Kiang Yeap,et al.  Robotics and Cognitive Approaches to Spatial Mapping , 2010, Springer Tracts in Advanced Robotics.

[34]  Anthony J. Bagnall,et al.  Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception , 2009, Adapt. Behav..

[35]  C. Matias,et al.  Identifiability of parameters in latent structure models with many observed variables , 2008, 0809.5032.

[36]  Boaz Nadler,et al.  Non-Parametric Detection of the Number of Signals: Hypothesis Testing and Random Matrix Theory , 2009, IEEE Transactions on Signal Processing.

[37]  Byron Boots,et al.  Reduced-Rank Hidden Markov Models , 2009, AISTATS.

[38]  Mikhail Belkin,et al.  Polynomial Learning of Distribution Families , 2010, 2010 IEEE 51st Annual Symposium on Foundations of Computer Science.

[39]  Anima Anandkumar,et al.  A Method of Moments for Mixture Models and Hidden Markov Models , 2012, COLT.

[40]  Aryeh Kontorovich,et al.  On learning parametric-output HMMs , 2013, ICML.

[41]  Munther A. Dahleh,et al.  Minimal realization problem for Hidden Markov Models , 2014, 2014 52nd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[42]  Aryeh Kontorovich,et al.  Uniform Chernoff and Dvoretzky-Kiefer-Wolfowitz-Type Inequalities for Markov Chains and Related Processes , 2012, Journal of Applied Probability.

[43]  Dean Alderucci A SPECTRAL ALGORITHM FOR LEARNING HIDDEN MARKOV MODELS THAT HAVE SILENT STATES , 2015 .

[44]  Munther A. Dahleh,et al.  Minimal Realization Problems for Hidden Markov Models , 2014, IEEE Transactions on Signal Processing.

[45]  Clive R. Bagshaw Single-Molecule Kinetics , 2017 .