Hidden Markov processes

An overview of statistical and information-theoretic aspects of hidden Markov processes (HMPs) is presented. An HMP is a discrete-time finite-state homogeneous Markov chain observed through a discrete-time memoryless invariant channel. In recent years, the work of Baum and Petrie (1966) on finite-state finite-alphabet HMPs was expanded to HMPs with finite as well as continuous state spaces and a general alphabet. In particular, statistical properties and ergodic theorems for relative entropy densities of HMPs were developed. Consistency and asymptotic normality of the maximum-likelihood (ML) parameter estimator were proved under some mild conditions. Similar results were established for switching autoregressive processes. These processes generalize HMPs. New algorithms were developed for estimating the state, parameter, and order of an HMP, for universal coding and classification of HMPs, and for universal decoding of hidden Markov channels. These and other related topics are reviewed.

[1]  Ghassan Kawas Kaleh,et al.  Joint parameter estimation and symbol detection for linear or nonlinear unknown channels , 1994, IEEE Trans. Commun..

[2]  D. M. Titterington,et al.  The influence of initial conditions on maximum likelihood estimation of the parameters of a binary hidden Markov model , 1998 .

[3]  N D Le,et al.  Exact likelihood evaluation in a Markov mixture model for time series of seizure counts. , 1992, Biometrics.

[4]  D. Brillinger Time series - data analysis and theory , 1981, Classics in applied mathematics.

[5]  Robert M. Gray,et al.  The ergodic decomposition of stationary discrete random processes , 1974, IEEE Trans. Inf. Theory.

[6]  David L. Neuhoff,et al.  Quantization , 2022, IEEE Trans. Inf. Theory.

[7]  Dimitri Kanevsky,et al.  An inequality for rational functions with applications to some statistical estimation problems , 1991, IEEE Trans. Inf. Theory.

[8]  Jerry M. Mendel,et al.  Optimal simultaneous detection and estimation of filtered discrete semi-Markov chains , 1988, IEEE Trans. Inf. Theory.

[9]  S. Levinson,et al.  Image Models (and their Speech Model Cousins) , 1996 .

[10]  T. Cover,et al.  Rate Distortion Theory , 2001 .

[11]  Dennis J. Clague,et al.  New Classes of Synchronous Codes , 1967, IEEE Trans. Electron. Comput..

[12]  N. Phamdo,et al.  Optimal Detection of Discrete Markov Sources Over Discrete Memoryless Channels - Applications to Combined Source-Channel Coding , 1993, Proceedings. IEEE International Symposium on Information Theory.

[13]  D. A. Bell,et al.  Information Theory and Reliable Communication , 1969 .

[14]  Georg Lindgren,et al.  Recursive estimation in mixture models with Markov regime , 1991, IEEE Trans. Inf. Theory.

[15]  S. Dharmadhikari Sufficient Conditions for a Stationary Process to be a Function of a Finite Markov Chain , 1963 .

[16]  T. Louis Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .

[17]  Shun-ichi Amari,et al.  A Theory of Adaptive Pattern Classifiers , 1967, IEEE Trans. Electron. Comput..

[18]  D. Titterington Recursive Parameter Estimation Using Incomplete Data , 1984 .

[19]  C. Francq,et al.  On White Noises Driven by Hidden Markov Chains , 1997 .

[20]  Jorma Rissanen,et al.  The Minimum Description Length Principle in Coding and Modeling , 1998, IEEE Trans. Inf. Theory.

[21]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[22]  Biing-Hwang Juang,et al.  The segmental K-means algorithm for estimating parameters of hidden Markov models , 1990, IEEE Trans. Acoust. Speech Signal Process..

[23]  Pravin Varaiya,et al.  Capacity, mutual information, and coding for finite-state Markov channels , 1996, IEEE Trans. Inf. Theory.

[24]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[25]  Robert J. Elliott,et al.  New finite-dimensional filters for parameter estimation of discrete-time linear Gaussian models , 1999, IEEE Trans. Autom. Control..

[26]  Chin-Hui Lee,et al.  Segmental GPD training of HMM based speech recognizer , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[27]  A. Albertsen,et al.  Estimation of kinetic rate constants from multi-channel recordings by a direct fit of the time series. , 1994, Biophysical journal.

[28]  B. Lautrup,et al.  Products of random matrices. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  Chi-Chao Chao,et al.  Hidden Markov models for the burst error statistics of Viterbi decoding , 1996, IEEE Trans. Commun..

[30]  B. Efron,et al.  The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[31]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[32]  Bret Larget,et al.  A canonical representation for aggregated Markov processes , 1998, Journal of Applied Probability.

[33]  R. Douc,et al.  Asymptotic properties of the maximum likelihood estimator in autoregressive models with Markov regime , 2004, math/0503681.

[34]  James D. Hamilton State-space models , 1994 .

[35]  Peter Bryant,et al.  Asymptotic behaviour of classification maximum likelihood estimates , 1978 .

[36]  Laurent Mevel,et al.  Exponential Forgetting and Geometric Ergodicity in Hidden Markov Models , 2000, Math. Control. Signals Syst..

[37]  Lee D. Davisson,et al.  Universal noiseless coding , 1973, IEEE Trans. Inf. Theory.

[38]  J. Sansom,et al.  A Hidden Markov Model for Rainfall Using Breakpoint Data , 1998 .

[39]  John C. Kieffer,et al.  Strongly consistent code-based identification and order estimation for constrained finite-state model classes , 1993, IEEE Trans. Inf. Theory.

[40]  Louis A. Liporace,et al.  Maximum likelihood estimation for multivariate observations of Markov sources , 1982, IEEE Trans. Inf. Theory.

[41]  Fady Alajaji,et al.  Detection of binary Markov sources over channels with additive Markov noise , 1996, IEEE Trans. Inf. Theory.

[42]  Aaron D. Wyner,et al.  The rate-distortion function for source coding with side information at the decoder , 1976, IEEE Trans. Inf. Theory.

[43]  Robert D. Nowak,et al.  Wavelet-based statistical signal processing using hidden Markov models , 1998, IEEE Trans. Signal Process..

[44]  Christian Francq,et al.  Ergodicity of Autoregressive Processes with Markov-Switching and Consistency of the Maximum-Likelihood Estimator , 1998 .

[45]  Lakdere Benkherouf,et al.  A HIDDEN MARKOV MODEL FOR AN INVENTORY SYSTEM WITH PERISHABLE ITEMS , 1997 .

[46]  Christophe Andrieu,et al.  Simulated annealing for maximum a Posteriori parameter estimation of hidden Markov models , 2000, IEEE Trans. Inf. Theory.

[47]  A. Koski Modelling ECG signals with hidden Markov models , 1996, Artif. Intell. Medicine.

[48]  New York Dover,et al.  ON THE CONVERGENCE PROPERTIES OF THE EM ALGORITHM , 1983 .

[49]  Sylvia Richardson,et al.  Stochastic EM: method and application , 1995 .

[50]  Robin J. Evans,et al.  Multiple Frequency Line Tracking with Hidden Markov Models - Further Results , 1993, IEEE Trans. Signal Process..

[51]  Arthur Nadas,et al.  Optimal solution of a training problem in speech recognition , 1985, IEEE Trans. Acoust. Speech Signal Process..

[52]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[53]  R. Gray Entropy and Information Theory , 1990, Springer New York.

[54]  Axthonv G. Oettinger,et al.  IEEE Transactions on Information Theory , 1998 .

[55]  Stanley M. Dunn,et al.  Texture Classification Using Noncausal Hidden Markov Models , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[56]  Alain Glavieux,et al.  Reflections on the Prize Paper : "Near optimum error-correcting coding and decoding: turbo codes" , 1998 .

[57]  J. Makhoul,et al.  The voice of the computer is heard in the land (and it listens too!) [speech recognition] , 1997 .

[58]  Lawrence R. Rabiner,et al.  A segmental k-means training procedure for connected word recognition , 1986, AT&T Technical Journal.

[59]  Jeffrey L. Krolik,et al.  Maximum likelihood coordinate registration for over-the-horizon radar , 1997, IEEE Transactions on Signal Processing.

[60]  T. Rydén,et al.  Stylized Facts of Daily Return Series and the Hidden Markov Model , 1998 .

[61]  G. Monahan State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms , 1982 .

[62]  A. Farago,et al.  Algorithm to find the global optimum of left-to-right hidden Markov model parameters , 1989 .

[63]  Robert M. Gray,et al.  Global convergence and empirical consistency of the generalized Lloyd algorithm , 1986, IEEE Trans. Inf. Theory.

[64]  F. Jelinek Fast sequential decoding algorithm using a stack , 1969 .

[65]  Michael Gutman,et al.  Asymptotically optimal classification for multiple tests with empirically observed statistics , 1989, IEEE Trans. Inf. Theory.

[66]  C. Robert,et al.  Bayesian estimation of hidden Markov chains: a stochastic implementation , 1993 .

[67]  R. Douc,et al.  Asymptotics of the maximum likelihood estimator for general hidden Markov models , 2001 .

[68]  Predrag R. Jelenkovic,et al.  State Learning and Mixing in Entropy of Hidden Markov Processes and the Gilbert-Elliott Channel , 1999, IEEE Trans. Inf. Theory.

[69]  J. Kieffer,et al.  Markov Channels are Asymptotically Mean Stationary , 1981 .

[70]  N. Merhav,et al.  Hidden Markov modeling using a dominant state sequence with application to speech recognition , 1991 .

[71]  Renato De Mori,et al.  High-performance connected digit recognition using maximum mutual information estimation , 1994, IEEE Trans. Speech Audio Process..

[72]  Marcelo J. Weinberger,et al.  Upper bounds on the probability of sequences emitted by finite-state sources and on the redundancy of the Lempel-Ziv algorithm , 1992, IEEE Trans. Inf. Theory.

[73]  Robert J. Fontana Limit theorems for slowly varying composite sources , 1980, IEEE Trans. Inf. Theory.

[74]  A. Glavieux,et al.  Near Shannon limit error-correcting coding and decoding: Turbo-codes. 1 , 1993, Proceedings of ICC '93 - IEEE International Conference on Communications.

[75]  M. Puterman,et al.  Maximum-penalized-likelihood estimation for independent and Markov-dependent mixture models. , 1992, Biometrics.

[76]  Ofer Zeitouni,et al.  Asymptotic filtering for finite state Markov chains , 1996 .

[77]  T. Riden,et al.  Consistent and Asymptotically Normal Parameter Estimates for Markov Modulated Poisson Processes , 1995 .

[78]  Robert J. Fontana On universal coding for classes of composite and remote sources with memory , 1981, IEEE Trans. Inf. Theory.

[79]  Roman Kuc,et al.  Identification of hidden Markov models for ion channel currents. I. Colored background noise , 1998, IEEE Trans. Signal Process..

[80]  David J. Miller,et al.  Low-delay optimal MAP state estimation in HMM's with application to symbol decoding , 1997, IEEE Signal Processing Letters.

[81]  J Honerkamp,et al.  Analysis of multichannel patch clamp recordings by hidden Markov models. , 1997, Biometrics.

[82]  Aaron D. Wyner,et al.  Some asymptotic properties of the entropy of a stationary ergodic data source with applications to data compression , 1989, IEEE Trans. Inf. Theory.

[83]  S. Dharmadhikari Functions of Finite Markov Chains , 1963 .

[84]  Vikram Krishnamurthy,et al.  Hidden Markov Model Signal Processing in Presence , 1996 .

[85]  L. R. Rabiner,et al.  An introduction to the application of the theory of probabilistic functions of a Markov process to automatic speech recognition , 1983, The Bell System Technical Journal.

[86]  A. B. Poritz,et al.  Linear predictive hidden Markov models and the speech signal , 1982, ICASSP.

[87]  S. Richardson,et al.  Mixtures of distributions: inference and estimation , 1995 .

[88]  Bernard Delyon Remarks on linear and nonlinear filtering , 1995, IEEE Trans. Inf. Theory.

[89]  Neri Merhav,et al.  Optimal sequential probability assignment for individual sequences , 1994, IEEE Trans. Inf. Theory.

[90]  T. Rydén Estimating the Order of Hidden Markov Models , 1995 .

[91]  Vikram Krishnamurthy,et al.  Adaptive nonlinear filters for narrow-band interference suppression in spread-spectrum CDMA systems , 1999, IEEE Trans. Commun..

[92]  Paolo Giudici,et al.  Likelihood‐Ratio Tests for Hidden Markov Models , 2000, Biometrics.

[93]  B. G. Quinn,et al.  Random Coefficient Autoregressive Models: An Introduction , 1982 .

[94]  Israel Bar-David,et al.  Capacity and coding for the Gilbert-Elliot channels , 1989, IEEE Trans. Inf. Theory.

[95]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[96]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[97]  J. Cadre,et al.  Bearings-only tracking for maneuvering sources , 1998 .

[98]  S. W. Dharmadhikari A Characterisation of a Class of Functions of Finite Markov Chains , 1965 .

[99]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[100]  Prakash Narayan,et al.  Reliable Communication Under Channel Uncertainty , 1998, IEEE Trans. Inf. Theory.

[101]  Robert J. Fontana Universal codes for a class of composite sources (Corresp.) , 1980, IEEE Trans. Inf. Theory.

[102]  M. Borodovsky,et al.  GeneMark.hmm: new solutions for gene finding. , 1998, Nucleic acids research.

[103]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[104]  R. Adler Ergodic and mixing properties of infinite memory channels , 1961 .

[105]  D. Magill Optimal adaptive estimation of sampled stochastic processes , 1965 .

[106]  Paul D. Feigin,et al.  RANDOM COEFFICIENT AUTOREGRESSIVE PROCESSES:A MARKOV CHAIN ANALYSIS OF STATIONARITY AND FINITENESS OF MOMENTS , 1985 .

[107]  Biing-Hwang Juang,et al.  Mixture autoregressive hidden Markov models for speech signals , 1985, IEEE Trans. Acoust. Speech Signal Process..

[108]  John B. Moore,et al.  On-line identification of hidden Markov models via recursive prediction error techniques , 1994, IEEE Trans. Signal Process..

[109]  Neri Merhav,et al.  Lower and upper bounds on the minimum mean-square error in composite source signal estimation , 1991, IEEE Trans. Inf. Theory.

[110]  U. Holst,et al.  Recursive estimation in switching autoregressions with Markov regime , 1994 .

[111]  Jacob Ziv,et al.  Universal decoding for finite-state channels , 1985, IEEE Trans. Inf. Theory.

[112]  S H Chung,et al.  Characterization of single channel currents using digital signal processing techniques based on Hidden Markov Models. , 1990, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[113]  Adrian Segall,et al.  Stochastic processes in estimation theory , 1976, IEEE Trans. Inf. Theory.

[114]  Yariv Ephraim,et al.  Estimation of hidden Markov model parameters by minimizing empirical error rate , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[115]  J. Ziv Compression, tests for randomness and estimating the statistical model of an individual sequence , 1990 .

[116]  W. Wonham Some applications of stochastic difierential equations to optimal nonlinear ltering , 1964 .

[117]  J. A. Kogan,et al.  Hidden Markov models estimation via the most informative stopping times for Viterbi algorithm , 1995, Proceedings of 1995 IEEE International Symposium on Information Theory.

[118]  Roman Kuc,et al.  Identification of hidden Markov models for ion channel currents. II. State-dependent excess noise , 1998, IEEE Trans. Signal Process..

[119]  Amir Dembo,et al.  Exact filters for the estimation of the number of transitions of finite-state continuous-time Markov processes , 1988, IEEE Trans. Inf. Theory.

[120]  Athanasios Kehagias,et al.  Bayesian classification of Hidden Markov Models , 1996 .

[121]  V. Fabian On Asymptotically Efficient Recursive Estimation , 1978 .

[122]  Jorma Rissanen,et al.  Universal coding, information, prediction, and estimation , 1984, IEEE Trans. Inf. Theory.

[123]  C. Robert,et al.  Estimation of Finite Mixture Distributions Through Bayesian Sampling , 1994 .

[124]  H. Derin,et al.  A recursive algorithm for the Bayes solution of the smoothing problem , 1981 .

[125]  R. Khasminskii,et al.  Some Procedures for State Estimation of a Hidden Markov Chain with Two States , 1994 .

[126]  J. Kingman Subadditive Ergodic Theory , 1973 .

[127]  Nam C. Phamdo,et al.  Optimal detection of discrete Markov sources over discrete memoryless channels - applications to combined source-channel coding , 1994, IEEE Trans. Inf. Theory.

[128]  J. W. Carlyle,et al.  Identification of State-calculable Functions of Finite Markov Chains , 1967 .

[129]  T. Rydén An EM algorithm for estimation in Markov-modulated Poisson processes , 1996 .

[130]  H. Heffes,et al.  A class of data traffic processes — covariance function characterization and related queuing results , 1980, The Bell System Technical Journal.

[131]  D A Coast,et al.  Use of hidden Markov models for electrocardiographic signal analysis. , 1990, Journal of electrocardiology.

[132]  Yariv Ephraim,et al.  Statistical-model-based speech enhancement systems , 1992, Proc. IEEE.

[133]  G. Kitagawa,et al.  Non-Gaussian State—Space Modeling of Nonstationary Time Series , 1987 .

[134]  G. Churchill Stochastic models for heterogeneous DNA sequences. , 1989, Bulletin of mathematical biology.

[135]  Roy L. Streit,et al.  Frequency line tracking using hidden Markov models , 1990, IEEE Trans. Acoust. Speech Signal Process..

[136]  Jonathan D. Cryer,et al.  Time Series Analysis , 1986, Encyclopedia of Big Data.

[137]  L. Shepp,et al.  A POISSON PROCESS WHOSE RATE IS A HIDDEN MARKOV PROCESS , 1982 .

[138]  Louis L. Scharf,et al.  Modulo-2 Pi phase sequence estimation (Corresp.) , 1980, IEEE Trans. Inf. Theory.

[139]  Vikram Krishnamurthy,et al.  A filtered EM algorithm for joint hidden Markov model and sinusoidal parameter estimation , 1995, IEEE Trans. Signal Process..

[140]  A. Cohen,et al.  Finite Mixture Distributions , 1982 .

[141]  Yang He,et al.  2-D Shape Classification Using Hidden Markov Model , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[142]  L. Finesso Consistent estimation of the order for Markov and hidden Markov chains , 1992 .

[143]  Neri Merhav,et al.  When is the generalized likelihood ratio test optimal? , 1992, IEEE Trans. Inf. Theory.

[144]  H. Krolzig Markov-Switching Vector Autoregressions: Modelling, Statistical Inference, and Application to Business Cycle Analysis , 1997 .

[145]  Robert J. McEliece,et al.  The generalized distributive law , 2000, IEEE Trans. Inf. Theory.

[146]  Peter Guttorp,et al.  A Hidden Markov Model for Space‐Time Precipitation , 1991 .

[147]  Adrian Segall,et al.  Recursive estimation from discrete-time point processes , 1976, IEEE Trans. Inf. Theory.

[148]  D. Blackwell,et al.  On the Identifiability Problem for Functions of Finite Markov Chains , 1957 .

[149]  Prakash Narayan,et al.  Order estimation and sequential universal data compression of a hidden Markov source by the method of mixtures , 1994, IEEE Trans. Inf. Theory.

[150]  John B. Moore,et al.  Hidden Markov Models: Estimation and Control , 1994 .

[151]  Michael Woodroofe,et al.  A local limit theorem for hidden Markov chains , 1997 .

[152]  Vincent Fontaine,et al.  Automatic classification of environmental noise events by hidden Markov models , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[153]  David M. Lucantoni,et al.  A Markov Modulated Characterization of Packetized Voice and Data Traffic and Related Statistical Multiplexer Performance , 1986, IEEE J. Sel. Areas Commun..

[154]  C. J. Wellekens,et al.  Explicit time correlation in hidden Markov models for speech recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[155]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[156]  I. Csiszár Why least squares and maximum entropy? An axiomatic approach to inference for linear inverse problems , 1991 .

[157]  T Petrie,et al.  Probabilistic functions of finite-state markov chains. , 1967, Proceedings of the National Academy of Sciences of the United States of America.

[158]  Jacob Ziv,et al.  On classification with empirically observed statistics and universal data compression , 1988, IEEE Trans. Inf. Theory.

[159]  T. Rydén,et al.  Consistent Estimation of Linear and Non‐linear Autoregressive Models with Markov Regime , 1998 .

[160]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[161]  Lawrence R. Rabiner,et al.  On the relations between modeling approaches for speech recognition , 1990, IEEE Trans. Inf. Theory.

[162]  Lain L. MacDonald,et al.  Hidden Markov and Other Models for Discrete- valued Time Series , 1997 .

[163]  Stanley L. Sclove,et al.  Application of the Conditional Population-Mixture Model to Image Segmentation , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[164]  J. L. Devore A note on the observation of a Markov source through a noisy channel (Corresp.) , 1974, IEEE Trans. Inf. Theory.

[165]  T. Rydén Asymptotically efficient recursive estimation for incomplete data models using the observed information , 1998 .

[166]  D R Fredkin,et al.  Bayesian restoration of single-channel patch clamp recordings. , 1992, Biometrics.

[167]  C. Robert Mixtures of Distributions: Inference and Estimation , 1996 .

[168]  Christian P. Robert,et al.  Reparameterization strategies for hidden Markov models and Bayesian approaches to maximum likelihood estimation , 1998, Stat. Comput..

[169]  O. F. Cook The Method of Types , 1898 .

[170]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[171]  H. Akaike A new look at the statistical model identification , 1974 .

[172]  Ehud Weinstein,et al.  Sequential algorithms for parameter estimation based on the Kullback-Leibler information measure , 1990, IEEE Trans. Acoust. Speech Signal Process..

[173]  Amos Lapidoth,et al.  On the Universality of the LZ-Based Decoding Algorithm , 1998, IEEE Trans. Inf. Theory.

[174]  D. M. Titterington,et al.  Computational Bayesian Analysis of Hidden Markov Models , 1998 .

[175]  Neri Merhav,et al.  A Bayesian approach for classification of Markov sources , 1991, IEEE Trans. Inf. Theory.

[176]  Robin J. Evans,et al.  Frequency-wavenumber tracking using hidden Markov models , 1993, IEEE Trans. Signal Process..

[177]  B. Leroux Stochastic Processes and Their Applications Maximum-likelihood Estimation for Hidden Markov Models Markov Chain * Consistency * Subadditive Ergodic Theorem * Identifiability * Entropy * Kullback-leibler Divergence * Shannon-mcmillan-breiman Theorem , 2022 .

[178]  M. Rosenblatt Markov Processes, Structure and Asymptotic Behavior , 1971 .

[179]  Donald B. Rubin,et al.  Max-imum Likelihood from Incomplete Data , 1972 .

[180]  Hans Kiinsch,et al.  State Space and Hidden Markov Models , 2000 .

[181]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[182]  G. Radons,et al.  Analysis, classification, and coding of multielectrode spike trains with hidden Markov models , 2004, Biological Cybernetics.

[183]  Robert M. Gray,et al.  Asymptotically mean stationary channels , 1981, IEEE Trans. Inf. Theory.

[184]  L. Scharf,et al.  Modulo-2 Pi Phase Sequence Estimation. , 1978 .

[185]  John B. Moore,et al.  A Soft Output Hybrid Algorithm for ML/MAP Sequence Estimation , 1998, IEEE Trans. Inf. Theory.

[186]  Peter J. Bickel,et al.  Inference in hidden Markov models I: Local asymptotic normality in the stationary case , 1996 .

[187]  E. O. Elliott Estimates of error rates for codes on burst-noise channels , 1963 .

[188]  Robert M. Gray,et al.  An Algorithm for the Design of Labeled-Transition Finite-State Vector Quantizers , 1985, IEEE Trans. Commun..

[189]  L. Goddard Information Theory , 1962, Nature.

[190]  Edward J. Wegman,et al.  Statistical Signal Processing , 1985 .

[191]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[192]  F. Jelinek,et al.  Continuous speech recognition by statistical methods , 1976, Proceedings of the IEEE.

[193]  Neri Merhav,et al.  Universal coding with minimum probability of codeword length overflow , 1991, IEEE Trans. Inf. Theory.

[194]  Padhraic Smyth,et al.  Markov monitoring with unknown states , 1994, IEEE J. Sel. Areas Commun..

[195]  Robert J. Elliott,et al.  New finite-dimensional filters and smoothers for noisily observed Markov chains , 1993, IEEE Trans. Inf. Theory.

[196]  Marcelo Weinberger,et al.  Upper Bounds On The Probability Of Sequences Emitted By Finite-state Sources And On The Redundancy Of The Lempel-Ziv Algorithm , 1991, Proceedings. 1991 IEEE International Symposium on Information Theory.

[197]  Javier Garcia-Frías,et al.  Turbo decoding of hidden Markov sources with unknown parameters , 1998, Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225).

[198]  Wolfgang Fischer,et al.  The Markov-Modulated Poisson Process (MMPP) Cookbook , 1993, Perform. Evaluation.

[199]  Gerhard Tutz,et al.  State Space and Hidden Markov Models , 2001 .

[200]  Vikram Krishnamurthy,et al.  Time discretization of continuous-time filters and smoothers for HMM parameter estimation , 1996, IEEE Trans. Inf. Theory.

[201]  Georg Lindgren,et al.  Recursive estimation of parameters in Markov-modulated Poisson processes , 1995, IEEE Trans. Commun..

[202]  Murray A. Cameron,et al.  Hidden Markov chains in generalized linear models , 1998 .

[203]  Langford B. White,et al.  A reduced-complexity online state sequence and parameter estimator for superimposed convolutional coded signals , 1997, IEEE Trans. Commun..

[204]  C. Robert,et al.  Bayesian estimation of switching ARMA models , 1999, Journal of Econometrics.

[205]  James P. Hughes,et al.  Computing the observed information in the hidden Markov model using the EM algorithm , 1997 .

[206]  Biing-Hwang Juang,et al.  Discriminative learning for minimum error classification [pattern recognition] , 1992, IEEE Trans. Signal Process..

[207]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[208]  John B. Anderson,et al.  Tree encoding of speech , 1975, IEEE Trans. Inf. Theory.

[209]  S. W. Dharmadhikari Exchangeable Processes which are Functions of Stationary Markov Chains , 1964 .

[210]  T. Rydén Consistent and Asymptotically Normal Parameter Estimates for Hidden Markov Models , 1994 .

[211]  A. Dembo,et al.  Parameter estimation of partially observed continuous time stochastic processes via the EM algorithm , 1992 .

[212]  A. Camproux,et al.  A hidden Markov model approach to neuron firing patterns. , 1996, Biophysical journal.

[213]  Gene Ott,et al.  Compact encoding of stationary Markov sources , 1967, IEEE Trans. Inf. Theory.

[214]  Langford B. White,et al.  Spatial filtering of superimposed convolutional coded signals , 1997, IEEE Trans. Commun..

[215]  Rodney W. Johnson,et al.  Axiomatic derivation of the principle of maximum entropy and the principle of minimum cross-entropy , 1980, IEEE Trans. Inf. Theory.

[216]  L. Baum,et al.  An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .

[217]  Neri Merhav,et al.  Universal composite hypothesis testing: A competitive minimax approach , 2002, IEEE Trans. Inf. Theory.

[218]  A. Barron THE STRONG ERGODIC THEOREM FOR DENSITIES: GENERALIZED SHANNON-MCMILLAN-BREIMAN THEOREM' , 1985 .

[219]  Hans Arnfinn Karlsen,et al.  Existence of moments in a stationary stochastic difference equation , 1990, Advances in Applied Probability.

[220]  A. Poritz,et al.  Hidden Markov models: a guided tour , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[221]  W. Qian,et al.  Estimation of parameters in hidden Markov models , 1991, Philosophical Transactions of the Royal Society of London. Series A: Physical and Engineering Sciences.

[222]  Abraham Lempel,et al.  Compression of individual sequences via variable-rate coding , 1978, IEEE Trans. Inf. Theory.

[223]  Rangasami L. Kashyap Identification of a transition matrix of a Markov chain from noisy measurements of state , 1970, IEEE Trans. Inf. Theory.

[224]  Meir Feder,et al.  Universal Decoding for Channels with Memory , 1998, IEEE Trans. Inf. Theory.

[225]  Lawrence R. Rabiner,et al.  A minimum discrimination information approach for hidden Markov modeling , 1989, IEEE Trans. Inf. Theory.

[226]  Mazin G. Rahim,et al.  On second order statistics and linear estimation of cepstral coefficients , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[227]  David R. Brillinger,et al.  Time Series: Data Analysis and Theory. , 1982 .

[228]  Vincent Fontaine,et al.  AUTOMATIC CLASSIFICATION OF ENVIRONMENTAL NOISE EVENTS BY HIDDEN MARKOV MODELS , 1998 .

[229]  B. Leroux Consistent estimation of a mixing distribution , 1992 .

[230]  G. Grimmett,et al.  Probability and random processes , 2002 .

[231]  Biing-Hwang Juang,et al.  Statistical and Discriminative Methods for Speech Recognition , 1996 .

[232]  Nariman Farvardin,et al.  Switched scalar quantizers for hidden Markov sources , 1992, IEEE Trans. Inf. Theory.

[233]  Jens Ledet Jensen,et al.  Asymptotic normality of the maximum likelihood estimator in state space models , 1999 .

[234]  David J. Miller,et al.  A sequence-based approximate MMSE decoder for source coding over noisy channels using discrete hidden Markov models , 1998, IEEE Trans. Commun..

[235]  R. Shumway,et al.  AN APPROACH TO TIME SERIES SMOOTHING AND FORECASTING USING THE EM ALGORITHM , 1982 .

[236]  Langford B. White,et al.  Cartesian hidden Markov models with applications , 1992, IEEE Trans. Signal Process..

[237]  L. Geppert Trials & Triumphs Women Engineers in Europe , 1997, IEEE Spectrum.

[238]  Geoffrey J. McLachlan,et al.  Mixture models : inference and applications to clustering , 1989 .

[239]  Til T. Phan,et al.  Text-Independent Speaker Identification , 1999 .

[240]  Robert M. Gray,et al.  Probability, Random Processes, And Ergodic Properties , 1987 .

[241]  T. Rydén Parameter Estimation for Markov Modulated Poisson Processes , 1994 .

[242]  S. Shreve,et al.  Stochastic differential equations , 1955, Mathematical Proceedings of the Cambridge Philosophical Society.

[243]  Richard L. Tweedie,et al.  Markov Chains and Stochastic Stability , 1993, Communications and Control Engineering Series.

[244]  K. Meier-Hellstern A fitting algorithm for Markov-modulated poisson processes having two arrival rates , 1987 .

[245]  R. Chang,et al.  On receiver structures for channels having memory , 1966, IEEE Trans. Inf. Theory.

[246]  Mariëlle Stoelinga,et al.  An Introduction to Probabilistic Automata , 2002, Bull. EATCS.

[247]  Wen-Rong Wu,et al.  Rotation and gray-scale transform-invariant texture classification using spiral resampling, subband decomposition, and hidden Markov model , 1996, IEEE Trans. Image Process..

[248]  Robin J. Evans,et al.  Multiple target tracking and multiple frequency line tracking using hidden Markov models , 1991, IEEE Trans. Signal Process..

[249]  M. Rosenblatt,et al.  A MARKOVIAN FUNCTION OF A MARKOV CHAIN , 1958 .

[250]  G. Carrault,et al.  Heart signal recognition by Hidden Markov Models: the ECG case. , 1994, Methods of information in medicine.

[251]  Amir Dembo,et al.  On the parameters estimation of continuous-time ARMA processes from noisy observations , 1987 .

[252]  R. Gray Rate distortion functions for finite-state finite-alphabet Markov sources , 1971, IEEE Trans. Inf. Theory.

[253]  Tobias Rydén,et al.  On identifiability and order of continuous-time aggregated Markov chains, Markov-modulated Poisson processes, and phase-type distributions , 1996, Journal of Applied Probability.

[254]  Yariv Ephraim,et al.  Speech enhancement using state dependent dynamical system model , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[255]  Lalit R. Bahl,et al.  Design of a linguistic statistical decoder for the recognition of continuous speech , 1975, IEEE Trans. Inf. Theory.

[256]  Jeffrey L. Krolik,et al.  Over‐the‐horizon radar target localization using a hidden Markov model estimated from ionosonde data , 1998 .

[257]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[258]  Neri Merhav,et al.  Universal classification for hidden Markov models , 1991, IEEE Trans. Inf. Theory.

[259]  G. Lindgren Markov regime models for mixed distributions and switching regressions , 1978 .

[260]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[261]  Ghassan Kawas Kaleh The Baum-Welch algorithm for the detection of time-unsynchronized rectangular PAM signals , 1994, IEEE Trans. Commun..

[262]  D. B. Preston Spectral Analysis and Time Series , 1983 .

[263]  D. Blackwell,et al.  Proof of Shannon's Transmission Theorem for Finite-State Indecomposable Channels , 1958 .

[264]  John Cocke,et al.  Optimal decoding of linear codes for minimizing symbol error rate (Corresp.) , 1974, IEEE Trans. Inf. Theory.

[265]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[266]  John B. Moore,et al.  On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure , 1993, IEEE Trans. Signal Process..

[267]  R. Weiner Lecture Notes in Economics and Mathematical Systems , 1985 .

[268]  V Krishnamurthy,et al.  Adaptive processing techniques based on hidden Markov models for characterizing very small channel currents buried in noise and deterministic interferences. , 1991, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[269]  Neri Merhav Universal detection of messages via finite-state channels , 2000, IEEE Trans. Inf. Theory.

[270]  Christophe Couvreur Environmental Sound Recognition: A Statistical Approach , 1997 .

[271]  Y. C. Yao,et al.  Estimation of noisy telegraph processes: Nonlinear filtering versus nonlinear smoothing , 1985, IEEE Trans. Inf. Theory.

[272]  Pierre A. Devijver,et al.  Baum's forward-backward algorithm revisited , 1985, Pattern Recognit. Lett..

[273]  F. G. Ball,et al.  Stochastic models for ion channels: introduction and bibliography. , 1992, Mathematical biosciences.

[274]  Subhrakanti Dey,et al.  Blind equalization of IIR channels using hidden Markov models and extended least squares , 1995, IEEE Trans. Signal Process..

[275]  D. M. Titterington,et al.  Comments on "Application of the Conditional Population-Mixture Model to Image Segmentation" , 1984, IEEE Trans. Pattern Anal. Mach. Intell..

[276]  Masoud Salehi,et al.  Communication Systems Engineering , 1994 .

[277]  L. Baum,et al.  An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology , 1967 .

[278]  D. Haussler,et al.  Hidden Markov models in computational biology. Applications to protein modeling. , 1993, Journal of molecular biology.

[279]  William Turin MAP Symbol decoding in channels with error bursts , 2001, IEEE Trans. Inf. Theory.

[280]  I. Csiszár,et al.  The consistency of the BIC Markov order estimator , 2000 .

[281]  T. Rydén On recursive estimation for hidden Markov models , 1997 .

[282]  R. Redner,et al.  Mixture densities, maximum likelihood, and the EM algorithm , 1984 .

[283]  Christophe Couvreur,et al.  Hidden Markov Models and Their Mixtures , 1996 .

[284]  Lalit R. Bahl,et al.  Decoding for channels with insertions, deletions, and substitutions with applications to speech recognition , 1975, IEEE Trans. Inf. Theory.

[285]  John J. Birch Approximations for the Entropy for Functions of Markov Chains , 1962 .

[286]  Donald L. Snyder,et al.  Random Point Processes in Time and Space , 1991 .

[287]  J. Anderson,et al.  Real-number convolutional codes for speech-like quasi-stationary sources (Corresp.) , 1977, IEEE Trans. Inf. Theory.

[288]  Shun-ichi Amari,et al.  Identifiability of hidden Markov information sources and their minimum degrees of freedom , 1992, IEEE Trans. Inf. Theory.

[289]  Stephen E. Levinson,et al.  Continuously variable duration hidden Markov models for automatic speech recognition , 1986 .

[290]  E. Gilbert Capacity of a burst-noise channel , 1960 .

[291]  Robert M. Gray,et al.  Ergodicity of Markov channels , 1987, IEEE Trans. Inf. Theory.

[292]  L. Baum,et al.  Growth transformations for functions on manifolds. , 1968 .

[293]  Robert E. Mahony,et al.  Lumpable hidden Markov models-model reduction and reduced complexity filtering , 2000, IEEE Trans. Autom. Control..

[294]  Robert M. Gray,et al.  Image classification by a two-dimensional hidden Markov model , 2000, IEEE Trans. Signal Process..

[295]  Robert M. Gray,et al.  Rate-distortion speech coding with a minimum discrimination information distortion measure , 1981, IEEE Trans. Inf. Theory.

[296]  Biing-Hwang Juang,et al.  Hidden Markov Models for Speech Recognition , 1991 .

[297]  J. Rice,et al.  Maximum likelihood estimation and identification directly from single-channel recordings , 1992, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[298]  Wen-Rong Wu,et al.  Correction To "rotation And Gray-scale Transform-invariant Texture Classification Using Spiral Resampling, Subband Decomposition, And Hidden Markov Model" , 1996, IEEE Trans. Image Process..

[299]  P S Albert,et al.  A two-state Markov mixture model for a time series of epileptic seizure counts. , 1991, Biometrics.

[300]  E. Ayanoglu Robust and fast failure detection and prediction for fault-tolerant communication network , 1992 .

[301]  R. V. Erickson Functions of Markov Chains , 1970 .

[302]  Amir Dembo,et al.  Maximum a posteriori estimation of time-varying ARMA processes from noisy observations , 1988, IEEE Trans. Acoust. Speech Signal Process..

[303]  Neri Merhav,et al.  Estimating the number of states of a finite-state source , 1992, IEEE Trans. Inf. Theory.

[304]  Harris Drucker Speech processing in a high ambient noise environment , 1967 .

[305]  R. Gray,et al.  Asymptotically Mean Stationary Measures , 1980 .

[306]  R. Bellman Dynamic programming. , 1957, Science.

[307]  Imre Csisźar,et al.  The Method of Types , 1998, IEEE Trans. Inf. Theory.

[308]  Lalit R. Bahl,et al.  Maximum mutual information estimation of hidden Markov model parameters for speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[309]  A. Heller On Stochastic Processes Derived From Markov Chains , 1965 .

[310]  Lalit R. Bahl,et al.  A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[311]  B. Anderson From Wiener to hidden Markov models , 1999 .

[312]  M. Tsatsanis,et al.  Stochastic maximum likelihood methods for semi-blind channel estimation , 1998, IEEE Signal Processing Letters.

[313]  H. Teicher Identifiability of Mixtures of Product Measures , 1967 .

[314]  Tobias Rydén On identifiability and order of continuous-time aggregated Markov chains, Markov-modulated Poisson processes, and phase-type distributions , 1996 .

[315]  J. Baker,et al.  The DRAGON system--An overview , 1975 .

[316]  Padhraic J. Smyth,et al.  Hidden Markov models for fault detection in dynamic systems , 1993 .

[317]  Josef Raviv,et al.  Decision making in Markov chains applied to the problem of pattern recognition , 1967, IEEE Trans. Inf. Theory.

[318]  Edward H. Ip,et al.  Stochastic EM: method and application , 1996 .

[319]  Neri Merhav,et al.  A Bayesian classification approach with application to speech recognition , 1991, IEEE Trans. Signal Process..

[320]  P. Bickel,et al.  Asymptotic normality of the maximum-likelihood estimator for general hidden Markov models , 1998 .

[321]  P. Bougerol,et al.  Strict Stationarity of Generalized Autoregressive Processes , 1992 .