Estimation in the Mixture Transition Distribution Model for High Order Markov Chains

Abstract : The mixture transition distribution (MTD) model was introduced by Raftery (1985) as a parsimonious model for high-order Markov chains. It is flexible, can represent a wide range of dependence patterns, can be physically motivated, fits data well, and appears to be a discrete-valued analogue for the class of autoregressive time series models. However, estimation has presented difficulties because the parameter space is highly nonconvex, being defined by a large number of nonlinear constraints. Here we propose an efficient computational algorithm for maximum likelihood estimation which is based on a way of reducing the large number of constraints. This also allows more structured versions of the model, for example those involving structural zeros, to be fit quite easily. A way of fitting the model using GLIM is also discussed. The algorithm is applied to a sequence of wind directions, and also to two sequences of DNA bases from introns from genes of the mouse. In each case, the MTD model fits better than the conventional Markov chain model.

[1]  L. Hood,et al.  Organization, structure, and function of 95 kb of DNA spanning the murine T-cell receptor CαCδ region , 1992 .

[2]  A stochastic analysis of three viral sequences. , 1992, Molecular biology and evolution.

[3]  D. Shugar,et al.  Methods in enzymology: Volume 183 molecular evolution: Computer analysis of protein and nucleic acid sequences , 1991 .

[4]  W. Li,et al.  Some Results on the Estimation of a Higher Order Markov Chain , 1990 .

[5]  R. Doolittle Molecular evolution: computer analysis of protein and nucleic acid sequences. , 1990, Methods in enzymology.

[6]  A. Raftery,et al.  MODELING OUTLIERS , BURSTS , AND FLAT STRETCHES IN TIME SERIES USING MIXTURE TRANSITION DISTRIBUTION ( MTD ) l \ 10 DELS by , 1990 .

[7]  D. Mellor,et al.  The Analysis of Directional Time Series: Applications to Wind Speed and Direction , 1989 .

[8]  John Hinde,et al.  Statistical Modelling in GLIM. , 1989 .

[9]  T. Kirkwood,et al.  Statistical Analysis of Deoxyribonucleic Acid Sequence Data-a Review , 1989 .

[10]  M. Waterman Mathematical Methods for DNA Sequences , 1989 .

[11]  ANALYSIS OF DISCRETE LONGITUDINAL DATA: INFINITE-LAG MARKOV MODELS , 1989 .

[12]  S. Tavaré,et al.  Codon preference and primary sequence structure in protein-coding regions. , 1989, Bulletin of mathematical biology.

[13]  S. Adke,et al.  Limit Distribution of a High Order Markov Chain , 1988 .

[14]  Some results on higher order Markov Chain models , 1988 .

[15]  Peter S. Craig,et al.  Time Series Analysis for Directional Data , 1988 .

[16]  Adrian E. Raftery,et al.  Approximate Bayes factors for generalized linear models , 1988 .

[17]  A. Raftery,et al.  Space-time modeling with long-memory dependence: assessing Ireland's wind-power resource. Technical report , 1987 .

[18]  J. Berger,et al.  Testing a Point Null Hypothesis: The Irreconcilability of P Values and Evidence , 1987 .

[19]  A. Raftery Choosing Models for Cross-Classifications , 1986 .

[20]  A. Raftery A Note on Bayes Factors for Log‐Linear Contingency Table Models with Vague Prior Information , 1986 .

[21]  A. Raftery A model for high-order Markov chains , 1985 .

[22]  R. Katz On Some Criteria for Estimating the Order of a Markov Chain , 1981 .

[23]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[24]  M. Thomas,et al.  An Approximate Test of Markov Chain Lumpability , 1977 .

[25]  P. Billingsley,et al.  Statistical inference for Markov processes , 1961 .

[26]  John G. Kemeny,et al.  Finite Markov chains , 1960 .