The statistical analysis of general processing tree models with the EM algorithm

Multinomial processing tree models assume that an observed behavior category can arise from one or more processing sequences, each represented as a branch in a tree. These models form a subclass of parametric multinomial models, and they provide a substantively motivated alternative to loglinear models. We consider the usual case where branch probabilities are products of nonnegative integer powers of the parameters, 0 ≤ θ_s ≤ 1, and of their complements, 1 - θ_s. A version of the EM algorithm is constructed that has very strong properties. First, the E-step and the M-step are both analytic and computationally easy; therefore, a fast PC program can be constructed for obtaining MLEs for large numbers of parameters. Second, a closed-form expression for the observed Fisher information matrix is obtained for the entire class. Third, it is proved that the algorithm necessarily converges to a local maximum, which is a stronger result than is available for the exponential family as a whole. Fourth, we show how the algorithm can handle quite general hypothesis tests concerning restrictions on the model parameters. Fifth, we extend the algorithm to handle the Read and Cressie power divergence family of goodness-of-fit statistics. The paper includes an example to illustrate some of these results.
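As an illustration of why both steps can be analytic, here is a minimal Python sketch of an EM iteration for branch probabilities of the form c · ∏ θ_s^a_s (1 - θ_s)^b_s. It is not the paper's program. The function names, the data layout, the one-high-threshold recognition model used as the example (parameters r and g), and the counts are all assumptions made for illustration. The E-step splits each observed category count across its branches in proportion to the branch probabilities; the M-step follows from maximizing the complete-data log-likelihood, which gives θ_s = Σ a_s m / Σ (a_s + b_s) m, where m denotes the expected branch counts.

```python
from math import prod

def branch_prob(theta, c, a, b):
    """Probability of one branch: c * prod_s theta_s**a_s * (1 - theta_s)**b_s."""
    return c * prod(t ** ai * (1 - t) ** bi for t, ai, bi in zip(theta, a, b))

def em_mpt(branches, counts, theta, max_iter=10000, tol=1e-12):
    """EM sketch for a processing tree model (hypothetical helper).

    branches: list of (category_index, c, a, b); a and b hold the integer
              exponents of theta_s and (1 - theta_s) for that branch.
    counts:   observed frequency of each category.
    theta:    starting values strictly inside (0, 1).
    """
    theta = list(theta)
    for _ in range(max_iter):
        # E-step: expected branch counts given the current parameters.
        p = [branch_prob(theta, c, a, b) for (_, c, a, b) in branches]
        cat_p = [0.0] * len(counts)
        for (j, _, _, _), pi in zip(branches, p):
            cat_p[j] += pi
        m = [counts[j] * pi / cat_p[j] for (j, _, _, _), pi in zip(branches, p)]

        # M-step: closed-form update, one ratio per parameter.
        new_theta = []
        for s in range(len(theta)):
            num = sum(mi * a[s] for mi, (_, _, a, b) in zip(m, branches))
            den = sum(mi * (a[s] + b[s]) for mi, (_, _, a, b) in zip(m, branches))
            new_theta.append(num / den if den > 0 else theta[s])

        converged = max(abs(x - y) for x, y in zip(new_theta, theta)) < tol
        theta = new_theta
        if converged:
            break
    return theta

# Assumed example: one-high-threshold model with theta = (r, g).
# Categories: 0 = old item called "old", 1 = old item called "new",
#             2 = new item called "old", 3 = new item called "new".
branches = [
    (0, 1.0, (1, 0), (0, 0)),  # r
    (0, 1.0, (0, 1), (1, 0)),  # (1 - r) * g
    (1, 1.0, (0, 0), (1, 1)),  # (1 - r) * (1 - g)
    (2, 1.0, (0, 1), (0, 0)),  # g
    (3, 1.0, (0, 0), (0, 1)),  # 1 - g
]
counts = [75, 25, 20, 80]  # made-up frequencies

theta_hat = em_mpt(branches, counts, theta=[0.5, 0.5])
print(theta_hat)
```

For this small saturated example the MLE is also available directly: g equals the false-alarm rate 20/100 = 0.2, and r = (0.75 - 0.2) / (1 - 0.2) = 0.6875, so the iteration can be checked against those values.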
