A simulation-based method for model evaluation

We wish to evaluate and compare models that are non-nested and fit to data using different fitting criteria. We first estimate parameters in all models by optimizing goodness-of-fit to a dataset. Then, to assess a candidate model, we simulate a population of datasets from it and evaluate the goodness-of-fit of all the models, without re-estimating parameter values. Finally, we see whether the vector of goodness-of-fit criteria for the original data is compatible with the multivariate distribution of these criteria for the simulated datasets. By simulating from each model in turn, we determine whether any, or several, models are consistent with the data. We apply the method to compare three models, fit at different temporal resolutions to binary time series of animal behaviour data, concluding that a semi-Markov model gives a better fit than latent Gaussian and hidden Markov models.

[1]  D A Williams Discrimination between regression models to determine the pattern of enzyme synthesis in synchronous cell cultures. , 1970, Biometrics.

[2]  Debashis Kushary,et al.  Bootstrap Methods and Their Application , 2000, Technometrics.

[3]  I. Kyriazakis,et al.  Measuring diet selection in dairy cows: effect of training on choice of dietary protein level , 1997 .

[4]  E. Slud,et al.  Binary Time Series , 1980 .

[5]  D. Cox Tests of Separate Families of Hypotheses , 1961 .

[6]  Ian Diamond,et al.  Analysis of Binary Data. 2nd Edn. , 1990 .

[7]  Chris A. Glasbey,et al.  A Spectral Estimator of Arma Parameters from Thresholded Data , 2002, Stat. Comput..

[8]  H. Akaike A new look at the statistical model identification , 1974 .

[9]  Lain L. MacDonald,et al.  Hidden Markov and Other Models for Discrete- valued Time Series , 1997 .

[10]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[11]  R. Veerkamp,et al.  Diet choice by dairy cows. 1. Selection of feed protein content during the first half of lactation. , 1998, Journal of dairy science.

[12]  Iain L. MacDonald,et al.  Hidden Markov Models and Animal Behaviour , 1995 .

[13]  Williams Da,et al.  Discrimination between regression models to determine the pattern of enzyme synthesis in synchronous cell cultures. , 1970 .

[14]  C. Glasbey,et al.  Rainfall Modelling Using a Latent Gaussian Variable , 1997 .

[15]  Ilias Kyriazakis,et al.  The importance of ‘memory’ in statistical models for animal feeding behaviour , 2004, Behavioural Processes.

[16]  Patsy Haccou,et al.  Statistical Analysis of Behavioural Data: An Approach Based on Time-structured Models , 1992 .

[17]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[18]  John Hinde,et al.  Choosing Between Non-nested Models: a Simulation Approach , 1992 .

[19]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[20]  N D Le,et al.  Exact likelihood evaluation in a Markov mixture model for time series of seizure counts. , 1992, Biometrics.

[21]  David R. Cox The analysis of binary data , 1970 .

[22]  A. O'Hagan,et al.  Fractional Bayes factors for model comparison , 1995 .

[23]  Anthony C. Atkinson,et al.  A Method for Discriminating between Models , 1970 .