Model selection strategy in the analysis of capture-recapture data

Analysis of capture-recapture data is critically dependent upon selection of a proper model for inference. Model selection is particularly important in the analysis of multiple, interrelated data sets. This paper evaluates information theoretic approaches to selection of a parsimonious model and compares them to the use of likelihood ratio tests using four a levels. The purpose of the evaluation is to compare model selection strategies based on the quality of the inference, rather than on the degree to which differing selection strategies select the "true model." A measure of squared bias and variance (termed RSS) is used as a basis for comparing different data-based selection strategies, assuming that a minimum RSS value is a reasonable target. In general, the information theoretic approaches consistently selected models with a smaller RSS than did the likelihood ratio testing approach. Two information theoretic criteria have a balance between underfitting and overfitting when compared to models where the average minimum RSS was known. Other findings are presented along with a discussion of the concept of a "true model" and dimension consistency in model selection.

[1]  David R. Anderson,et al.  Data-Based Selection of an Appropriate Biological Model: The Key to Modern Data Analysis , 1992 .

[2]  R. Shibata Statistical aspects of model selection , 1989 .

[3]  David R. Anderson,et al.  AIC MODEL SELECTION IN OVERDISPERSED CAPTURE-RECAPTURE DATA' , 1994 .

[4]  H. Bozdogan Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions , 1987 .

[5]  R. Shibata Selection of the order of an autoregressive model by Akaike's information criterion , 1976 .

[6]  N. Sugiura Further analysts of the data by akaike' s information criterion and the finite corrections , 1978 .

[7]  J. Nichols,et al.  Statistical inference for capture-recapture experiments , 1992 .

[8]  H. Akaike Prediction and Entropy , 1985 .

[9]  R. Huggins Some practical aspects of a conditional likelihood approach to capture experiments , 1991 .

[10]  Clifford M. Hurvich,et al.  Regression and time series model selection in small samples , 1989 .

[11]  Jean Clobert,et al.  Dependance de Facteurs de Milieu dans les Estimations de Taux de Survie par Capture-Recapture , 1985 .

[12]  R. Cormack Estimates of survival from the sighting of marked animals , 1964 .

[13]  David R. Anderson,et al.  Modeling Survival and Testing Biological Hypotheses Using Marked Animals: A Unified Approach with Case Studies , 1992 .

[14]  J. N. Kapur,et al.  Entropy optimization principles with applications , 1992 .

[15]  L. Donald,et al.  of the INTERIOR , 1962 .

[16]  M. Stone An Asymptotic Equivalence of Choice of Model by Cross‐Validation and Akaike's Criterion , 1977 .

[17]  Ping Zhang On the convergence rate of model selection criteria , 1993 .

[18]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[19]  David R. Anderson,et al.  Evaluation of the Kullback-Leibler Discrepancy for Model Selection in Open Population Capture-Recapture Models , 1994 .

[20]  P. M. North Relating Grey Heron survival rates to winter weather conditions , 1979 .

[21]  Hirotugu Akaike,et al.  Analysis of cross classified data by AIC , 1978 .

[22]  Walter Zucchini,et al.  Model Selection , 2011, International Encyclopedia of Statistical Science.

[23]  G. Seber A NOTE ON THE MULTIPLE-RECAPTURE CENSUS. , 1965, Biometrika.

[24]  G. P. Kirkwood,et al.  Estimation of survival in marked populations with possibly dependent sighting probabilities , 1981 .

[25]  H. Akaike Likelihood of a model and information criteria , 1981 .

[26]  Kenneth H. Pollock,et al.  Design and Analysis Methods for Fish Survival Experiments Based on Release-Recapture. , 1988 .

[27]  G. Jolly EXPLICIT ESTIMATES FROM CAPTURE-RECAPTURE DATA WITH BOTH DEATH AND IMMIGRATION-STOCHASTIC MODEL. , 1965, Biometrika.