Kernel probability estimation for binomial and multinomial data

Kernel-based smoothers have enjoyed considerable success in the estimation of both probability densities and event frequencies. Existing procedures can be modified to yield a similar kernel-based estimator of instantaneous probability over the course of a binomial or multinomial time series. The resulting nonparametric estimate can be described in terms of one bandwidth per outcome alternative, facilitating both the understanding and reporting of results relative to more sophisticated methods for binomial outcome estimation. Also described is a method for sample size estimation, which in turn can be used to obtain credible intervals for the resulting estimate given mild assumptions. One application of this analysis is to model response accuracy in tasks with heterogeneous trial types. An example is presented from a study of transitive inference, showing how kernel probability estimates provide a method for inferring response accuracy during the first trial following training. This estimation procedure is also effective in describing the multinomial responses typical in the study of choice and decision making. An example is presented showing how the procedure may be used to describe changing distributions of choices over time when eight response alternatives are simultaneously available.

[1]  M. Rosenblatt Remarks on Some Nonparametric Estimates of a Density Function , 1956 .

[2]  L. Brown,et al.  Interval Estimation for a Binomial Proportion , 2001 .

[3]  W M Baum,et al.  On two types of deviation from the matching law: bias and undermatching. , 1974, Journal of the experimental analysis of behavior.

[4]  S. Wood,et al.  Generalized Additive Models: An Introduction with R , 2006 .

[5]  G. Jensen,et al.  Beyond Dichotomy: Dynamics of Choice in Compositional Space , 2014 .

[6]  M. C. Jones,et al.  A Brief Survey of Bandwidth Selection for Density Estimation , 1996 .

[7]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[8]  R. Cydulka Has misdiagnosis of appendicitis decreased over time? A population-based analysis ☆ , 2003 .

[9]  James Stephen Marron,et al.  Lower bounds for bandwidth selection in density estimation , 1991 .

[10]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[11]  F. D. de Waal,et al.  Post-Conflict Affiliation by Chimpanzees with Aggressors: Other-Oriented versus Selfish Political Strategy , 2011, PloS one.

[12]  Vincent P. Ferrera,et al.  Implicit Value Updating Explains Transitive Inference Performance: The Betasort Model , 2015, PLoS Comput. Biol..

[13]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[14]  R. Tibshirani,et al.  Generalized Additive Models , 1991 .

[15]  C. Gallistel,et al.  The learning curve: implications of a quantitative analysis. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[16]  T. Koepsell,et al.  Has Misdiagnosis of Appendicitis Decreased Over Time , 2001 .

[17]  R. Reyment Compositional data analysis , 1989 .

[18]  Greg Jensen,et al.  Compositions and their application to the analysis of choice. , 2014, Journal of the experimental analysis of behavior.

[19]  Shigeru Shinomoto,et al.  Kernel bandwidth optimization in spike rate estimation , 2009, Journal of Computational Neuroscience.

[20]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[21]  Djalil CHAFAÏ,et al.  Confidence Regions for the Multinomial Parameter With Small Sample Size , 2008, 0805.1971.

[22]  R. Tibshirani,et al.  Generalized additive models for medical research , 1986, Statistical methods in medical research.