A Generalized Gaussian Process Model for Computer Experiments With Binary Time Series

Abstract Non-Gaussian observations such as binary responses are common in some computer experiments. Motivated by the analysis of a class of cell adhesion experiments, we introduce a generalized Gaussian process model for binary responses, which shares some common features with standard GP models. In addition, the proposed model incorporates a flexible mean function that can capture different types of time series structures. Asymptotic properties of the estimators are derived, and an optimal predictor as well as its predictive distribution are constructed. Their performance is examined via two simulation studies. The methodology is applied to study computer simulations for cell adhesion experiments. The fitted model reveals important biological information in repeated cell bindings, which is not directly observable in lab experiments. Supplementary materials for this article are available online.

[1]  Ying Hung,et al.  Binary Time Series Modeling With Application to Adhesion Frequency Experiments , 2008, Journal of the American Statistical Association.

[2]  D. Gillespie A General Method for Numerically Simulating the Stochastic Time Evolution of Coupled Chemical Reactions , 1976 .

[3]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[4]  Deborah Leckband,et al.  Memory in receptor–ligand-mediated cell adhesion , 2007, Proceedings of the National Academy of Sciences.

[5]  Robert B. Gramacy,et al.  Particle Learning of Gaussian Process Models for Sequential Design and Optimization , 2009, 0909.5262.

[6]  M. E. Johnson,et al.  Minimax and maximin distance designs , 1990 .

[7]  M. D. McKay,et al.  A comparison of three methods for selecting values of input variables in the analysis of output from a computer code , 2000 .

[8]  Christopher J Paciorek,et al.  The importance of scale for spatial-confounding bias and precision of spatial regression estimators. , 2010, Statistical science : a review journal of the Institute of Mathematical Statistics.

[9]  A. Raftery,et al.  Strictly Proper Scoring Rules, Prediction, and Estimation , 2007 .

[10]  D. Harville Bayesian inference for variance components using only error contrasts , 1974 .

[11]  David Barber,et al.  Bayesian Classification With Gaussian Processes , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  C. F. Sirmans,et al.  Nonstationary multivariate process modeling through spatially varying coregionalization , 2004 .

[13]  Robert B. Gramacy,et al.  Calibrating a large computer experiment simulating radiative shock hydrodynamics , 2014, 1410.3293.

[14]  Hao Zhang On Estimation and Prediction for Spatial Generalized Linear Mixed Models , 2002, Biometrics.

[15]  J. Hodges,et al.  Adding Spatially-Correlated Errors Can Mess Up the Fixed Effect You Love , 2010 .

[16]  Kurt Hornik,et al.  kernlab - An S4 Package for Kernel Methods in R , 2004 .

[17]  D. Harville Matrix Algebra From a Statistician's Perspective , 1998 .

[18]  R. Rigby,et al.  Generalized Autoregressive Moving Average Models , 2003 .

[19]  H. D. Patterson,et al.  Recovery of inter-block information when block sizes are unequal , 1971 .

[20]  Michael Affenzeller,et al.  DEVS Simulation of Spiking Neural Networks , 2003 .

[21]  D. Harville Maximum Likelihood Approaches to Variance Component Estimation and to Related Problems , 1977 .

[22]  Jeremy E. Oakley,et al.  Multivariate Gaussian Process Emulators With Nonseparable Covariance Structures , 2013, Technometrics.

[23]  Bo Wang,et al.  Generalized Gaussian Process Regression Model for Non-Gaussian Functional Data , 2014, 1401.8189.

[24]  J. Atchison,et al.  Logistic-normal distributions:Some properties and uses , 1980 .

[25]  D. Cox,et al.  Asymptotic techniques for use in statistics , 1989 .

[26]  D.,et al.  Regression Models and Life-Tables , 2022 .

[27]  A. O'Hagan,et al.  Bayesian emulation of complex multi-output and dynamic computer models , 2010 .

[28]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[29]  C. F. Wu,et al.  Efficient Calibration for Imperfect Computer Models , 2015, 1507.07280.

[30]  R Mead,et al.  A generalised logit-normal distribution. , 1965, Biometrics.

[31]  S. Zeger,et al.  Markov regression models for time series: a quasi-likelihood approach. , 1988, Biometrics.

[32]  Boxin Tang Orthogonal Array-Based Latin Hypercubes , 1993 .

[33]  Karl J. Friston,et al.  Variance Components , 2003 .

[34]  T. Choi,et al.  Gaussian Process Regression Analysis for Functional Data , 2011 .

[35]  Runze Li,et al.  Analysis of Computer Experiments Using Penalized Likelihood in Gaussian Kriging Models , 2005, Technometrics.

[36]  A. O'Hagan,et al.  Bayesian calibration of computer models , 2001 .

[37]  A V Herz,et al.  Neural codes: firing rates and beyond. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Cheng Zhu,et al.  The kinetics of two dimensional TCR and pMHC interactions determine T cell responsiveness , 2010, Nature.

[39]  Richard J. Beckman,et al.  A Comparison of Three Methods for Selecting Values of Input Variables in the Analysis of Output From a Computer Code , 2000, Technometrics.

[40]  Noel A Cressie,et al.  Asymptotics for REML estimation of spatial covariance parameters , 1996 .

[41]  Stanley H. Cohen,et al.  Design and Analysis , 2010 .

[42]  C. Rasmussen,et al.  Approximations for Binary Gaussian Process Classification , 2008 .

[43]  Daniel W. Apley,et al.  Local Gaussian Process Approximation for Large Computer Experiments , 2013, 1303.0383.

[44]  Jun Dai,et al.  Reliability Simulation and Circuit-Failure Analysis in Analog and Mixed-Signal Applications , 2009, IEEE Transactions on Device and Materials Reliability.

[45]  Noel A Cressie,et al.  The asymptotic distribution of REML estimators , 1993 .

[46]  Michael A. West,et al.  A dynamic modelling strategy for Bayesian computer model emulation , 2009 .

[47]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[48]  E. Slud,et al.  Binary Time Series , 1980 .

[49]  Sonja Kuhnt,et al.  Design and analysis of computer experiments , 2010 .

[50]  V. Roshan Joseph,et al.  Orthogonal Gaussian process models , 2016, 1611.00203.

[51]  J. Freidman,et al.  Multivariate adaptive regression splines , 1991 .

[52]  Frank Lad,et al.  Two Moments of the Logitnormal Distribution , 2008, Commun. Stat. Simul. Comput..

[53]  Chih-Li Sung,et al.  Statistica Sinica Preprint No : SS-2016-0138 R 2 Title Exploiting Variance Reduction Potential in Local Gaussian Process , 2017 .