Multivariate Probit Analysis of Binary Time Series Datawith Missing

SUMMARY The development of adequate models for binary time series data with covariate adjustment has been an active research area in the last years. In the case, where interest is focused on marginal and association parameters, generalized estimating equations (GEE) (see for example Lipsitz, Laird and Harrington (1991) and Liang, Zeger and Qaqish (1992)) and likelihood (see for example Fitzmaurice and Laird (1993) and Molenberghs and Lesaare (1994)) based methods have been proposed. The number of parameters required for the full speciication of these models grows exponentially with the length of the binary time series. Therefore, the analysis is often focused on marginal and rst order parameters. In this case, the multivariate probit model (Ashford and Sowden (1970)) becomes an attractive alternative to the above models. The application of the multivariate probit model has been hampered by the intractability of the maximum likelihood estimator, when the length of the binary time series is large. This paper shows that this diiculty can be overcome by the use of Markov Chain Monte Carlo methods. This analysis also allows for valid point and interval estimates of the parameters in small samples. In addition, the analysis is adopted to handle the case of missing at random responses. The approach is illustrated on data involving binary responses measured at unequally spaced time points. Finally, this data analysis is compared to a GEE analysis given in Fitzmaurice and Lipsitz (1995).

[1]  R. Plackett A Class of Bivariate Distributions , 1965 .

[2]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[3]  P. Green,et al.  Bayesian Computation and Stochastic , 1995 .

[4]  Stuart R. Lipsitz,et al.  A Model for Binary Time Series Data with Serial Odds Ratio Patterns , 1995 .

[5]  S G Baker,et al.  Marginal regression for repeated binary data with outcome subject to non-ignorable non-response. , 1995, Biometrics.

[6]  N. Laird,et al.  A likelihood-based method for analysing longitudinal binary responses , 1993 .

[7]  S. Zeger,et al.  Marginal Regression Models for Clustered Ordinal Measurements , 1996 .

[8]  S. Zeger,et al.  Multivariate Regression Analyses for Categorical Data , 1992 .

[9]  D. Dockery,et al.  Passive smoking, gas cooking, and respiratory health of children living in six cities. , 2015, The American review of respiratory disease.

[10]  Karl Pearson,et al.  Mathematical contributions to the theory of evolution. VIII. On the correlation of characters not quantitatively measurable , 1900, Proceedings of the Royal Society of London.

[11]  J M Neuhaus,et al.  An annotated bibliography of methods for analysing correlated categorical data. , 1992, Statistics in medicine.

[12]  A. Whittemore,et al.  Methods for analyzing panel studies of acute health effects of air pollution. Technical report No. 25 , 1979 .

[13]  John Geweke,et al.  Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments , 1991 .

[14]  K. Pearson Mathematical contributions to the theory of evolution.—On the law of reversion , 2022, Proceedings of the Royal Society of London.

[15]  S. Chib,et al.  Understanding the Metropolis-Hastings Algorithm , 1995 .

[16]  G Molenberghs,et al.  Multivariate probit analysis: a neglected procedure in medical statistics. , 1991, Statistics in medicine.

[17]  G. Molenberghs,et al.  Marginal Modeling of Correlated Ordinal Data Using a Multivariate Plackett Distribution , 1994 .

[18]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[19]  J. Dale Global cross-ratio models for bivariate, discrete, ordered responses. , 1986, Biometrics.

[20]  Geert Molenberghs,et al.  Regression Models for Longitudinal Binary Responses with Informative Drop‐Outs , 1995 .

[21]  A S Whittemore,et al.  Methods for analyzing panel studies of acute health effects of air pollution. , 1979, Biometrics.

[22]  S R Lipsitz,et al.  Estimation methods for the join distribution of repeated binary observations. , 1995, Biometrics.

[23]  N M Laird,et al.  Missing data in longitudinal studies. , 1988, Statistics in medicine.

[24]  G Molenberghs,et al.  An application of maximum likelihood and generalized estimating equations to the analysis of ordinal data from a longitudinal study with cases missing at random. , 1994, Biometrics.

[25]  L. Zhao,et al.  Correlated binary regression using a quadratic exponential model , 1990 .

[26]  S. Lipsitz,et al.  Generalized estimating equations for correlated binary data: Using the odds ratio as a measure of association , 1991 .

[27]  D. Cox The Analysis of Multivariate Binary Data , 1972 .

[28]  R. Prentice,et al.  Correlated binary regression with covariates specific to each binary observation. , 1988, Biometrics.

[29]  Philip Heidelberger,et al.  Simulation Run Length Control in the Presence of an Initial Transient , 1983, Oper. Res..

[30]  J. C. van Houwelingen,et al.  Logistic Regression for Correlated Binary Data , 1994 .

[31]  J A Anderson,et al.  The grouped continuous model for multivariate ordered categorical variables and covariate adjustment. , 1985, Biometrics.

[32]  Christian Heumann Marginal regression modeling of correlated multicategorical response: A likelihood approach , 1996 .

[33]  P. Diggle,et al.  Modelling multivariate binary data with alternating logistic regressions , 1993 .

[34]  N M Laird,et al.  Analysing incomplete longitudinal binary responses: a likelihood-based approach. , 1994, Biometrics.

[35]  A. Raftery,et al.  How Many Iterations in the Gibbs Sampler , 1991 .