A Pairwise Likelihood Method For Correlated Binary Data With/withoutMissing Observations Under Generalized Partially Linear Single-indexModels

Correlated data, such as multivariate or clustered data, arise commonly in practice. Unlike analysis for independent data, valid inference based on such data often requires proper accommodation of complex association structures among re- sponse components within clusters. Semiparametric models based on generalized estimating equations (GEE) methods, and their extensions, have become increas- ingly popular. However, these inferential schemes are greatly challenged by the complexity of such data features as missing observations, ubiquitous in applica- tions. Moreover, existing methods mainly concern marginal mean parameters with association parameters treated as nuisance. This treatment is inadequate to han- dle clustered data for which estimation of association parameters can be a central theme of the study. To address these problems, we develop a flexible semiparamet- ric method that can handle correlated data with or without missing values. Our discussion focuses on binary data that arise commonly. The proposed method en- joys a number of attractive properties, including that the missing data process is left unmodeled, yet model assumptions for the response process are kept to a min- imum. It is robust in the sense that only the mean and association structures for the response process are modeled. The proposed method is flexible because both parametric and nonparametric structures are incorporated in modeling the mean

[1]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[2]  Kung-Yee Liang,et al.  Conditional logistic regression models for correlated binary data , 1988 .

[3]  R. Prentice,et al.  Correlated binary regression with covariates specific to each binary observation. , 1988, Biometrics.

[4]  L. Zhao,et al.  Correlated binary regression using a quadratic exponential model , 1990 .

[5]  S. Lipsitz,et al.  Generalized estimating equations for correlated binary data: Using the odds ratio as a measure of association , 1991 .

[6]  S. Zeger,et al.  Multivariate Regression Analyses for Categorical Data , 1992 .

[7]  G. Molenberghs,et al.  Marginal modelling of Correlated Ordinal Data using an n-way Plackett Distribution , 1992 .

[8]  P. Diggle,et al.  Modelling multivariate binary data with alternating logistic regressions , 1993 .

[9]  N. Laird,et al.  A likelihood-based method for analysing longitudinal binary responses , 1993 .

[10]  A. Agresti,et al.  Simultaneously Modeling Joint and Marginal Distributions of Multivariate Categorical Responses , 1994 .

[11]  Jianqing Fan,et al.  Local polynomial kernel regression for generalized linear models and quasi-likelihood functions , 1995 .

[12]  Stuart R. Lipsitz,et al.  A Model for Binary Time Series Data with Serial Odds Ratio Patterns , 1995 .

[13]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[14]  Geert Molenberghs,et al.  Regression Models for Longitudinal Binary Responses with Informative Drop‐Outs , 1995 .

[15]  Jianqing Fan,et al.  Generalized Partially Linear Single-Index Models , 1997 .

[16]  R. Carroll,et al.  Semiparametric Regression for Clustered Data Using Generalized Estimating Equations , 2001 .

[17]  Raymond J. Carroll,et al.  Semiparametric regression for clustered data , 2001 .

[18]  Richard J. Cook,et al.  Marginal Methods for Incomplete Longitudinal Data Arising in Clusters , 2002 .

[19]  SECOND ORDER ESTIMATING EQUATIONS FOR CLUSTERED LONGITUDINAL BINARY DATA WITH MISSING OBSERVATIONS , 2002 .

[20]  Naisyin Wang Marginal nonparametric kernel regression accounting for within‐subject correlation , 2003 .

[21]  Qiong Yang,et al.  Description of the Framingham Heart Study data for Genetic Analysis Workshop 13 , 2003, BMC Genetics.

[22]  S. Horvath,et al.  Multivariate variance-components analysis of longitudinal blood pressure measurements from the Framingham Heart Study , 2003, BMC Genetics.

[23]  D. Cox,et al.  A note on pseudolikelihood constructed from marginal densities , 2004 .

[24]  ARROLL,et al.  Estimation in Partially Linear Models With Missing Covariates , 2004 .

[25]  G. Yi,et al.  Marginal and association regression models for longitudinal binary data with drop‐outs: A likelihood‐based approach , 2005 .

[26]  R. Carroll,et al.  Efficient Semiparametric Marginal Estimation for Longitudinal/Clustered Data , 2005 .

[27]  R. Carroll,et al.  Semiparametric estimation in general repeated measures problems , 2006 .

[28]  Hua Liang,et al.  Analysis of correlated binary data under partially linear single-index logistic models , 2009, J. Multivar. Anal..

[29]  Hua Liang,et al.  Semiparametric marginal and association regression methods for clustered binary data , 2011, Annals of the Institute of Statistical Mathematics.

[30]  Richard J. Cook,et al.  A robust pairwise likelihood method for incomplete longitudinal binary data arising in clusters , 2011 .

[31]  Harry Joe,et al.  Composite Likelihood Methods , 2012 .