Inference on the marginal distribution of clustered data with informative cluster size

In spite of recent contributions to the literature, informative cluster size settings are not well known and understood. In this paper, we give a formal definition of the problem and describe it from different viewpoints. Data generating mechanisms, parametric and nonparametric models are considered in light of examples. Our emphasis is on nonparametric and robust approaches to the inference on the marginal distribution. Descriptive statistics and parameters of interest are defined as functionals and they are accompanied with a generally applicable testing procedure. The theory is illustrated with an example on patients with incomplete spinal cord injuries.

[1]  V Reggie Edgerton,et al.  Establishing the NeuroRecovery Network: multisite rehabilitation centers that provide activity-based therapies and assessments for neurologic disorders. , 2012, Archives of physical medicine and rehabilitation.

[2]  P. Bickel,et al.  DESCRIPTIVE STATISTICS FOR NONPARAMETRIC MODELS IV. SPREAD , 1979 .

[3]  Volker Dietz,et al.  Assessing walking ability in subjects with spinal cord injury: validity and reliability of 3 walking tests. , 2005, Archives of physical medicine and rehabilitation.

[4]  David G Addiss,et al.  Modeling survival data with informative cluster size , 2008, Statistics in medicine.

[5]  Pranab Kumar Sen,et al.  Within‐cluster resampling , 2001 .

[6]  J. N. K. Rao,et al.  Mean estimating equation approach to analysing cluster-correlated data with nonignorable cluster sizes , 2005 .

[7]  Charles E McCulloch,et al.  Estimation of covariate effects in generalized linear mixed models with informative cluster sizes. , 2011, Biometrika.

[8]  Somnath Datta,et al.  Rank-Sum Tests for Clustered Data , 2005 .

[9]  Erich L. Lehmann,et al.  Descriptive Statistics for Nonparametric Models I. Introduction , 1975 .

[10]  Somnath Datta,et al.  Inference for marginal linear models for clustered longitudinal data with potentially informative cluster sizes , 2011, Statistical methods in medical research.

[11]  Somnath Datta,et al.  A Signed‐Rank Test for Clustered Data , 2008, Biometrics.

[12]  Yijian Huang,et al.  Marginal Regression of Gaps Between Recurrent Events , 2003, Lifetime data analysis.

[13]  P. Spreij Probability and Measure , 1996 .

[14]  David B Dunson,et al.  A Bayesian Approach for Joint Modeling of Cluster Size and Subunit‐Specific Outcomes , 2003, Biometrics.

[15]  J. Williamson,et al.  Weighting condom use data to account for nonignorable cluster size. , 2007, Annals of epidemiology.

[16]  Chin-Tsang Chiang,et al.  EFFICIENT ESTIMATION METHODS FOR INFORMATIVE CLUSTER SIZE DATA , 2008 .

[17]  Hannu Oja,et al.  A weighted multivariate sign test for cluster-correlated data , 2005 .

[18]  E. L. Lehmann,et al.  Descriptive Statistics for Nonparametric Models II. Location , 1975 .

[19]  Somnath Datta,et al.  Marginal Analyses of Clustered Data When Cluster Size Is Informative , 2003, Biometrics.

[20]  Katherine S Panageas,et al.  Properties of analysis methods that account for clustering in volume–outcome studies when the primary predictor is cluster size , 2007, Statistics in medicine.

[21]  Ralitza V Gueorguieva,et al.  Comments about Joint Modeling of Cluster Size and Binary and Continuous Subunit‐Specific Outcomes , 2005, Biometrics.

[22]  Somnath Datta,et al.  Marginal association measures for clustered data , 2011, Statistics in medicine.