Extending the Mann–Whitney–Wilcoxon rank sum test to longitudinal regression analysis

Outliers are commonly observed in psychosocial research, generally resulting in biased estimates when comparing group differences using popular mean-based models such as the analysis of variance model. Rank-based methods such as the popular Mann–Whitney–Wilcoxon (MWW) rank sum test are more effective to address such outliers. However, available methods for inference are limited to cross-sectional data and cannot be applied to longitudinal studies under missing data. In this paper, we propose a generalized MWW test for comparing multiple groups with covariates within a longitudinal data setting, by utilizing the functional response models. Inference is based on a class of U-statistics-based weighted generalized estimating equations, providing consistent and asymptotically normal estimates not only under complete but missing data as well. The proposed approach is illustrated with both real and simulated study data.

[1]  X M Tu,et al.  Distribution‐free models for longitudinal count responses with overdispersion and structural zeros , 2013, Statistics in medicine.

[2]  D. Horvitz,et al.  A Generalization of Sampling Without Replacement from a Finite Universe , 1952 .

[3]  P. Holland,et al.  An Exponential Family of Probability Distributions for Directed Graphs , 1981 .

[4]  Jeanne Kowalski,et al.  Modern Applied U-Statistics , 2007 .

[5]  X. M. Tu,et al.  Multivariate U‐statistics: a tutorial with applications , 2011 .

[6]  Wenqing He,et al.  Median Regression Models for Longitudinal Data with Dropouts , 2009, Biometrics.

[7]  Naiji Lu,et al.  Social Network Endogeneity and Its Implications for Statistical and Causal Inferences , 2014 .

[8]  Yinglin Xia,et al.  Reducing sexual risk behavior in adolescent girls: results from a randomized controlled trial. , 2013, The Journal of adolescent health : official publication of the Society for Adolescent Medicine.

[9]  Wan Tang,et al.  Inference for kappas for longitudinal study data: applications to sexual health research. , 2008, Biometrics.

[10]  Douglas A. Wolfe,et al.  Introduction to the Theory of Nonparametric Statistics. , 1980 .

[11]  Victor DeGruttola,et al.  Recursive partitioning for monotone missing at random longitudinal markers. , 2013, Statistics in medicine.

[12]  W. Pan,et al.  Small‐sample performance of the robust score test and its modifications in generalized estimating equations , 2005, Statistics in medicine.

[13]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[14]  R. MacGowan,et al.  Modeling Intervention Efficacy for High-Risk Women , 2000, Evaluation & the health professions.

[15]  D. Boos On Generalized Score Tests , 1992 .

[16]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[17]  R. Randles,et al.  Introduction to the Theory of Nonparametric Statistics , 1991 .

[18]  J. Kowalski,et al.  Nonparametric inference for stochastic linear hypotheses: Application to high‐dimensional data , 2004 .

[19]  W. Kruskal,et al.  Use of Ranks in One-Criterion Variance Analysis , 1952 .

[20]  N. Jewell,et al.  Hypothesis testing of regression parameters in semiparametric generalized linear models for cluster correlated data , 1990 .

[21]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[22]  Kerstin E. E. Schroder,et al.  Methodological challenges in research on sexual risk behavior: II. Accuracy of self-reports , 2003, Annals of behavioral medicine : a publication of the Society of Behavioral Medicine.

[23]  P Wu,et al.  A Class of Distribution-Free Models for Longitudinal Mediation Analysis , 2014, Psychometrika.

[24]  Chap T. Le,et al.  Applied Categorical Data Analysis , 1998 .

[25]  Ing Rj Ser Approximation Theorems of Mathematical Statistics , 1980 .