Robust descriptive discriminant analysis for repeated measures data

Discriminant analysis (DA) procedures based on parsimonious mean and/or covariance structures have recently been proposed for repeated measures data. However, these procedures rest on the assumption of a multivariate normal distribution. This study examines repeated measures DA (RMDA) procedures based on maximum likelihood (ML) and coordinatewise trimming (CT) estimation methods and investigates bias and root mean square error (RMSE) in discriminant function coefficients (DFCs) using Monte Carlo techniques. Study parameters include population distribution, covariance structure, sample size, mean configuration, and number of repeated measurements. The results show that for ML estimation, bias in DFC estimates was usually largest when the data were normally distributed, but there was no consistent trend in RMSE. For non-normal distributions, the average bias of CT estimates for procedures that assume unstructured group means and structured covariances was at least 40% smaller than the values for corresponding procedures based on ML estimators. The average RMSE for the former procedures was at least 10% smaller than the average RMSE for the latter procedures, but only when the data were sampled from extremely skewed or heavy-tailed distributions. This finding was observed even when the covariance and mean structures of the RMDA procedure were mis-specified. The proposed robust procedures can be used to identify measurement occasions that make the largest contribution to group separation when the data are sampled from multivariate skewed or heavy-tailed distributions.
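As a rough illustration of the two quantities the abstract refers to, the sketch below shows a coordinatewise trimmed mean (each measurement occasion is trimmed separately) and the bias and RMSE of a set of coefficient estimates across Monte Carlo replications. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the 20% symmetric trimming proportion, and the use of NumPy are all illustrative choices, and only the location part of CT estimation is covered (no trimmed or Winsorized covariance, no structured mean or covariance model).

```python
import numpy as np


def coordinatewise_trimmed_mean(x, trim=0.2):
    """Trimmed mean computed separately for each column of x.

    x    : (n, p) array, rows = subjects, columns = measurement occasions.
    trim : proportion removed from EACH tail (0.2 is an assumed default).
    """
    x = np.asarray(x, dtype=float)
    n = x.shape[0]
    g = int(np.floor(trim * n))  # observations dropped from each tail
    if 2 * g >= n:
        raise ValueError("trim proportion leaves no observations")
    x_sorted = np.sort(x, axis=0)  # sort every column independently
    return x_sorted[g:n - g].mean(axis=0)


def bias_and_rmse(estimates, truth):
    """Bias and RMSE per coefficient.

    estimates : (replications, p) array of estimated coefficients.
    truth     : length-p vector of population coefficients.
    """
    err = np.asarray(estimates, dtype=float) - np.asarray(truth, dtype=float)
    bias = err.mean(axis=0)
    rmse = np.sqrt((err ** 2).mean(axis=0))
    return bias, rmse


if __name__ == "__main__":
    # Five subjects, two occasions; the last row is an outlier on both.
    data = [[1, 10], [2, 20], [3, 30], [4, 40], [100, -500]]
    print(coordinatewise_trimmed_mean(data, trim=0.2))
    print(np.mean(data, axis=0))  # ordinary mean, pulled by the outlier
```

Because the trimming is applied column by column, an observation can be discarded at one occasion and retained at another. In a simulation, `bias_and_rmse` would be applied to the stacked DFC estimates from all replications, once for the ML-based procedure and once for the CT-based one.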
