Testing in multifactor heteroscedastic ANOVA and repeated measures designs with large number of levels

Testing in multifactor heteroscedastic ANOVA and repeated measures designs with large number of levels Testing methods for factorial designs with independent or dependent observations where some of the factors have a large number of levels have received a lot of attention recently. Most results for independent data in the literature have been restricted to procedures using the original observations for the balanced homoscedastic case, which require strong moment assumptions and are sensitive to outliers. The results in the literature for dependent data were extensively studied in parametric, nonparametric and semiparametric, and Bayesian models but all that do inference require large sample sizes or the normality assumption. The first part of my thesis considers the use of rank statistics as robust alternatives for testing hypotheses in balanced and unbalanced, homoscedastic and heteroscedastic one-way and two-way ANOVA models when the number of levels of at least one factor is large. The second part of my thesis deals with various testing problems in possibly unbalanced and heteroscedastic multi-factor designs with arbitrary but fixed number of factors when at least one of the factors have large number of factor levels. Procedures based on both original observations and their (mid-)ranks are presented for the same general setting. The first two parts pertain to independent data. The third part of my thesis is focused on testing hypotheses in functional data, a fully nonparametric method for evaluating the effect of several crossed factors on the curve and their interactions with time. The asymptotics, which rely on the large number of measurements per curve (subject) and not on large group sizes, hold under the general assumption of

[1]  Peter J. Diggle,et al.  RATES OF CONVERGENCE IN SEMI‐PARAMETRIC MODELLING OF LONGITUDINAL DATA , 1994 .

[2]  Pierre R. Bushel,et al.  STATISTICAL ANALYSIS OF A GENE EXPRESSION MICROARRAY EXPERIMENT WITH REPLICATION , 2002 .

[3]  M. Akritas,et al.  Nonparametric goodness-of-fit test for heteroscedastic regression models , 2002 .

[4]  E. Vonesh,et al.  Linear and Nonlinear Models for the Analysis of Repeated Measurements , 1996 .

[5]  M. Akritas,et al.  TWO-WAY HETEROSCEDASTIC ANOVA WHEN THE NUMBER OF LEVELS IS LARGE , 2006 .

[6]  Pierre R. Bushel,et al.  Assessing Gene Significance from cDNA Microarray Expression Data via Mixed Models , 2001, J. Comput. Biol..

[7]  M. Gibson,et al.  Beyond ANOVA: Basics of Applied Statistics. , 1986 .

[8]  J. Rice,et al.  Smoothing spline models for the analysis of nested and crossed samples of curves , 1998 .

[9]  Michael G. Akritas,et al.  Asymptotics for Analysis of Variance When the Number of Levels is Large , 2000 .

[10]  Yuedong Wang,et al.  Mixed-Effects Smoothing Spline ANOVA , 1998 .

[11]  J. Hart,et al.  Kernel Regression Estimation Using Repeated Measurements Data , 1986 .

[12]  P. Diggle,et al.  Semiparametric models for longitudinal data with application to CD4 cell numbers in HIV seroconverters. , 1994, Biometrics.

[13]  D. Hand,et al.  Practical Longitudinal Data Analysis , 1996 .

[14]  Li Ping Yang,et al.  Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data , 1998 .

[15]  Michael G. Akritas,et al.  Nonparametric models and methods for ANCOVA with dependent data , 2004 .

[16]  Michael G. Akritas,et al.  The Rank Transform Method in Some Two-Factor Designs , 1990 .

[17]  S. Arnold The Theory of Linear Models and Multivariate Analysis , 1981 .

[18]  G. Box Some Theorems on Quadratic Forms Applied in the Study of Analysis of Variance Problems, I. Effect of Inequality of Variance in the One-Way Classification , 1954 .

[19]  G. Box Some Theorems on Quadratic Forms Applied in the Study of Analysis of Variance Problems, II. Effects of Inequality of Variance and of Correlation Between Errors in the Two-Way Classification , 1954 .

[20]  Adrian F. M. Smith,et al.  Bayesian computation via the gibbs sampler and related markov chain monte carlo methods (with discus , 1993 .

[21]  J. Hart Kernel regression estimation with time series errors , 1991 .

[22]  Michael G. Akritas,et al.  Nonparametric models and methods for nonlinear analysis of covariance , 2000 .

[23]  D. Bates,et al.  Approximations to the Log-Likelihood Function in the Nonlinear Mixed-Effects Model , 1995 .

[24]  S. L. Andersen,et al.  Permutation Theory in the Derivation of Robust Criteria and the Study of Departures from Assumption , 1955 .

[25]  S. Portnoy Asymptotic behavior of M-estimators of p regression parameters when p , 1985 .

[26]  Marie Davidian,et al.  Nonlinear Models for Repeated Measurement Data , 1995 .

[27]  Chin-Tsang Chiang,et al.  Smoothing Spline Estimation for Varying Coefficient Models With Repeatedly Measured Dependent Variables , 2001 .

[28]  Edgar Brunner,et al.  Nonparametric Hypotheses and Rank Statistics for Unbalanced Factorial Designs , 1997 .

[29]  Gary A. Churchill,et al.  Analysis of Variance for Gene Expression Microarray Data , 2000, J. Comput. Biol..

[30]  Michael G. Akritas,et al.  Fully Nonparametric Hypotheses for Factorial Designs I: Multivariate Repeated Measures Designs , 1994 .

[31]  P. Diggle Analysis of Longitudinal Data , 1995 .

[32]  Bruce G. Lindsay,et al.  Efficiency of projected score methods in rectangular array asymptotics , 2003 .

[33]  B. Silverman,et al.  Estimating the mean and covariance structure nonparametrically when the data are curves , 1991 .

[34]  Edgar Brunner,et al.  A unified approach to rank tests for mixed models , 1997 .

[35]  H. Sahai,et al.  The Analysis of Variance: Fixed, Random and Mixed Models , 2000 .

[36]  Dennis D. Boos,et al.  ANOVA and rank tests when the number of treatments is large , 1995 .

[37]  Haiyan Wang,et al.  Rank tests for anova with large number of factor levels , 2004 .

[38]  R. G. Krutchkoff,et al.  Two-way fixed effects analysis of variance when the error variances may be unequal , 1988 .

[39]  Frank Yates,et al.  The Analysis of Multiple Classifications with Unequal Numbers in the Different Classes , 1934 .

[40]  David R. Brillinger,et al.  The Analysis of Time Series Collected in an Experimental Design , 1973 .

[41]  Marie Davidian,et al.  The Nonlinear Mixed Effects Model with a Smooth Random Effects Density , 1993 .

[42]  Edgar Brunner,et al.  Nonparametric analysis of longitudinal data in factorial experiments , 2012 .

[43]  N. Cliff Dominance statistics: Ordinal analyses to answer ordinal questions. , 1993 .

[44]  M. Akritas,et al.  Heteroscedastic One-Way ANOVA and Lack-of-Fit Tests , 2004 .

[45]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[46]  K Y Liang,et al.  Longitudinal data analysis for discrete and continuous outcomes. , 1986, Biometrics.

[47]  Naomi Altman,et al.  Kernel Smoothing of Data with Correlated Errors , 1990 .

[48]  S. Arnold Asymptotic Validity of F Tests for the Ordinary Linear Model and the Multiple Correlation Model , 1980 .

[49]  C Brownie,et al.  Type I error robustness of ANOVA and ANOVA on ranks when the number of treatments is large. , 1994, Biometrics.

[50]  Jianqing Fan,et al.  Test of Significance When Data Are Curves , 1998 .

[51]  Michael G. Akritas,et al.  Nonparametric ANCOVA with two and three covariates , 2004 .

[52]  A. Mallet A maximum likelihood estimation method for random coefficient regression models , 1986 .