Hierarchical linear regression models for conditional quantiles

The quantile regression has several useful features and therefore is gradually developing into a comprehensive approach to the statistical analysis of linear and nonlinear response models, but it cannot deal effectively with the data with a hierarchical structure. In practice, the existence of such data hierarchies is neither accidental nor ignorable, it is a common phenomenon. To ignore this hierarchical data structure risks overlooking the importance of group effects, and may also render many of the traditional statistical analysis techniques used for studying data relationships invalid. On the other hand, the hierarchical models take a hierarchical data structure into account and have also many applications in statistics, ranging from overdispersion to constructing min-max estimators. However, the hierarchical models are virtually the mean regression, therefore, they cannot be used to characterize the entire conditional distribution of a dependent variable given high-dimensional covariates. Furthermore, the estimated coefficient vector (marginal effects) is sensitive to an outlier observation on the dependent variable. In this article, a new approach, which is based on the Gauss-Seidel iteration and taking a full advantage of the quantile regression and hierarchical models, is developed. On the theoretical front, we also consider the asymptotic properties of the new method, obtaining the simple conditions for an n1/2-convergence and an asymptotic normality. We also illustrate the use of the technique with the real educational data which is hierarchical and how the results can be explained.

[1]  Judith D. Singer,et al.  Using SAS PROC MIXED to Fit Multilevel Models, Hierarchical Models, and Individual Growth Models , 1998 .

[2]  D. Ruppert,et al.  Trimmed Least Squares Estimation in the Linear Model , 1980 .

[3]  R. Koenker,et al.  Computing regression quantiles , 1987 .

[4]  D. Rubin,et al.  Estimation in Covariance Components Models , 1981 .

[5]  N. Longford A fast scoring algorithm for maximum likelihood estimation in unbalanced mixed models with nested random effects , 1987 .

[6]  Barr Rosenberg,et al.  Linear regression with randomly dispersed parameters , 1973 .

[7]  James P. Hobert,et al.  Hierarchical Models: A Current Computational Perspective , 2000 .

[8]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[9]  R. C. Elston,et al.  Estimation of Time-Response Curves and Their Confidence Bands , 1962 .

[10]  P. Billingsley,et al.  Convergence of Probability Measures , 1969 .

[11]  R. Kass,et al.  Approximate Bayesian Inference in Conditionally Independent Hierarchical Models (Parametric Empirical Bayes Models) , 1989 .

[12]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[13]  Jana Jurečková,et al.  On adaptive scale-equivariant m-estimators in linear models , 1982 .

[14]  D. F. Morrison,et al.  Multivariate Statistical Methods , 1968 .

[15]  Barbara Entwisle,et al.  Contextual analysis through the multilevel linear model. , 1983 .

[16]  H. Goldstein Multilevel Statistical Models , 2006 .

[17]  Jana Jurečková,et al.  Asymptotic Relations of $M$-Estimates and $R$-Estimates in Linear Regression Model , 1977 .

[18]  R. Koenker,et al.  The Gaussian hare and the Laplacian tortoise: computability of squared-error versus absolute-error estimators , 1997 .

[19]  D. Lindley,et al.  Bayes Estimates for the Linear Model , 1972 .

[20]  A. F. Smith A General Bayesian Linear Model , 1973 .