Generalized linear array models with applications to multidimensional smoothing

Summary.  Data with an array structure are common in statistics, and the design or regression matrix for analysis of such data can often be written as a Kronecker product. Factorial designs, contingency tables and smoothing of data on multidimensional grids are three such general classes of data and models. In such a setting, we develop an arithmetic of arrays which allows us to define the expectation of the data array as a sequence of nested matrix operations on a coefficient array. We show how this arithmetic leads to low storage, high speed computation in the scoring algorithm of the generalized linear model. We refer to a generalized linear array model and apply the methodology to the smoothing of multidimensional arrays. We illustrate our procedure with the analysis of three data sets: mortality data indexed by age at death and year of death, spatially varying microarray background data and disease incidence data indexed by age at death, year of death and month of death.

[1]  R. A. FISHER,et al.  The Design and Analysis of Factorial Experiments , 1938, Nature.

[2]  F. Yates Design and Analysis of Factorial Experiments , 1958 .

[3]  H. Akaike Maximum likelihood identification of Gaussian autoregressive moving average models , 1973 .

[4]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[5]  Joe Brewer,et al.  Kronecker products and matrix calculus in system theory , 1978 .

[6]  Peter Craven,et al.  Smoothing noisy data with spline functions , 1978 .

[7]  Carl de Boor,et al.  Efficient Computer Manipulation of Tensor Products , 1979, TOMS.

[8]  G. Wahba Bayesian "Confidence Intervals" for the Cross-validated Smoothing Spline , 1983 .

[9]  Peter Green Linear models for field trials, smoothing and cross-validation , 1985 .

[10]  B. Silverman,et al.  Some Aspects of the Spline Smoothing Approach to Non‐Parametric Regression Curve Fitting , 1985 .

[11]  D Clayton,et al.  Models for temporal variation in cancer rates. II: Age-period-cohort models. , 1987, Statistics in medicine.

[12]  D Clayton,et al.  Models for temporal variation in cancer rates. I: Age-period and age-cohort models. , 1987, Statistics in medicine.

[13]  Charles R. Johnson,et al.  Topics in Matrix Analysis , 1991 .

[14]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[15]  B. Silverman,et al.  Nonparametric Regression and Generalized Linear Models: A roughness penalty approach , 1993 .

[16]  B. Silverman,et al.  Nonparametric regression and generalized linear models , 1994 .

[17]  Paul Dierckx,et al.  Curve and surface fitting with splines , 1994, Monographs on numerical analysis.

[18]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[19]  Steven G. Gilmour,et al.  The analysis of designed experiments and longitudinal data by using smoothing splines - Discussion , 1999 .

[20]  M. Kenward,et al.  The Analysis of Designed Experiments and Longitudinal Data by Using Smoothing Splines , 1999 .

[21]  C. Loan The ubiquitous Kronecker product , 2000 .

[22]  S. Wood Modelling and smoothing parameter estimation with multiple quadratic penalties , 2000 .

[23]  Matt P. Wand,et al.  Smoothing and mixed models , 2003, Comput. Stat..

[24]  Karl J. Friston,et al.  Variance Components , 2003 .

[25]  Paul H. C. Eilers,et al.  Smoothing and forecasting mortality rates , 2004 .

[26]  B. Ripley,et al.  Semiparametric Regression: Preface , 2003 .

[27]  I. Currie,et al.  The Importance of Year of Birth in Two-Dimensional Mortality Data , 2006 .

[28]  Paul H. C. Eilers,et al.  Fast and compact smoothing on large multidimensional grids , 2006, Comput. Stat. Data Anal..

[29]  Paul H. C. Eilers,et al.  3D space-varying coefficient models with application to diffusion tensor imaging , 2007, Comput. Stat. Data Anal..

[30]  Abel M. Rodrigues Matrix Algebra Useful for Statistics , 2007 .

[31]  Thomas Kneib,et al.  Semiparametric multinomial logit models for analysing consumer choice behaviour , 2007 .