Restricted maximum likelihood estimation of covariances in sparse linear models

This paper surveys the theoretical and computational development of the restricted maximum likelihood (REML) approach for the estimation of covariance matrices in linear stochastic models. A new derivation of this approach is given, valid under very weak conditions on the noise. Then the calculation of the gradient of restricted loglikelihood functions is dis- cussed, with special emphasis on the case of large and sparse model equations with a large number of unknown covariance components and possibly incomplete data. It turns out that the gradient calculations require hardly any extra storage, and only a small multiple of the number of operations needed to calculate the function values alone. The analytic gradient procedure was integrated into the VCE package for co- variance component estimation in large animal breeding models. It resulted in dramatic improvements of performance over the previous implementation with finite difference gradients. An example with more than 250 000 normal equations and 55 covariance components took hours instead of days of CPU time, and this was not an untypical case.

[1]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[2]  H. D. Patterson,et al.  Recovery of inter-block information when block sizes are unequal , 1971 .

[3]  H. Hartley,et al.  Computing Maximum Likelihood Estimates for the Mixed A.O.V. Model Using the W Transformation , 1973 .

[4]  W. F. Tinney,et al.  On computing certain elements of the inverse of a sparse matrix , 1975, Commun. ACM.

[5]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[6]  S. R. Searle,et al.  Restricted Maximum Likelihood (REML) Estimation of Variance Components in the Mixed Model , 1976 .

[7]  D. Harville Maximum Likelihood Approaches to Variance Component Estimation and to Related Problems , 1977 .

[8]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[9]  M GayDavid,et al.  Algorithm 611: Subroutines for Unconstrained Minimization Using a Model/Trust-Region Approach , 1983 .

[10]  David M. Gay,et al.  Algorithm 611: Subroutines for Unconstrained Minimization Using a Model/Trust-Region Approach , 1983, TOMS.

[11]  C. R. Henderson Estimation of Variances and Covariances under Multiple Trait Models , 1984 .

[12]  C. R. Henderson Applications of linear models in animal breeding , 1984 .

[13]  Bobby Schnabel,et al.  A modular system of algorithms for unconstrained minimization , 1985, TOMS.

[14]  R. Jennrich,et al.  Unbalanced repeated-measures models with structured covariance matrices. , 1986, Biometrics.

[15]  Robin Thompson,et al.  Estimation of genetic parameters. , 2005 .

[16]  N. Laird,et al.  Maximum likelihood computations with repeated measures: application of the EM algorithm , 1987 .

[17]  William H. Fellner,et al.  Sparse matrices, and the estimation of variance components by likelihood methods , 1987 .

[18]  Bruce Tier,et al.  A Derivative-Free Approach for Estimating Variance Components in Animal Models by Restricted Maximum Likelihood1 , 1987 .

[19]  I. Duff,et al.  Direct Methods for Sparse Matrices , 1987 .

[20]  K. Meyer,et al.  DFREML—A Set of Programs to Estimate Variance Components Under an Individual Animal Model , 1988 .

[21]  K. Meyer,et al.  Restricted maximum likelihood to estimate variance components for animal models with several random effects using a derivative-free algorithm , 1989, Genetics Selection Evolution.

[22]  D. Bates,et al.  Newton-Raphson and EM Algorithms for Linear Mixed-Effects Models for Repeated-Measures Data , 1988 .

[23]  Eildert Groeneveld,et al.  A Generalized Computing Procedure for Setting Up and Solving Mixed Linear Models , 1990 .

[24]  E. Tholen Untersuchung von Ursachen und Auswirkungen heterogener Varianzen der Indexmerkmale in der deutschen Schweineherdbuchzucht , 1990 .

[25]  K. Meyer,et al.  Estimating variances and covariances for multivariate animal models by restricted maximum likelihood , 1991, Genetics Selection Evolution.

[26]  J. L. Gill Biases in balanced experiments with uncontrolled random factors1 , 1991 .

[27]  Barry W. Peyton,et al.  Block sparse Cholesky algorithms on advanced uniprocessor computers , 1991 .

[28]  Milena Kovac Derivative free methods in covariance components estimation , 1992 .

[29]  D. Boichard,et al.  Multivariate restricted maximum likelihood estimation of genetic parameters for growth, carcass and meat quality traits in French Large White and French Landrace pigs , 1993, Genetics Selection Evolution.

[30]  Miguel Pérez-Enciso,et al.  Sparse Matrix Inversion for Restricted Maximum Likelihood Estimation of Variance Components by Expectation-Maximization , 1993 .

[31]  I Misztal,et al.  Comparison of computing properties of derivative and derivative-free algorithms in variance-component estimation by REML. , 1994, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[32]  J. Spilke,et al.  Comparison of four multivariate REML (co)variance component estimation packages , 1994 .

[33]  K. Meyer DERIVATIVE-INTENSE RESTRICTED MAXIMUM LIKELIHOOD ESTIMATION OF COVARIANCE COMPONENTS FOR ANIMAL MODELS , 1994 .

[34]  Russ Wolfinger,et al.  Computing Gaussian Likelihoods and Their Derivatives for General Linear Mixed Models , 1994, SIAM J. Sci. Comput..

[35]  Pontus Matstoms,et al.  Sparse QR factorization in MATLAB , 1994, TOMS.

[36]  Simultaneous estimation of variances and covariances using REML and Henderson 3 in a selected population of White Leghorns. , 1994, British poultry science.

[37]  E. Groeneveld A reparameterization to improve numerical optimization in multivariate REML (co)variance component estimation , 1994, Genetics Selection Evolution.

[38]  N. Wray,et al.  Calculation of prediction error variances using sparse matrix methods. , 1994, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[39]  Christian Bischof,et al.  ADIFOR 2.0 User's Guide , 1995 .

[40]  Paul D. Hovland,et al.  ADIFOR 2.0 user`s guide (Revision B) , 1995 .

[41]  D. Boichard,et al.  Restricted maximum likelihood estimates of genetic parameters of adult male and female Rhode Island red chickens divergently selected for residual feed consumption. , 1995, Poultry science.

[42]  Robin Thompson,et al.  Restricted Maximum Likelihood Estimation of Variance Components for Univariate Animal Models Using Sparse Matrix Techniques and Average Information , 1995 .

[43]  Patrick J. Burns,et al.  Large-Scale Estimation of Variance and Covariance Components , 1995, SIAM J. Sci. Comput..

[44]  W. Brade,et al.  Bestimmung genetischer Populationsparameter für die Einsatzleistung von Milchkühen , 1995 .

[45]  W. Brade,et al.  Rechentechnische Aspekte der multivariaten REML Kovarianzkomponentenschätzung, dargestellt an einem Anwendungsbeispiel aus der Rinderzüchtung , 1996 .

[46]  L. Csató,et al.  Joint genetic evaluation of field and station test in the Hungarian white and landrace populations , 1996 .

[47]  N. Mielenz,et al.  A Monte-Carlo study of (co)variance component estimation ( REML ) for traits with different design matrices , 1996 .

[48]  W. Brade,et al.  Bedeutung der speziellen Kombinationseignung in der Milchrinderzüchtung , 1996 .

[49]  V. Yohai,et al.  Robust estimation of variance components , 1998 .

[50]  L. Dixon,et al.  Automatic differentiation of algorithms , 2000 .

[51]  L. Dixon,et al.  Automatic Di erentiation of Algorithms , 2000 .