Information Matrix Splitting

Efficient statistical estimates via the maximum likelihood method requires the observed information, the negative of the Hessian of the underlying log-likelihood function. Computing the observed information is computationally prohibitive for high-throughput biological data, therefore, the expected information matrix---the Fisher information matrix---is often preferred due to its simplicity. In this paper, we prove that the average of the observed and the Fisher information of restricted/residual log-likelihood functions for linear mixed models can be split into two matrices. The expectation of one part is the Fisher information matrix but enjoys a simper formula than the Fisher information matrix. The other part which involves a lot of computations is a random zero matrix and thus is negligible. Leveraging such a splitting can simplify evaluation of the approximate Hessian of a log-likelihood function.

[1]  Ram Zamir,et al.  A Proof of the Fisher Information Inequality via a Data Processing Argument , 1998, IEEE Trans. Inf. Theory.

[2]  A. Heavens,et al.  Generalized Fisher matrices , 2014, 1404.2854.

[3]  N. Longford A fast scoring algorithm for maximum likelihood estimation in unbalanced mixed models with nested random effects , 1987 .

[4]  Shengxin Zhu Computing log-likelihood and its derivatives for restricted maximum likelihood methods , 2016, 1608.07207.

[5]  J. Gillis,et al.  Matrix Iterative Analysis , 1961 .

[6]  M. Vallisneri A User Manual for the Fisher Information Matrix , 2007 .

[7]  Zhiwu Zhang,et al.  Mixed linear model approach adapted for genome-wide association studies , 2010, Nature Genetics.

[8]  Xiaowen Xu,et al.  Information Splitting for Big Data Analytics , 2016, 2016 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC).

[9]  Robert I. Jennrich,et al.  Newton-Raphson and Related Algorithms for Maximum Likelihood Variance Component Estimation , 1976 .

[10]  B. Efron,et al.  Assessing the accuracy of the maximum likelihood estimator: Observed versus expected Fisher information , 1978 .

[11]  Robin Thompson,et al.  Restricted Maximum Likelihood Estimation of Variance Components for Univariate Animal Models Using Sparse Matrix Techniques and Average Information , 1995 .

[12]  Ram Zamir A Necessary and Sufficient Condition for Equality in the Matrix Fisher-Information-Inequality , 1997 .

[14]  Robin Thompson,et al.  Average information REML: An efficient algorithm for variance parameter estimation in linear mixed models , 1995 .

[15]  Information geometry and phase transitions , 2003, cond-mat/0401092.

[16]  R. Punnett,et al.  The Genetical Theory of Natural Selection , 1930, Nature.

[17]  X. R. Wang,et al.  Relating Fisher information to order parameters. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Eleazar Eskin,et al.  Improved linear mixed models for genome-wide association studies , 2012, Nature Methods.

[19]  Ying Liu,et al.  FaST linear mixed models for genome-wide association studies , 2011, Nature Methods.

[20]  M. Stephens,et al.  Genome-wide Efficient Mixed Model Analysis for Association Studies , 2012, Nature Genetics.