Matrix Differential Calculus Jan R. Magnus and Heinz Neudecker John Wiley and Sons, 1988Linear Structures Jan R. Magnus Charles Griffin and Co., 1988

One of the more onerous tasks of multivariate statistical analysis is that of finding the first-order and second-order partial derivatives of complicated likelihood functions involving vectors or matrices as their arguments. Two decades ago, the usual way of tackling such problems was to resort to scalar differential calculus. Thus, a function expressed compactly in terms of matrices would be expanded in scalar form, differentiated by scalar methods, and then reassembled in the desired matrix form (for examples, see Anderson [1, p. 47] and Rao [19, p. 448]). This resort to scalar methods was the consequence of an hiatus in matrix theory that threatened to impede the development of multivariate statistical analysis. Efforts to develop a theory of matrix differential calculus were heralded by the seminal article of Dwyer and MacPhail [5] and were given further impetus some twenty years later by Dwyer [4]. These authors were faced with the problem differentiating an arbitrary function Y = Y(X), where Y = [yki] and X = [Xjj] are matrices of any order. However, they avoided the difficulties inherent in a fully fledged theory of matrix differential calculus by confining their attention to the component derivatives dykl/dX and dY/dXjj. Although the derivative dY/dxtj proved easier to handle, it appeared that the derivative dykI/dX was the more useful in multivariate analysis. By developing a general theory that linked the two forms of derivative, Dwyer and MacPhail were able to obtain their derivatives in one form and convert them into the other form for practical use. Since the appearance of the article by Dwyer, there has been a rapid growth in the literature on matrix differential calculus and on the related topics of the algebra of Kronecker products and the vectorization operator.

[1]  P. S. Dwyer,et al.  Symbolic Matrix Derivatives , 1948 .

[2]  T. Koopmans Statistical inference in dynamic economic models , 1951 .

[3]  Tjalling C. Koopmans,et al.  Statistical Inference in Dynamic Economic Models , 1951 .

[4]  Tjalling C. Koopmans,et al.  Studies in Econometric Method , 1954 .

[5]  T. W. Anderson,et al.  An Introduction to Multivariate Statistical Analysis , 1959 .

[6]  T. W. Anderson An Introduction to Multivariate Statistical Analysis , 1959 .

[7]  Nicolas Bourbaki,et al.  Eléments de Mathématique , 1964 .

[8]  Calyampudi R. Rao,et al.  Linear statistical inference and its applications , 1965 .

[9]  P. S. Dwyer Some Applications of Matrix Derivatives in Multivariate Analysis , 1967 .

[10]  M. Marcus Finite dimensional multilinear algebra , 1973 .

[11]  E. C. Macrae Matrix Derivatives with an Application to an Adaptive Linear Decision Problem , 1974 .

[12]  S. R. Searle,et al.  Vec and vech operators for matrices, with some uses in jacobians and multivariate statistics , 1979 .

[13]  D. G. Nel On matrix differentiation in statistics , 1980 .

[14]  Alexander Graham,et al.  Kronecker Products and Matrix Calculus: With Applications , 1981 .

[15]  S. R. Searle,et al.  The Vec-Permutation Matrix, the Vec Operator and Kronecker Products: A Review , 1981 .

[16]  D.S.G. Pollock,et al.  Tensor products and matrix differential calculus , 1985 .

[17]  J. Magnus,et al.  Matrix Differential Calculus with Applications in Statistics and Econometrics (Revised Edition) , 1999 .