Identification of a linear system from inexact data: A three-variable example☆

Abstract In 1901 Pearson formulated the general problem of how to fit a hyperplane in the most efficient way to a system of points in a data space. This problem is still not exactly solved in all its generality. As Kalman and Los have shown, all statistical attemps to solve the problem have failed, because each of them can provide only prejudicial and statistical, but not objective and mathematical solutions. However, exact mathematical solutions do exist for special cases. This paper's main principle of linear identification from inexact data provides the mathematical framework in which the problem and the deficiencies of the statistical solutions are conveniently discussed, in particular those of the least squares regression and statistical common factors schemes. It will be argued that the exact common factors, or Frisch scheme, offers most promise to direct us to complete and exact solutions, even though it imposes severe restrictions on the orders of the systems because of Wilson's inequality. Throughout this paper the problem and its various solution schemes are illustrated by an empirical example consisting of three data variables describing the profitability performance of some large U.S. bank holding companies. For this empirical example the Frisch scheme provides a unique solution, contrary to some earlier pessimistic conclusions.

[1]  R. Frisch Statistical confluence analysis by means of complete regression systems , 1934 .

[2]  L. Thurstone The Vectors of Mind , 1935 .

[3]  C. Los,et al.  How to Determine the Corank and Noise Level of a System , 1988 .

[4]  Cheng Hsiao,et al.  Latent variable models in econometrics , 1984 .

[5]  H. Harman Modern factor analysis , 1961 .

[6]  R A Kerr Pity the Poor Weatherman: Despite satellites, supercomputers, and billions of observations, weather forecasting skill is improving only slowly, often too slowly for the public to notice. , 1985, Science.

[7]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[8]  Olav Reiersol,et al.  Confluence Analysis by Means of Lag Moments and Other Methods of Confluence Analysis , 1941 .

[9]  R. E. Kalman,et al.  The prejudices of least squares, principal components and common factor schemes , 1987 .

[10]  W. Ledermann On the rank of the reduced correlational matrix in multiple-factor analysis , 1937 .

[11]  E. B. Wilson,et al.  The Resolution of Six Tests into Three General Factors. , 1939, Proceedings of the National Academy of Sciences of the United States of America.

[12]  E. Malinvaud,et al.  Statistical Methods of Econometrics. by E. Malinvaud , 1972 .

[13]  R. Pintner,et al.  Crossroads in the Mind of Man: A Study of Differentiable Mental Abilities. , 1929 .

[14]  C. Spearman General intelligence Objectively Determined and Measured , 1904 .

[15]  Zvi Griliches,et al.  ECONOMIC DATA ISSUES , 1986 .

[16]  K. Popper,et al.  Conjectures and Refutations , 1963 .

[17]  R. E. Kalman,et al.  Identification from Real Data , 1982 .

[18]  H. Theil,et al.  Three-Stage Least Squares: Simultaneous Estimation of Simultaneous Equations , 1962 .

[19]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[20]  Z. Griliches ERRORS IN VARIABLES AND OTHER UNOBSERVABLES , 1974 .

[21]  S. S. Wilks,et al.  Linear Regression Analysis of Economic Time Series. , 1938 .

[22]  A. Shapiro Rank-reducibility of a symmetric matrix and sampling theory of minimum trace factor analysis , 1982 .

[23]  T. Haavelmo The Statistical Implications of a System of Simultaneous Equations , 1943 .

[24]  F. Galton I. Family likeness in stature , 1886, Proceedings of the Royal Society of London.

[25]  P. Phillips Proffessor T.W. Anderson , 1986, Econometric Theory.

[26]  Herman Rubin,et al.  Statistical Inference in Factor Analysis , 1956 .

[27]  R. Kálmán Identification of noisy systems , 1985 .

[28]  A. McNair THE HALF-LIFE OF VANADIUM-50 , 1961 .

[29]  S. Mulaik,et al.  Foundations of Factor Analysis , 1975 .

[30]  Edward E. Leamer,et al.  Consistent Sets of Estimates for Regressions with Errors in All Variables , 1984 .