INFERENCE AND MISSING DATA

Two results are presented concerning inference when data may be missing. First, ignoring the process that causes missing data when making sampling distribution inferences about the parameter of the data, θ, is generally appropriate if and only if the missing data are “missing at random” and the observed data are “observed at random,” and then such inferences are generally conditional on the observed pattern of missing data. Second, ignoring the process that causes missing data when making Bayesian inferences about θ is generally appropriate if and only if the missing data are missing at random and the parameter of the missing data is “independent” of θ. Examples and discussion indicating the implications of these results are included.

[1]  S. S. Wilks Moments and Distributions of Estimates of Population Parameters from Fragmentary Samples , 1932 .

[2]  Oscar Kempthorne The design and analysis of experiments. , 1952 .

[3]  H. O. Hartley,et al.  A Plan for Programming Analysis of Variance for General Purpose Computers , 1956 .

[4]  M. Healy,et al.  Missing Values in Experiments Analysed on Automatic Computers , 1956 .

[5]  T. W. Anderson Maximum Likelihood Estimates for a Multivariate Normal Distribution when Some Observations are Missing , 1957 .

[6]  G. N. Wilkinson Estimation of Missing Values for the Analysis of Incomplete Data , 1958 .

[7]  E. Lehmann Testing Statistical Hypotheses , 1960 .

[8]  R. Bargmann,et al.  MAXIMUM LIKELIHOOD ESTIMATION WITH INCOMPLETE MULTIVARIATE DATA , 1964 .

[9]  R. Elashoff,et al.  Missing Observations in Multivariate Statistics I. Review of the Literature , 1966 .

[10]  R. R. Hocking,et al.  Estimation of Parameters in the Multivariate Normal Distribution with Missing Observations , 1968 .

[11]  R. R. Hocking,et al.  The analysis of incomplete data. , 1971 .

[12]  Donald B. Rubin,et al.  A Non‐Iterative Algorithm for Least Squares Estimation of Missing Values in Any Analysis of Variance Design , 1972 .

[13]  R. R. Hocking,et al.  Optimum Incomplete Multinormal Samples , 1972 .

[14]  G. C. Tiao,et al.  Bayesian inference in statistical analysis , 1973 .

[15]  Donald B. Rubin,et al.  Characterizing the Estimation of Parameters in Incomplete-Data Problems , 1974 .

[16]  Donald B. Rubin,et al.  Noniterative Least Squares Estimates, Standard Errors and F‐Tests for Analyses of Variance with Missing Data , 1976 .