Subjective Bayesian analysis for surveys with missing data

Almost every survey has missing data, sometimes because of inadequacy of the sampling frame, sometimes because of the unwillingness of persons in the frame to participate, sometimes because of the inability of the surveyors to find everyone in the frame, and usually because of an unknown mixture of all these reasons and others. The analyses of surveys often ignore the missing data, treating it as a complete sample of those who responded. However, to do this is often to assume away the principal source of uncertainty, rendering statements of uncertainty conditioned on that assumption problematic. One solution is the assumption of 'ignorability', as discussed by Don Rubin. This approach essentially states in conditional probability terms what one must assume, to get away with an analysis that does not take into account the missing data. The purpose of this paper is to explore a second approach, in which differing beliefs about what the missing data would have been had they been collected are explored to see how robust the results of the survey are. What is reasonable to assume depends on subjective judgements of the analyst. These ideas are considered in the context of a survey on juror death penalty attitudes and behavior. The comparison of subjectivist Bayesian and objectivist frequentistic methods applied to practical problems is very important to the advancement of both methodological positions. This paper is devoted to the consideration of a type of data in which one would have thought that the frequentists would be at their best. The frequency argument is based on viewing a given instance or sample as a member of an infinite stream of independent and identically distributed such instances, and associating the probability of the instance with the relative frequency in the infinite stream. Of course, there are data situations that are very awkward from this stance. Consider, for example, the probability that it will rain in Nottingham sometime tomorrow. Suppose that many years of past data are available. With what subset of this data should I associate tomorrow's rain? Should I choose only those in which the wind, rain and temperature conditions in Dublin match those of today? In making such choices the subjectivity of the associated 'infinite' stream is apparent, and is an embarrassment to a frequentist seeking objectivity. Thus, to explore the possible usefulness of frequentistic ideas, one must choose an example more carefully, in the hope of finding one more congenial to their approach. Surely sampling from a fixed population comes to mind as a candidate. There are two possible senses of sampling that then come to mind: that the individuals in the sample are a random sample from a frame of such individuals, and that the sample itself is random from a frame of subsets, perhaps subsets of the same size. Since the latter results in a sample of size one, it seems very weak in a frequentistic sense. Consequently, the first interpretation is concentrated on here. In a practical implementation of sampling from a human population, not everyone responds. People move, are busy, do not bother and find the questions objectionable. In a reasonably good survey, perhaps 70% of the chosen sample responds. This practical fact imposes a heavy burden on the analysis of the survey data. One response that might be made is to take the sample as representative only of those who would have responded had they been asked. While this has the attraction of objectivistic frequentist ideological purity, it has costs. One is that the sampling frame is of unknown

[1]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data , 1988 .

[2]  L. J. Savage The Foundations of Statistical Inference. , 1963 .

[3]  P. Ellsworth,et al.  The effects of death qualification on jurors' predisposition to convict and on the quality of deliberation , 1984 .

[4]  S. Gross Determining the neutrality of death-qualified juries , 1984 .

[5]  Stephen E. Fienberg,et al.  Discrete Multivariate Analysis: Theory and Practice , 1976 .

[6]  H. Jeffreys,et al.  The Theory of Probability , 1896 .

[7]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[8]  Edward E. Leamer,et al.  Specification Searches: Ad Hoc Inference with Nonexperimental Data , 1980 .

[9]  Harry Kalven,et al.  The American Jury , 1967 .

[10]  Joseph B. Kadane,et al.  Juries Hearing Death Penalty Cases: Statistical Analysis of a Legal Procedure , 1983 .

[11]  P. Holland,et al.  Discrete Multivariate Analysis. , 1976 .

[12]  W. H. Charles,et al.  The Limits of the Criminal Sanction , 1968 .

[13]  G. Moran,et al.  Neither "tentative" nor "fragmentary": Verdict preference of impaneled felony jurors as a function of attitude toward capital punishment. , 1986 .

[14]  D. Poirier Frequentist and Subjectivist Perspectives on the Problems of Model Building in Economics , 1988 .

[15]  James M. Dickey,et al.  Scientific Reporting and Personal Probabilities: Student's Hypothesis , 1973 .

[16]  George L. Jurow New Data on the Effect of a "Death Qualified" Jury on the Guilt Determination Process , 1971 .

[17]  Robert Fitzgerald,et al.  Due process vs. crime control , 1984 .

[18]  K. Middendorf,et al.  Death penalty beliefs and jurors' responses to aggravating and mitigating circumstances in capital trials , 1988 .