Outcome-free Design of Observational Studies: Peer Influence on Smoking

For estimating causal effects of treatments, randomized experiments are appropriately considered the gold standard, although they are often infeasible for a variety of reasons. Nevertheless, nonrandomized studies can and should be designed to approximate randomized experiments by using only background information to create subgroups of similar treated and control units, where "similar" here refers to their distributions of background variables. This activity should be conducted without access to any outcome data to assure the objectivity of the design. In many situations, these goals can be accomplished using propensity score methods, as illustrated here in the context of a study on whether nonsmoking Harvard freshmen are influenced by their smoking peers. In that study, propensity score methods were used to create matched groups of treated units (rooming with at least one smoker) and control units (rooming with only non-smokers) who are at least as similar with respect to their distributions of observed background characteristics as if they had been randomized, thereby approximating a randomized experiment with respect to the observed covariates.

[1]  D. Rubin,et al.  Reducing Bias in Observational Studies Using Subclassification on the Propensity Score , 1984 .

[2]  John Van Hoewyk,et al.  A multivariate technique for multiply imputing missing values using a sequence of regression models , 2001 .

[3]  D. Rubin,et al.  Combining Propensity Score Matching with Additional Adjustments for Prognostic Covariates , 2000 .

[4]  Henry E. Kyburg,et al.  Studies in Subjective Probability , 1965 .

[5]  D. Rubin 2 Statistical Inference for Causal Effects, With Emphasis on Applications in Epidemiology and Medical Statistics , 2007 .

[6]  W. G. Cochran The Planning of Observational Studies of Human Populations , 1965 .

[7]  Donald B. Rubin,et al.  Formal modes of statistical inference for causal effects , 1990 .

[8]  R. Fisher Statistical Methods for Research Workers , 1971 .

[9]  Donald B. Rubin,et al.  Statistical Issues in the Estimation of the Causal Effects of Smoking Due to the Conduct of the Tobacco Industry , 2000 .

[10]  Sen. M.I.,et al.  Statistical Science in the Courtroom , 2001 .

[11]  D. Rubin Assignment to Treatment Group on the Basis of a Covariate , 1976 .

[12]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[13]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[14]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[15]  A. Roy Some thoughts on the distribution of earnings , 1951 .

[16]  D B Rubin,et al.  Matching using estimated propensity scores: relating theory to practice. , 1996, Biometrics.

[17]  D. Rubin,et al.  Small-sample degrees of freedom with multiple imputation , 1999 .

[18]  D. Rubin,et al.  Constructing a Control Group Using Multivariate Matched Sampling Methods That Incorporate the Propensity Score , 1985 .

[19]  P. Holland Statistics and Causal Inference , 1985 .

[20]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[21]  A. Dawid Conditional Independence in Statistical Theory , 1979 .

[22]  D. Basu Randomization Analysis of Experimental Data: The Fisher Randomization Test , 1980 .

[23]  D B Rubin,et al.  Estimating the causal effects of smoking , 2001, Statistics in medicine.

[24]  Michel Mouchart,et al.  Discussion on "Conditional independence in statistitical theory" by A.P. Dawid , 1979 .

[25]  D. Rubin Using Propensity Scores to Help Design Observational Studies: Application to the Tobacco Litigation , 2001, Health Services and Outcomes Research Methodology.

[26]  D. Rubin The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials , 2007, Statistics in medicine.

[27]  D. Basu,et al.  Randomization Analysis of Experimental Data: The Fisher Randomization Test Rejoinder , 1980 .

[28]  S. Langenskiöld Peer influence on smoking : causation or correlation? , 2005 .

[29]  D. Rubin The ethics of consulting for the tobacco industry , 2002, Statistical methods in medical research.

[30]  D. Rubin Causal Inference Using Potential Outcomes , 2005 .

[31]  D. Rubin,et al.  ON LORD'S PARADOX , 1982 .

[32]  D. Rubin,et al.  Using Multivariate Matched Sampling and Regression Adjustment to Control Bias in Observational Studies , 1978 .

[33]  Donald B. Rubin,et al.  The Design of a General and Flexible System for Handling Nonresponse in Sample Surveys , 2004 .

[34]  D. Rubin Matched Sampling for Causal Effects: The Use of Matched Sampling and Regression Adjustment to Remove Bias in Observational Studies , 1973 .

[35]  D. Rubin Matched Sampling for Causal Effects: Matching to Remove Bias in Observational Studies , 1973 .

[36]  Donald B. Rubin,et al.  Affinely Invariant Matching Methods with Ellipsoidal Distributions , 1992 .

[37]  B. D. Finetti,et al.  Foresight: Its Logical Laws, Its Subjective Sources , 1992 .

[38]  D. Rubin,et al.  Bayesian inference for causal effects in randomized experiments with noncompliance , 1997 .

[39]  W. G. Cochran,et al.  Controlling Bias in Observational Studies: A Review. , 1974 .

[40]  D. Rubin,et al.  Multiple Imputation for Nonresponse in Surveys , 1989 .

[41]  Donald B. Rubin,et al.  A Potential Outcomes View of Value-Added Assessment in Education , 2004 .

[42]  Andrew Gelman,et al.  Applied Bayesian Modeling And Causal Inference From Incomplete-Data Perspectives , 2005 .

[43]  F. Earls,et al.  Firearm Violence Exposure and Serious Violent Behavior , 2005, Science.

[44]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[45]  T. Haavelmo,et al.  The Probability Approach in Econometrics , 1944 .

[46]  Joseph L Schafer,et al.  Analysis of Incomplete Multivariate Data , 1997 .

[47]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .