A Characterization of Optimal Designs for Observational Studies

SUMMARY An empirical investigation of the effects of a treatment is an observational study if it involves the comparison of treated and control groups that were not formed by randomization. In such studies, treated and control groups may differ systematically with respect to pretreatment measures or covariates, and addressing these pre- treatment differences is a central concern. Matching and subclassification are two standard methods of adjusting for observed pretreatment differences; they may be used alone (e.g. Cochran (1968)) or in conjunction with analytical or model-based adjustments (e.g. Rubin (1973, 1979), Holford (1978), Rosenbaum and Rubin (1984), section 3.3, and Rosenbaum (1987, 1988a)). In particular, Rubin's simulation studies suggest that model-based adjustments applied to matched or subclassified samples are more robust than model-based adjustments applied to unmatched samples. The purpose of this paper is to identify the structure of an optimal subclassification, i.e. one in which the treated and control subjects in the same subclass are, on average, as similar as possible with respect to observed covariates. It turns out that this structure is simple and intuitive. While adjustments may control imbalances in observed covariates, they cannot

[1]  P. Hall On Representatives of Subsets , 1935 .

[2]  W. G. Cochran The effectiveness of adjustment by subclassification in removing bias in observational studies. , 1968, Biometrics.

[3]  G. Chase,et al.  On the efficiency of matched pairs in Bernoulli trials , 1968 .

[4]  D. Rubin Matched Sampling for Causal Effects: The Use of Matched Sampling and Regression Adjustment to Remove Bias in Observational Studies , 1973 .

[5]  Robin Plackett The analysis of categorical data , 1974 .

[6]  J. A. Bondy,et al.  Graph Theory with Applications , 1978 .

[7]  A. Smith,et al.  Analysis of prospective epidemiologic studies by minimum distance case-control matching. , 1977, American journal of epidemiology.

[8]  R. G. Carpenter,et al.  Matching when covariables are normally distributed , 1977 .

[9]  S M McKinlay,et al.  Pair-matching-a reappraisal of a popular technique. , 1977, Biometrics.

[10]  T. Holford The analysis of pair-matched case-control studies, a multivariate approach. , 1978, Biometrics.

[11]  J. A. Bondy,et al.  Graph Theory with Applications , 1978 .

[12]  I. Anderson,et al.  Graphs and Networks , 1981, The Mathematical Gazette.

[13]  D. Rubin,et al.  Using Multivariate Matched Sampling and Regression Adjustment to Control Bias in Observational Studies , 1978 .

[14]  Kenneth Steiglitz,et al.  Combinatorial Optimization: Algorithms and Complexity , 1981 .

[15]  Clifford C. Clogg,et al.  The Analysis of Categorical Data (2nd Ed.). , 1983 .

[16]  Robert E. Tarjan,et al.  Data structures and network algorithms , 1983, CBMS-NSF regional conference series in applied mathematics.

[17]  D. Rubin,et al.  Reducing Bias in Observational Studies Using Subclassification on the Propensity Score , 1984 .

[18]  D. Rubin,et al.  Constructing a Control Group Using Multivariate Matched Sampling Methods That Incorporate the Propensity Score , 1985 .

[19]  P. Rosenbaum Model-Based Direct Adjustment , 1987 .

[20]  P. Rosenbaum Permutation Tests for Matched Pairs with Adjustments for Covariates , 1988 .

[21]  P. Rosenbaum Sensitivity analysis for matching with multiple controls , 1988 .

[22]  P. Rosenbaum The Role of Known Effects in Observational Studies , 1989 .

[23]  Paul R. Rosenbaum,et al.  Optimal Matching for Observational Studies , 1989 .