Instrumental Variables Estimation in Cluster Randomized Trials with Noncompliance : A Study of A Biometric Smartcard Payment System in India

Many policy evaluations occur in settings with treatment randomized at the cluster level and there is treatment noncompliance at the unit level within each cluster. For example, villages might be assigned to treatment and control, but residents in each village may choose to not comply with their assigned treatment status. This was the case in Andhra Pradesh, India, where the state government sought to evaluate the use of new biometric smartcards to deliver payments from two anti-poverty programs. Smartcard payments were randomized at the village level, but residents could choose to register for a smartcard or not. In some villages, more than 90% of residents complied with the treatment, while in other locations fewer than 15% of the residents complied. When noncompliance is present, investigators may choose to focus attention on either intention to treat effects or the treatment effect among the units that comply. When analysts focus on the effect among compliers, the instrumental variables framework can be used to evaluate identify and estimate causal effects. While a large literature exists on instrumental variables estimation methods, relatively little work has been focused on settings with clustered treatments. In the paper, we review extant methods for instrumental variable estimation in clustered designs. We then show that these methods depend on assumptions that are often unrealistic in applied settings. In response, we develop an estimation method that relaxes these assumptions. Specifically, our method allows for possible treatment effect heterogeneity that is correlated with cluster size and uses a finite sample variance estimator. We evaluate these methods using a series of simulations and apply them to data from the evaluation of using smartcard payments for anti-poverty programs in India. ∗We thank participants at the Penn Causal Inference Seminar for helpful comments. †Assistant Professor, University of Wisconsin, Madison, Email: hyunseung@stat.wisc.edu ‡Associate Professor, University of Pennsylvania, Email: luke.keele@uphs.upenn.edu ar X iv :1 80 5. 03 74 4v 2 [ st at .M E ] 7 J an 2 01 9

[1]  Jonathan H. Wright,et al.  A Survey of Weak Instruments and Weak Identification in Generalized Method of Moments , 2002 .

[2]  J. Robins,et al.  Instruments for Causal Inference: An Epidemiologist's Dream? , 2006, Epidemiology.

[3]  Gary King,et al.  The essential role of pair matching in cluster-randomized experiments, with application to the Mexican Universal Health Insurance Evaluation , 2009, 0910.3752.

[4]  Jake Bowers,et al.  Attributing Effects to a Cluster-Randomized Get-Out-the-Vote Campaign , 2009 .

[5]  Peter M. Aronow,et al.  Field Experiments and the Study of Voter Turnout , 2013 .

[6]  Jean-Marie Dufour,et al.  Some Impossibility Theorems in Econometrics with Applications to Structural and Dynamic Models , 1997 .

[7]  M. Bils [How Large Are Human-Capital Externalities? Evidence from Compulsory Schooling Laws]: Comment , 2000, NBER Macroeconomics Annual.

[8]  J. L. Hodges,et al.  Estimates of Location Based on Rank Tests , 1963 .

[9]  A. Wald The Fitting of Straight Lines if Both Variables are Subject to Error , 1940 .

[10]  Joel A. Middleton,et al.  Unbiased Estimation of the Average Treatment Effect in Cluster-Randomized Experiments , 2015 .

[11]  Richard J. Hayes,et al.  Cluster randomised trials , 2009 .

[12]  Peter Z. Schochet,et al.  Report : Estimation and Identification of the Complier Average Causal Effect Parameter in Education RCTs , 2009 .

[13]  Dylan S. Small,et al.  Randomization Inference in a Group–Randomized Trial of Treatments for Depression , 2008 .

[14]  E. Lehmann Elements of large-sample theory , 1998 .

[15]  Susan Athey,et al.  When Should You Adjust Standard Errors for Clustering? , 2017, The Quarterly Journal of Economics.

[16]  Joshua D. Angrist,et al.  Identification of Causal Effects Using Instrumental Variables , 1993 .

[17]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[18]  Christopher W. Larimer,et al.  Social Pressure and Voter Turnout: Evidence from a Large-Scale Field Experiment , 2008, American Political Science Review.

[19]  Tihomir Asparouhov,et al.  Cluster randomized trials with treatment noncompliance. , 2008, Psychological methods.

[20]  Allan Donner,et al.  Design and Analysis of Cluster Randomization Trials in Health Research , 2001 .

[21]  Xiao-Hua Zhou,et al.  Clustered encouragement designs with individual noncompliance: bayesian inference with randomization, and application to advance directive forms. , 2002, Biostatistics.

[22]  Peter Z. Schochet Estimators for Clustered Education RCTs Using the Neyman Model for Causal Inference , 2013 .

[23]  James E. Pustejovsky,et al.  Small-Sample Methods for Cluster-Robust Variance Estimation and Hypothesis Testing in Fixed Effects Models , 2016, Journal of Business & Economic Statistics.

[24]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[25]  Deborah Watson-Jones,et al.  HPTN 071 (PopART): Rationale and design of a cluster-randomised trial of the population impact of an HIV combination prevention intervention including universal testing and treatment – a study protocol for a cluster randomised trial , 2014, Trials.

[26]  Stephen Howes,et al.  Small but effective: India's targeted unconditional cash transfers , 2010 .

[27]  J M Robins,et al.  Confidence intervals for causal parameters. , 1988, Statistics in medicine.

[28]  L. Hedges,et al.  Intraclass Correlation Values for Planning Group-Randomized Trials in Education , 2007 .

[29]  P. Ding,et al.  General Forms of Finite Population Central Limit Theorems with Applications to Causal Inference , 2016, 1610.04821.

[30]  Karthik Muralidharan,et al.  Building State Capacity: Evidence from Biometric Smartcards in India , 2016 .

[31]  Tihomir Asparouhov,et al.  Intention‐to‐treat analysis in cluster randomized trials with noncompliance , 2008, Statistics in medicine.

[32]  Elizabeth L. Ogburn,et al.  High HIV prevalence and incidence among MSM across 12 cities in India , 2015, AIDS.

[33]  M. Baiocchi,et al.  Instrumental variable methods for causal inference , 2014, Statistics in medicine.

[34]  Douglas L. Miller,et al.  A Practitioner’s Guide to Cluster-Robust Inference , 2015, The Journal of Human Resources.

[35]  Dylan S Small,et al.  Random effects logistic models for analysing efficacy of a longitudinal randomized treatment with non‐adherence , 2006, Statistics in medicine.

[36]  Luke Keele,et al.  Inference for instrumental variables: a randomization inference approach , 2016, 1606.04146.

[37]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: Sensitivity Analysis and Bounds , 2015 .

[38]  Donald K. K. Lee,et al.  SHARP BOUNDS ON THE VARIANCE IN RANDOMIZED EXPERIMENTS , 2014, 1405.6555.