Inference for Cluster Randomized Experiments with Non-ignorable Cluster Sizes

This paper considers the problem of inference in cluster randomized experiments when cluster sizes are non-ignorable. Here, by a cluster randomized experiment, we mean one in which treatment is assigned at the level of the cluster; by non-ignorable cluster sizes we mean that the distribution of potential outcomes, and the treatment effects in particular, may depend non-trivially on the cluster sizes. In order to permit this sort of flexibility, we consider a sampling framework in which cluster sizes themselves are random. In this way, our analysis departs from earlier analyses of cluster randomized experiments in which cluster sizes are treated as non-random. We distinguish between two different parameters of interest: the equally-weighted cluster-level average treatment effect, and the size-weighted cluster-level average treatment effect. For each parameter, we provide methods for inference in an asymptotic framework where the number of clusters tends to infinity and treatment is assigned using a covariate-adaptive stratified randomization procedure. We additionally permit the experimenter to sample only a subset of the units within each cluster rather than the entire cluster and demonstrate the implications of such sampling for some commonly used estimators. A small simulation study and empirical demonstration show the practical relevance of our theoretical results.

[1]  Ji-zhe Liu Inference for Two-stage Experiments under Covariate-Adaptive Randomization , 2023, 2301.09016.

[2]  Yuya Sasaki,et al.  Non-Robustness of the Cluster-Robust Inference: with a Proposal of a New Robust Method , 2022, 2210.16991.

[3]  M. Mohanan,et al.  Different Strokes for Different Folks: Experimental Evidence on the Effectiveness of Input and Output Incentive Contracts for Health Care Providers with Varying Skills , 2019 .

[4]  P. Ding,et al.  Model‐assisted analyses of cluster‐randomized experiments , 2021, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[5]  J. Wooldridge,et al.  Revisiting regression adjustment in experiments with heterogeneous treatment effects , 2020 .

[6]  James G. MacKinnon,et al.  When and How to Deal with Clustered Errors in Regression Models , 2020 .

[7]  S. Raudenbush,et al.  Randomized Experiments in Education, with Implications for Multilevel Causal Inference , 2020 .

[8]  유재철,et al.  Randomization , 2020, Randomization, Bootstrap and Monte Carlo Methods in Biology.

[9]  Peter Z. Schochet,et al.  Design-Based Ratio Estimators and Central Limit Theorems for Clustered, Blocked RCTs , 2020, Journal of the American Statistical Association.

[10]  Eric B. Johnson,et al.  Linkedin(to) Job Opportunities: Experimental Evidence from Job Readiness Training , 2019, SSRN Electronic Journal.

[11]  Anna Popova,et al.  Does Teacher Training Actually Work? Evidence from a Large-Scale Randomized Evaluation of a National Teacher Training Program , 2019, American Economic Journal: Applied Economics.

[12]  J. Svensson,et al.  Reducing Child Mortality in the Last Mile: Experimental Evidence on Community Health Promoters in Uganda , 2019, American Economic Journal: Applied Economics.

[13]  C. de Chaisemartin,et al.  At What Level Should One Cluster Standard Errors in Paired Experiments, and in Stratified Experiments with Small Strata? , 2019, SSRN Electronic Journal.

[14]  Joseph P. Romano,et al.  Inference in Experiments With Matched Pairs , 2019, Journal of the American Statistical Association.

[15]  K. Muralidharan,et al.  Improving Last-Mile Service Delivery Using Phone-Based Monitoring , 2018, American Economic Journal: Applied Economics.

[16]  J. Lafortune,et al.  Role Models or Individual Consulting: The Impact of Personalizing Micro-entrepreneurship Training , 2018, American Economic Journal: Applied Economics.

[17]  Craig McIntosh,et al.  The Neighborhood Impacts of Local Infrastructure Investment: Evidence from Urban Mexico , 2018, American Economic Journal: Applied Economics.

[18]  Azeem M. Shaikh,et al.  Inference under Covariate-Adaptive Randomization with Multiple Treatments , 2018, Quantitative Economics.

[19]  B. Hansen,et al.  Asymptotic Theory for Clustered Samples , 2017, Journal of Econometrics.

[20]  M. Sulaiman,et al.  Leader Selection and Service Delivery in Community Groups: Experimental Evidence from Uganda , 2017, American Economic Journal: Applied Economics.

[21]  Susan Athey,et al.  When Should You Adjust Standard Errors for Clustering? , 2017, The Quarterly Journal of Economics.

[22]  Markus Goldstein,et al.  Women's Empowerment in Action: Evidence from a Randomized Control Trial in Africa , 2017, American Economic Journal: Applied Economics.

[23]  Fan Li,et al.  Review of Recent Methodological Developments in Group-Randomized Trials: Part 1-Design. , 2017, American journal of public health.

[24]  Thomas Giebe,et al.  Growing Markets Through Business Training for Female Entrepreneurs: A Market-Level Randomized Experiment in Kenya , 2017, American Economic Journal: Applied Economics.

[25]  Luke W. Miratrix,et al.  Bridging Finite and Super Population Causal Inference , 2017, 1702.08615.

[26]  A. Banerjee,et al.  E-Governance, Accountability, and Leakage in Public Programs: Experimental Evidence from a Financial Management Reform in India , 2016, American Economic Journal: Applied Economics.

[27]  Susan Athey,et al.  The Econometrics of Randomized Experiments , 2016, 1607.00698.

[28]  Bas van der Klaauw,et al.  Job-Search Periods for Welfare Applicants: Evidence from a Randomized Experiment , 2016, American Economic Journal: Applied Economics.

[29]  Azeem M. Shaikh,et al.  Inference Under Covariate-Adaptive Randomization , 2015, Journal of the American Statistical Association.

[30]  P. Gertler,et al.  Approximate Expected Utility Rationalization , 2015, American Economic Journal: Applied Economics.

[31]  Douglas L. Miller,et al.  A Practitioner’s Guide to Cluster-Robust Inference , 2015, The Journal of Human Resources.

[32]  Orazio Attanasio,et al.  The Impacts of Microfinance: Evidence from Joint-Liability Lending in Mongolia , 2015 .

[33]  Esther Duflo,et al.  Education, HIV, and Early Fertility: Experimental Evidence from Kenya , 2014, The American economic review.

[34]  Matías Busso,et al.  The Causal Effect of Competition on Prices and Quality: Evidence from a Field Experiment , 2014, American Economic Journal: Applied Economics.

[35]  Jonathan Zinman,et al.  Microcredit Impacts: Evidence from a Randomized Microcredit Program Placement Experiment by Compartamos Banco , 2014 .

[36]  Peter Z. Schochet Estimators for Clustered Education RCTs Using the Neyman Model for Causal Inference , 2013 .

[37]  A. Banerjee,et al.  The Miracle of Microfinance? Evidence from a Randomized Evaluation , 2013 .

[38]  Ghazala Mansuri,et al.  Together We Will: Experimental Evidence on Female Voting Behavior in Pakistan , 2011 .

[39]  P. Aronow,et al.  Unbiased Estimation of the Average Treatment Effect in Cluster-Randomized Experiments , 2011 .

[40]  Miriam Bruhn,et al.  In Pursuit of Balance: Randomization in Practice in Development Field Experiments , 2008 .

[41]  C. Hansen Asymptotic properties of a robust variance matrix estimator for panel data when T is large , 2007 .

[42]  J. N. K. Rao,et al.  Mean estimating equation approach to analysing cluster-correlated data with nonignorable cluster sizes , 2005 .

[43]  J. Matthews,et al.  Randomization in Clinical Trials: Theory and Practice; , 2003 .

[44]  Allan Donner,et al.  Design and Analysis of Cluster Randomization Trials in Health Research , 2001 .

[45]  Robert D. Tortora,et al.  Sampling: Design and Analysis , 2000 .

[46]  S. Raudenbush Statistical analysis and optimal design for cluster randomized trials , 1997 .

[47]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[48]  Esther Duflo,et al.  Estimating the Impact of Microcredit on Those Who Take it Up: Evidence from a Randomized Experiment in Morocco , 2014 .

[49]  B. Giraudeau,et al.  [Cluster randomised trials]. , 2014, Annales de dermatologie et de venereologie.

[50]  M. Kremer,et al.  USING RANDOMIZATION IN DEVELOPMENT ECONOMICS RESEARCH: A TOOLKIT , 2008 .

[51]  B. Longest Sampling techniques. , 1971, Hospitals.