Beyond Baseline and Follow-Up: The Case for More T in Experiments

The vast majority of randomized experiments in economics rely on a single baseline survey and a single follow-up survey. When multiple follow-ups are conducted, the reason is typically to examine the trajectory of treatment effects, so that in effect only one follow-up round is used to estimate each treatment effect of interest. While such a design is suitable for studying highly autocorrelated and relatively precisely measured outcomes in the health and education domains, this paper makes the case that it is unlikely to be optimal for measuring noisy and relatively less autocorrelated outcomes such as business profits, household incomes and expenditures, and episodic health outcomes. Taking multiple measurements of such outcomes at relatively short intervals allows the researcher to average out noise, increasing power. When the outcomes have low autocorrelation, it can make sense to do no baseline at all. Moreover, the author shows that for such outcomes, more power can be achieved with multiple follow-ups than by allocating the same total sample size over a single follow-up and baseline. The analysis highlights the large gains in power from using ANCOVA rather than difference-in-differences when autocorrelations are low and a baseline is taken. The paper discusses the issues involved in taking multiple measurements and makes recommendations for the design of experiments and related non-experimental impact evaluations.
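
The power comparison described above can be illustrated with a minimal sketch. It assumes a textbook setup that the abstract itself does not spell out: two equal-sized arms of n units each, an outcome with constant variance sigma^2, and the same correlation rho between any two survey rounds (an equicorrelated structure), with m baseline rounds and r follow-up rounds. Under those assumptions, the variance of the estimated treatment effect, in units of 2*sigma^2/n, takes a simple closed form for each estimator. The function name `variance_factor` and its parameters are hypothetical and chosen only for illustration.

```python
def variance_factor(method, rho, m, r):
    """Variance of the treatment-effect estimate in units of 2*sigma^2/n.

    Assumes (for illustration only) equal arm sizes and a common
    correlation rho between any two survey rounds.
    method: "post" (follow-up means only), "did" (difference-in-differences),
            or "ancova" (follow-up mean regressed on baseline mean).
    m: number of baseline rounds, r: number of follow-up rounds.
    """
    if r < 1:
        raise ValueError("need at least one follow-up round")
    # Variance of the mean of r equicorrelated follow-up measurements.
    post = (1 + (r - 1) * rho) / r
    if method == "post":
        return post
    if m < 1:
        raise ValueError("did and ancova need at least one baseline round")
    # Variance of the mean of m equicorrelated baseline measurements.
    pre = (1 + (m - 1) * rho) / m
    if method == "did":
        # Var(post mean - pre mean); the covariance between the two means is rho.
        return post + pre - 2 * rho
    if method == "ancova":
        # Residual variance after projecting the post mean on the pre mean.
        return post - rho ** 2 / pre
    raise ValueError(f"unknown method: {method}")


if __name__ == "__main__":
    # Compare designs that each use two survey rounds in total.
    for rho in (0.25, 0.8):
        print(
            rho,
            variance_factor("did", rho, m=1, r=1),
            variance_factor("ancova", rho, m=1, r=1),
            variance_factor("post", rho, m=0, r=2),
        )
```

In this sketch, a lower factor means more power for a given sample size. With a low autocorrelation such as rho = 0.25, two follow-ups with no baseline give a smaller factor than one baseline plus one follow-up under either estimator, and ANCOVA is well below difference-in-differences. With a high autocorrelation such as rho = 0.8, the baseline becomes valuable again.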
