Instruments, Randomization, and Learning about Development

There is currently much debate about the effectiveness of foreign aid and about what kind of projects can engender economic development. There is skepticism about the ability of econometric analysis to resolve these issues or of development agencies to learn from their own experience. In response, there is increasing use in development economics of randomized controlled trials (RCTs) to accumulate credible knowledge of what works, without overreliance on questionable theory or statistical methods. When RCTs are not possible, the proponents of these methods advocate quasi-randomization through instrumental variable (IV) techniques or natural experiments. I argue that many of these applications are unlikely to recover quantities that are useful for policy or understanding: two key issues are the misunderstanding of exogeneity and the handling of heterogeneity. I illustrate from the literature on aid and growth. Actual randomization faces similar problems as does quasi-randomization, notwithstanding rhetoric to the contrary. I argue that experiments have no special ability to produce more credible knowledge than other methods, and that actual experiments are frequently subject to practical problems that undermine any claims to statistical or epistemic superiority. I illustrate using prominent experiments in development and elsewhere. As with IV methods, RCT-based evaluation of projects, without guidance from an understanding of underlying mechanisms, is unlikely to lead to scientific progress in the understanding of economic development. I welcome recent trends in development experimentation away from the evaluation of projects and toward the evaluation of theoretical mechanisms. (JEL C21, F35, O19)

[1]  I NICOLETTI,et al.  The Planning of Experiments , 1936, Rivista di clinica pediatrica.

[2]  D. McKenzie How Can We Learn Whether Firm Policies are Working in Africa? Challenges (and Solutions?) for Experiments and Structural Models , 2011 .

[3]  Edgar K. Browning Incentive and Disincentive Experimentation for Income Maintenance Policy Purposes: Note , 1971 .

[4]  R. Bailey Dissent on Development: Studies and debates in development economics , 1972 .

[5]  T. C. Edens,et al.  Economic Growth , 1957, The Journal of Economic History.

[6]  William Diebold,et al.  Equality, the Third World and Economic Delusion , 1982, American Political Science Review.

[7]  P. T. Bauer,et al.  Equality, the Third World and Economic Delusion , 1982 .

[8]  Edward E. Leamer,et al.  Vector autoregressions for causal inference , 1985 .

[9]  D. Weil,et al.  A Contribution to the Empirics of Economic Growth Author ( s ) : , 2008 .

[10]  Joshua D. Angrist,et al.  Lifetime Earnings and the Vietnam Era Draft Lottery: Evidence from Social Security Administrative Records , 1990 .

[11]  H. James VARIETIES OF SELECTION BIAS , 1990 .

[12]  James J. Heckman,et al.  Randomization and Social Policy Evaluation , 1991 .

[13]  J. Angrist,et al.  Identification and Estimation of Local Average Treatment Effects , 1995 .

[14]  R Peto,et al.  Large-scale randomized evidence: large, simple trials and overviews of trials. , 1993, Annals of the New York Academy of Sciences.

[15]  C. Hoxby,et al.  Appendices to : “ Does Competition among Public Schools Benefit Students and Taxpayers ? , 2004 .

[16]  J. Angrist,et al.  Identification and Estimation of Local Average Treatment Effects , 1994 .

[17]  R. Barro,et al.  Inflation and Economic Growth , 1995 .

[18]  Peter Boone,et al.  Politics and the Effectiveness of Foreign Aid , 1995 .

[19]  James J. Heckman,et al.  Assessing the Case for Social Experiments , 1995 .

[20]  R W Makuch,et al.  Can treatment that is helpful on average be harmful to some patients? A study of the conflicting information needs of clinical inquiry and drug regulation. , 1996, Journal of clinical epidemiology.

[21]  R. Barro Determinants of Economic Growth: A Cross-Country Empirical Study , 1996 .

[22]  Charles F. Manski,et al.  Learning about Treatment Effects from Experiments with Random Assignment of Treatments , 1996 .

[23]  S Senn,et al.  On wisdom after the event. , 1997, Journal of clinical epidemiology.

[24]  J. Heckman Instrumental Variables: A Study of Implicit Behavioral Assumptions Used in Making Program Evaluations. , 1997 .

[25]  D. Dollar,et al.  Aid, Policies, and Growth , 1997 .

[26]  R W Makuch,et al.  On reaching the tunnel at the end of the light. , 1997, Journal of clinical epidemiology.

[27]  J. Angrist,et al.  Using Maimonides&Apos; Rule to Estimate the Effect of Class Size on Student Achievement , 1997 .

[28]  D G Altman,et al.  Within trial variation--a false trail? , 1998, Journal of clinical epidemiology.

[29]  M. Egger,et al.  Incommunicable knowledge? Interpreting and applying the results of clinical trials and meta-analyses. , 1998, Journal of clinical epidemiology.

[30]  E S Fisher,et al.  Variation in carotid endarterectomy mortality in the Medicare population: trial hospitals, volume, and patient characteristics. , 1998, JAMA.

[31]  David Card The Causal Effect of Education on Learning , 1999 .

[32]  Robert Lensink,et al.  Are There Negative Returns to Aid? , 2001, Changing the Conditions for Development Aid.

[33]  Henrik Hansen,et al.  Aid effectiveness disputed , 2000 .

[34]  J J Heckman,et al.  Local instrumental variables and latent variable models for identifying and bounding treatment effects. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[35]  James A. Robinson,et al.  The Colonial Origins of Comparative Development: An Empirical Investigation , 2000 .

[36]  Henrik Hansen,et al.  Aid and Growth Regressions , 2001 .

[37]  J. Concato,et al.  Randomized, controlled trials, observational studies, and the hierarchy of research designs. , 2000, The New England journal of medicine.

[38]  Henrik Hansen,et al.  On Aid, Growth and Good Policies , 2001, Changing the Conditions for Development Aid.

[39]  Lisa Chauvet,et al.  Aid and Performance: A Reassessment , 2001, Changing the Conditions for Development Aid.

[40]  Economic Policy, Distribution and Poverty: The Nature of Disagreements , 2001 .

[41]  R. Kanbur Economic Policy, Distribution and Poverty: The Nature of Disagreements , 2001 .

[42]  Shah Ebrahim,et al.  Data dredging , bias , or confounding They can all get you into the BMJ and the Friday papers , 2002 .

[43]  Jeffrey M. Woodbridge Econometric Analysis of Cross Section and Panel Data , 2002 .

[44]  Henrik Hansen,et al.  On the Empirics of Foreign Aid and Growth , 2004 .

[45]  S. Ebrahim,et al.  Data dredging, bias, or confounding , 2002, BMJ : British Medical Journal.

[46]  C. Wolfe The Creation of the Modern World: The Untold Story of the British Enlightenment , 2003 .

[47]  J. Moake,et al.  This article has been cited by other articles , 2003 .

[48]  F. Wolak,et al.  Structural Econometric Modeling: Rationales and Examples from Industrial Organization , 2004 .

[49]  J. Dwyer One world: the ethics of globalization , 2003 .

[50]  Oshua,et al.  USING MAIMONIDES’ RULE TO ESTIMATE THE EFFECT OF CLASS SIZE ON SCHOLASTIC ACHIEVEMENT* , 2003 .

[51]  Esther Duflo,et al.  Scaling Up and Evaluation , 2003 .

[52]  Esther Duflo,et al.  WOMEN AS POLICY MAKERS: EVIDENCE FROM A RANDOMIZED POLICY EXPERIMENT IN INDIA , 2004 .

[53]  David Roodman,et al.  Aid, Policies, and Growth: Comment , 2004 .

[54]  Edward Miguel,et al.  Worms: Identifying Impacts on Education and Health in the Presence of Treatment Externalities, Guide to Replication of Miguel and Kremer (2004) , 2014 .

[55]  Fadhel Kaboub Realistic Evaluation , 2004 .

[56]  The World Bank is finally embracing science , 2004, The Lancet.

[57]  Michael A. Clemens,et al.  Counting Chickens When They Hatch: The Short-Term Effect of Aid on Growth , 2004 .

[58]  D. Roodman,et al.  The Anarchy of Numbers: Aid, Development, and Cross-Country Empirics , 2003 .

[59]  Eric Zitzewitz,et al.  Retrospective vs. Prospective Analyses of School Inputs: The Case of Flip Charts in Kenya , 2004 .

[60]  E. Miguel,et al.  Economic Shocks and Civil Conflict: An Instrumental Variables Approach , 2004, Journal of Political Economy.

[61]  S. Sloman Causal Models: How People Think about the World and Its Alternatives , 2005 .

[62]  R. Murnane,et al.  Improving the Performance of the Education Sector: The Valuable, Challenging, and Limited Role of Random Assignment Evaluations , 2005 .

[63]  R. Cooper,et al.  The end of poverty: economic possibilities for our time. , 2008, European journal of dental education : official journal of the Association for Dental Education in Europe.

[64]  C. Meghir,et al.  Education Choices in Mexico: Using a Structural Model and a Randomized Experiment to evaluate Progresa.∗ , 2005 .

[65]  Stepan Jurajda,et al.  Admission to Selective Schools, Alphabetically , 2005 .

[66]  Thomas Pogge,et al.  World Poverty and Human Rights , 2005, Ethics & International Affairs.

[67]  E. Duflo,et al.  Dams , 2005 .

[68]  David A. Freedman,et al.  Statistical Models: Theory and Practice: References , 2005 .

[69]  Esther Duflo,et al.  Monitoring Works : Getting Teachers to Come to School ∗ , 2007 .

[70]  Robert J. Barro,et al.  Religion and Economy , 2006 .

[71]  Petra E. Todd,et al.  Assessing the Impact of a School Subsidy Program in Mexico: Using a Social Experiment to Validate a Dynamic Behavioral Model of Child Schooling and Fertility. , 2006, The American economic review.

[72]  Jeffrey R. Kling,et al.  Neighborhoods and Academic Achievement: Results from the Moving to Opportunity Experiment. NBER Working Paper No. 11909. , 2006 .

[73]  Santiago Levy,et al.  Progress Against Poverty: Sustaining Mexico's Progresa-Oportunidades Program , 2006 .

[74]  W. Easterly,et al.  The White Man's Burden: Why the West's Efforts to Aid the Rest Have Done So Much Ill and So Little Good , 2006 .

[75]  James J Heckman,et al.  Understanding Instrumental Variables in Models with Essential Heterogeneity , 2006, The Review of Economics and Statistics.

[76]  Justin McCrary,et al.  Manipulation of the Running Variable in the Regression Discontinuity Design: A Density Test , 2007 .

[77]  Nancy Cartwright,et al.  Are RCTs the Gold Standard? , 2007 .

[78]  D. Karlan,et al.  Credit Elasticities in Less-Developed Economies: Implications for Microfinance , 2007 .

[79]  James J. Heckman,et al.  Econometric Evaluation of Social Programs, Part II: Using the Marginal Treatment Effect to Organize Alternative Econometric Estimators to Evaluate Social Programs, and to Forecast their Effects in New Environments , 2007 .

[80]  D. McKenzie,et al.  Returns to Capital in Microenterprises: Evidence from a Field Experiment , 2007, SSRN Electronic Journal.

[81]  G. V. D. Berg,et al.  An Economic Analysis of Exclusion Restrictions for Instrumental Variable Estimation , 2007 .

[82]  J. Worrall Evidence in Medicine and Evidence‐Based Medicine , 2007 .

[83]  A. Banerjee,et al.  Making Aid Work , 2007 .

[84]  N. Cartwright,et al.  Hunting Causes and Using Them: Approaches in Philosophy and Economics , 2007 .

[85]  J. Sachs,et al.  THE END OF POVERTY: Economic Possibilities for Our Time , 2005 .

[86]  Dani Rodrik,et al.  The New Development Economics: We Shall Experiment, but How Shall We Learn? , 2008 .

[87]  Arvind Subramanian,et al.  Aid and Growth: What Does the Cross-Country Evidence Really Show? , 2005, The Review of Economics and Statistics.

[88]  Reinventing Foreign Reinventing Foreign Aid , 2008 .

[89]  Robert J. Sampson,et al.  Moving to Inequality: Neighborhood Effects and Experiments Meet Social Structure1 , 2008, American Journal of Sociology.

[90]  Steven D. Levitt,et al.  FIELD EXPERIMENTS IN ECONOMICS : THE PAST , THE PRESENT , AND THE FUTURE , 2008 .

[91]  David A. Freedman,et al.  On regression adjustments to experimental data , 2008, Adv. Appl. Math..

[92]  Erik Weber,et al.  Counterfactuals and causal inference: methods and principles for social research , 2008 .

[93]  W. Easterly,et al.  Can the West Save Africa? , 2008 .

[94]  M. Urquiola,et al.  Class-Size Caps, Sorting, and the Regression-Discontinuity Design , 2009 .

[95]  Jonathan Zinman,et al.  Put Your Money Where Your Butt is: A Commitment Contract for Smoking Cessation , 2009 .

[96]  P. Calkins,et al.  Common Wealth: Economics for a Crowded Planet , 2009 .

[97]  Monica Costa Dias,et al.  Alternative approaches to evaluation in empirical microeconomics , 2002, The Journal of Human Resources.

[98]  J. Morduch,et al.  The Impact of Microcredit on the Poor in Bangladesh: Revisiting the Evidence , 2009 .

[99]  Jonathan Robinson,et al.  Nudging Farmers to Use Fertilizer: Theory and Experimental Evidence from Kenya , 2009 .

[100]  David S. Lee,et al.  Regression Discontinuity Designs in Economics , 2009 .

[101]  Nancy Cartwright What are randomised controlled trials good for? , 2009 .

[102]  G. Imbens,et al.  Better Late than Nothing: Some Comments on Deaton (2009) and Heckman and Urzua (2009) , 2009 .

[103]  Sendhil Mullainathan,et al.  What's Advertising Content Worth? Evidence from a Consumer Credit Marketing Field Experiment , 2009 .

[104]  Michael A. Clemens,et al.  When does rigorous impact evaluation make a difference? The case of the Millennium Villages , 2010 .

[105]  C. Barrett,et al.  The Power and Pitfalls of Experiments in Development Economics: Some Non‐random Reflections , 2010 .

[106]  James J Heckman,et al.  Comparing IV with Structural Models: What Simple IV Can and Cannot Identify , 2009, Journal of econometrics.

[107]  P. Todd,et al.  Structural Estimation and Policy Evaluation in Developing Countries , 2009 .

[108]  Joel Mokyr,et al.  The Enlightened Economy: An Economic History of Britain 1700-1850 , 2010 .

[109]  Joshua D. Angrist,et al.  The Credibility Revolution in Empirical Economics: How Better Research Design is Taking the Con Out of Econometrics , 2010, SSRN Electronic Journal.

[110]  J. Shogren,et al.  The Experimental Mindset within Development Economics: Proper Use and Handling Are Everything , 2010 .

[111]  C. Sims But Economics Is Not an Experimental Science , 2010 .

[112]  Angus Deaton,et al.  Understanding the Mechanisms of Economic Development , 2010 .

[113]  Erik Snowberg,et al.  Selective Trials: A Principal-Agent Approach to Randomized Controlled Experiments , 2010 .

[114]  Thomas Barnebeck Andersen,et al.  Does the Internet Reduce Corruption? Evidence from U.S. States and across Countries , 2011 .

[115]  Sebastian Vollmer,et al.  Testing for heterogeneous treatment effects in experimental data: false discovery risks and correction procedures , 2014 .

[116]  Matin Qaim,et al.  Yield Effects of Tissue Culture Bananas in Kenya: Accounting for Selection Bias and the Role of Complementary Inputs , 2012 .

[117]  Thuan Thai,et al.  Child Schooling, Child Health, and Rainfall Shocks: Evidence from Rural Vietnam , 2011 .

[118]  C. Udry Esther Duflo: 2010 John Bates Clark Medalist , 2011 .

[119]  Alessandro Tarozzi,et al.  Microcredit, Family Planning Programs, and Contraceptive Behavior: Evidence From a Field Experiment in Ethiopia , 2011, Demography.

[120]  Glenn W. Harrison,et al.  Experimental methods and the welfare evaluation of policy lotteries , 2011 .

[121]  F. Khan,et al.  Admissible Evidence in the Court of Development Evaluation? The Impact of CARE’s SHOUHARDO Project on Child Stunting in Bangladesh , 2011 .

[122]  Alexander Pfaff,et al.  Demonstrating bias and improved inference for stoves' health benefits. , 2011, International journal of epidemiology.

[123]  G. Harrison Randomisation and Its Discontents , 2011 .

[124]  Morten Jerven A Clash of Disciplines? Economists and Historians Approaching the African Past , 2011 .

[125]  Maren Duvendack,et al.  High Noon for Microfinance Impact Evaluations: Re-investigating the Evidence from Bangladesh , 2011 .

[126]  Jeffrey R. Kling,et al.  Mechanism Experiments and Policy Evaluations , 2011 .

[127]  Cristian Pop-Eleches,et al.  Going to a Better School: Effects and Behavioral Responses , 2011 .

[128]  S. W. Omamo,et al.  Social protection 2.0: Exploring issues, evidence and debates in a globalizing world , 2011 .

[129]  François Claveau,et al.  Evidential variety as a source of credibility for causal inference: beyond sharp designs and structural models , 2011 .

[130]  M. Eisner,et al.  The Effectiveness of Two Universal Preventive Interventions in Reducing Children's Externalizing Behavior: A Cluster Randomized Controlled Trial , 2011, Journal of clinical child and adolescent psychology : the official journal for the Society of Clinical Child and Adolescent Psychology, American Psychological Association, Division 53.

[131]  What Can Be Learned About the Competitive Effects of Mergers from “Natural Experiments”? , 2011 .

[132]  A. Mody,et al.  Growth from International Capital Flows: The Role of Volatility Regimes , 2011, SSRN Electronic Journal.

[133]  R. Naylor,et al.  Expanding the boundaries of agricultural development , 2011, Food Security.

[134]  Kosuke Imai,et al.  Unpacking the Black Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies , 2011, American Political Science Review.

[135]  A. Dillon Do differences in the scale of irrigation projects generate different impacts on poverty and production , 2011 .

[136]  O. Johansson-Stenman,et al.  Does Environmental Economics Produce Aeroplanes Without Engines? On the Need for an Environmental Social Science , 2011 .

[137]  G. Fields Labor market analysis for developing countries , 2011 .

[138]  T. Coelli,et al.  Assessing the Welfare Effects of Microfinance in Vietnam: Empirical Results from a Quasi-Experimental Survey , 2012 .

[139]  Maren Duvendack,et al.  Assessing ‘what works’ in international development: meta-analysis for sophisticated dummies , 2012 .

[140]  Lars Ivar Oppedal Berge,et al.  Business Training in Tanzania: From Research-driven Experiment to Local Implementation , 2012 .

[141]  Nancy Cartwright Presidential Address: Will This Policy Work for You? Predicting Effectiveness Better: How Philosophy Helps , 2012, Philosophy of Science.

[142]  Michael Hout,et al.  Social and Economic Returns to College Education in the United States , 2012 .

[143]  J. S. Silva,et al.  A cautionary note on tests of overidentifying restrictions , 2012 .

[144]  Kate Barker,et al.  Setting Priorities, Targeting Subsidies among Water, Sanitation, and Preventive Health Interventions in Developing Countries , 2012 .

[145]  A. Ashta,et al.  Developing microfinance: A survey of the literature† , 2012 .

[146]  D. Burde Assessing Impact and Bridging Methodological Divides: Randomized Trials in Countries Affected by Conflict , 2012, Comparative Education Review.

[147]  Paul J. Ferraro,et al.  Evaluation of biodiversity policy instruments: what works and what doesn't? , 2012 .

[148]  A. Briant,et al.  Can Tax Breaks Beat Geography? Lessons from the French Enterprise Zone Experience , 2012 .

[149]  Chad R. Lykins Why “What Works” Still Doesn't Work: How to Improve Research Syntheses at the What Works Clearinghouse , 2012 .

[150]  T. Halliday,et al.  Health Status and the Allocation of Time , 2012, Health economics.

[151]  G. Mwabu,et al.  Health Effects of Socioeconomic Status: Methods and Findings.. , 2012 .

[152]  Jeremy Foltz,et al.  Working Paper 145 - Assessing the Returns to Education in the Gambia , 2012 .

[153]  Donald B. Rubin,et al.  Evaluating the Effect of Training on Wages in the Presence of Noncompliance, Nonemployment, and Missing Outcome Data , 2012 .

[154]  Robert D. Woodberry The Missionary Roots of Liberal Democracy , 2012, American Political Science Review.