A computationally efficient procedure for data envelopment analysis.

This thesis is the final outcome of a project carried out for the UK's Department for Education and Skills (DfES). They were interested in finding a fast algorithm for solving a Data Envelopment Analysis (DEA) model to compare the relative efficiency of 13216 primary schools in England based on 9 input-output factors. The standard approach for solving a DEA model comparing n units (such as primary schools) based on m factors, requires solving 2n linear programming (LP) problems, each with m constraints and at least n variables. At m = 9 and n = 13216, it was proving to be difficult. The research reported in this thesis describes both theoretical and practical contributions to achieving faster computational performance. First we establish that in analysing any unit t only against some critically important units - we call them generators - we can either (a) complete its efficiency analysis, or (b) find a new generator. This is an important contribution to the theory of solution procedures of DEA. It leads to our new Generator Based Algorithm (GBA) which solves only n LPs of maximum size (m x k), where k is the number of generators. As k is a small percentage of n, GBA significantly improves computational performance in large datasets. Further, GBA is capable of solving all the commonly used DEA models including important extensions of the basic models such as weight restricted models. In broad outline, the thesis describes four themes. First, it provides a comprehensive critical review of the extant literature on the computational aspects of DEA. Second, the thesis introduces the new computationally efficient algorithm GBA. It solves the practical problem in 105 seconds. The commercial software used by the DfES, at best, took more than an hour and often took 3 to 5 hours making it impractical for model development work. Third, the thesis presents results of comprehensive computational tests involving GBA, Jose Dula's BuildHull - the best available DEA algorithm in the literature - and the standard approach. Dula's published result showing that BuildHull consistently outperforms the standard approach is confirmed by our experiments. It is also shown that GBA is consistently better than BuildHull and is a viable tool for solving large scale DBA problems. An interesting by-product of this work is a new closed-form solution to the important practical problem of finding strictly positive factor weights without explicit weight restrictions for what are known in the DEA literature as "extreme-efficient units". To date, the only other methods for achieving this require solving additional LPs or a pair of Mixed Integer Linear Programs.

[1]  Russell G. Thompson,et al.  Importance for DEA of zeros in data, multipliers, and solutions , 1993 .

[2]  R. RajivD.BANKE Estimating most productive scale size using data envelopment analysis , 2003 .

[3]  M. Farrell,et al.  Estimating Efficient Production Functions Under Increasing Returns to Scale , 1962 .

[4]  Joe Zhu,et al.  A modified super-efficiency DEA model for infeasibility , 2009, J. Oper. Res. Soc..

[5]  Ali Emrouznejad,et al.  Evaluation of research in efficiency and productivity: A survey and analysis of the first 30 years , 2008 .

[6]  Yih-Long Chang,et al.  Efficient algorithm for additive and multiplicative models in data envelopment analysis , 1989 .

[7]  O. H. Brownlee,et al.  ACTIVITY ANALYSIS OF PRODUCTION AND ALLOCATION , 1952 .

[8]  Boaz Golany,et al.  Foundations of data envelopment analysis for Pareto-Koopmans efficient empirical production functions , 1985 .

[9]  ROBUST EFFICIENCY MEASUREMENT Dealing with outliers in Data Envelopment Analysis , 1999 .

[10]  Paul Rouse,et al.  Interior point methods in DEA to determine non-zero multiplier weights , 2012, Comput. Oper. Res..

[11]  William W. Cooper,et al.  Choosing weights from alternative optimal solutions of dual multiplier models in DEA , 2007, Eur. J. Oper. Res..

[12]  Kaoru Tone,et al.  Data Envelopment Analysis , 1996 .

[13]  Lawrence M. Seiford,et al.  Stability regions for maintaining efficiency in data envelopment analysis , 1998, Eur. J. Oper. Res..

[14]  S. Afriat Efficiency Estimation of Production Function , 1972 .

[15]  Harold O. Fried,et al.  The Measurement of Productive Efficiency and Productivity Growth , 2008 .

[16]  Yao Chen,et al.  Measuring super-efficiency in DEA in the presence of infeasibility , 2005, Eur. J. Oper. Res..

[17]  Rolf Färe,et al.  Estimation of returns to scale using data envelopment analysis: A comment , 1994 .

[18]  P. Andersen,et al.  A procedure for ranking efficient units in data envelopment analysis , 1993 .

[19]  R. R. Russell,et al.  On the Axiomatic Approach to the Measurement of Technical Efficiency , 1988 .

[20]  A. Charnes,et al.  A structure for classifying and characterizing efficiency and inefficiency in Data Envelopment Analysis , 1991 .

[21]  Rajiv D. Banker,et al.  Estimation of returns to scale using data envelopment analysis , 1992 .

[22]  Robert M. Thrall,et al.  What Is the Economic Meaning of FDH? , 1999 .

[23]  José H. Dulá,et al.  A Computational Framework for Accelerating DEA , 2001 .

[24]  William W. Cooper,et al.  Evaluating Program and Managerial Efficiency: An Application of Data Envelopment Analysis to Program Follow Through , 1981 .

[25]  Dominique Deprins,et al.  Measuring Labor-Efficiency in Post Offices , 2006 .

[26]  Léopold Simar,et al.  Estimating and bootstrapping Malmquist indices , 1999, Eur. J. Oper. Res..

[27]  Joe Zhu,et al.  Sensitivity analysis of DEA models for simultaneous changes in all the data , 1998, J. Oper. Res. Soc..

[28]  R. Shepherd Theory of cost and production functions , 1970 .

[29]  A. Charnes,et al.  Data Envelopment Analysis Theory, Methodology and Applications , 1995 .

[30]  Rajiv D. Banker,et al.  The super-efficiency procedure for outlier identification, not for ranking efficient units , 2006, Eur. J. Oper. Res..

[31]  Ole Bent Olesen,et al.  Indicators of ill-conditioned data sets and model misspecification in data envelopment analysis: an extended facet approach , 1996 .

[32]  Juan Aparicio,et al.  Closest targets and minimum distance to the Pareto-efficient frontier in DEA , 2007 .

[33]  José H. Dulá,et al.  A computational study of DEA with massive data sets , 2008, Comput. Oper. Res..