Stepwise regression data envelopment analysis for variable reduction

In this paper, we develop stepwise regression data envelopment model to select important variables. We formulate null hypothesis to understand the importance of each variable and use Kruskal-Wallis test for this purpose. If the Kruskal-Wallis test does not reject the null hypothesis then we can conclude that all the variables are of equal importance as their presence and on the other hand absence of other variable does not create huge fluctuations in efficiency scores in fact give a complete ranking relative to base model. If the Kruskal-Wallis test does reject the null hypothesis this will imply there is significant fluctuation in the efficiency score relative to base model. And therefore we have to further check the pair of variables that causes the fluctuation in order to determine its importance using Conover-Inman test. The results of the proposed models are compared with the results of previously published models of the same dataset. The proposed models helps understand the extent of misclassification decision making units as efficient/inefficient when variables are retained or discarded alongside provides useful managerial prescription to make improvement strategies.

[1]  Jesús T. Pastor,et al.  A Statistical Test for Nested Radial Dea Models , 2002, Oper. Res..

[2]  M. F. Fuller,et al.  Practical Nonparametric Statistics; Nonparametric Statistical Inference , 1973 .

[3]  T. Ueda,et al.  APPLICATION OF PRINCIPAL COMPONENT ANALYSIS FOR PARSIMONIOUS SUMMARIZATION OF DEA INPUTS AND/OR OUTPUTS , 1997 .

[4]  Joe Zhu,et al.  Data envelopment analysis vs. principal component analysis: An illustrative study of economic performance of Chinese cities , 1998, Eur. J. Oper. Res..

[5]  Zilla Sinuany-Stern,et al.  Combining ranking scales and selecting variables in the DEA context: the case of industrial branches , 1998, Comput. Oper. Res..

[6]  William W. Cooper,et al.  Introduction to Data Envelopment Analysis and Its Uses: With Dea-Solver Software and References , 2005 .

[7]  T. J. Cook,et al.  Evaluating the administrative efficiency of courts , 1982 .

[8]  Jordan L. Boyd-Graber,et al.  Maximum Likelihood , 2006 .

[9]  C. Jun,et al.  Performance of some variable selection methods when multicollinearity is present , 2005 .

[10]  Jati K. Sengupta,et al.  Tests of efficiency in data envelopment analysis , 1990, Comput. Oper. Res..

[11]  Larry Jenkins,et al.  A multivariate statistical approach to reducing the number of variables in data envelopment analysis , 2003, Eur. J. Oper. Res..

[12]  Michael Norman,et al.  Data Envelopment Analysis: The Assessment of Performance , 1991 .

[13]  H. Lütjohann The stepwise regression algorithm seen from the statistician’s point of view , 1970 .

[14]  J. Ruggiero Data Envelopment Analysis , 2011 .

[15]  Abstmct,et al.  Hypothesis Tests Using Data Envelopment Analysis , 2022 .

[16]  R. Banker Maximum likelihood, consistency and data envelopment analysis: a statistical foundation , 1993 .

[17]  Abraham Charnes,et al.  Measuring the efficiency of decision making units , 1978 .

[18]  Cliff T. Ragsdale,et al.  Spreadsheet modeling and decision analysis , 1996 .

[19]  Janet M. Wagner,et al.  Stepwise selection of variables in data envelopment analysis: Procedures and managerial perspectives , 2007, Eur. J. Oper. Res..