A combined machine learning algorithms and DEA method for measuring and predicting the efficiency of Chinese manufacturing listed companies

Abstract Data Envelopment Analysis (DEA) is a linear programming methodology for measuring the efficiency of Decision Making Units (DMUs) to improve organizational performance in the private and public sectors. However, if a new DMU needs to be known its efficiency score, the DEA analysis would have to be re-conducted, especially nowadays, datasets from many fields have been growing rapidly in the real world, which will need a huge amount of computation. Following the previous studies, this paper aims to establish a linkage between the DEA method and machine learning (ML) algorithms, and proposes an alternative way that combines DEA with ML (ML-DEA) algorithms to measure and predict the DEA efficiency of DMUs. Four ML-DEA algorithms are discussed, namely DEA-CCR model combined with back-propagation neural network (BPNN-DEA), with genetic algorithm (GA) integrated with back-propagation neural network (GANN-DEA), with support vector machines (SVM-DEA), and with improved support vector machines (ISVM-DEA), respectively. To illustrate the applicability of above models, the performance of Chinese manufacturing listed companies in 2016 is measured, predicted and compared with the DEA efficiency scores obtained by the DEA-CCR model. The empirical results show that the average accuracy of the predicted efficiency of DMUs is about 94%, and the comprehensive performance order of four ML-DEA algorithms ranked from good to poor is GANN-DEA, BPNN-DEA, ISVM-DEA, and SVM-DEA.

[1]  Dariush Khezrimotlagh,et al.  An integrated model for green supplier selection under fuzzy environment: application of data envelopment analysis and genetic programming approach , 2015, Neural Computing and Applications.

[2]  Ali Emrouznejad,et al.  A survey and analysis of the first 40 years of scholarly literature in DEA: 1978–2016 , 2018 .

[3]  Mohammad Khoveyni,et al.  Directional congestion in the framework of data envelopment analysis , 2020 .

[4]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[5]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[6]  Abraham Charnes,et al.  Measuring the efficiency of decision making units , 1978 .

[7]  Fotios Pasiouras,et al.  Assessing Bank Efficiency and Performance with Operational Research and Artificial Intelligence Techniques: A Survey , 2009, Eur. J. Oper. Res..

[8]  Ali Emrouznejad,et al.  Measurement efficiency and productivity in SAS/OR , 2005, Comput. Oper. Res..

[9]  Hiroaki Kitano,et al.  Designing Neural Networks Using Genetic Algorithms with Graph Generation System , 1990, Complex Syst..

[10]  Alexander J. Smola,et al.  Support Vector Method for Function Approximation, Regression Estimation and Signal Processing , 1996, NIPS.

[11]  So Young Sohn,et al.  Global stock market investment strategies based on financial network indicators using machine learning techniques , 2019, Expert Syst. Appl..

[12]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[13]  Yao Chen,et al.  DEANN: A healthcare analytic methodology of data envelopment analysis and artificial neural networks for the prediction of organ recipient functional status , 2016 .

[14]  Peter Wanke,et al.  Insurance companies in Mozambique: a two-stage DEA and neural networks on efficiency and capacity slacks , 2014 .

[15]  Hsiang-Hsi Liu,et al.  A Comparison of Three-Stage DEA and Artificial Neural Network on the Operational Efficiency of Semi-Conductor Firms in Taiwan , 2013 .

[16]  Lijun Zhao,et al.  Robot teaching by teleoperation based on visual interaction and extreme learning machine , 2018, Neurocomputing.

[17]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[18]  Na Liu,et al.  Multimodal biometrics recognition based on local fusion visual features and variational Bayesian extreme learning machine , 2016, Expert Syst. Appl..

[19]  Colin Campbell,et al.  Learning with Support Vector Machines , 2011, Learning with Support Vector Machines.

[20]  James T. Kwok Support vector mixture for classification and regression problems , 1998, ICPR.

[21]  Lothar M. Schmitt,et al.  Theory of genetic algorithms , 2001, Theor. Comput. Sci..

[22]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[23]  Xueli Zhan,et al.  Analysis of Financing Efficiency of Chinese Agricultural Listed Companies Based on Machine Learning , 2019, Complex..

[24]  F. Rattay,et al.  The basic mechanism for the electrical stimulation of the nervous system , 1999, Neuroscience.

[25]  Ameet Talwalkar,et al.  Foundations of Machine Learning , 2012, Adaptive computation and machine learning.

[26]  Ali Emrouznejad,et al.  A combined neural network and DEA for measuring efficiency of large scale datasets , 2009, Comput. Ind. Eng..

[27]  Sonia Rebai,et al.  A graphically based machine learning approach to predict secondary schools performance in Tunisia , 2020 .

[28]  Gianpaolo Francesco Trotta,et al.  Computer vision and deep learning techniques for pedestrian detection and tracking: A survey , 2018, Neurocomputing.

[29]  Subhash C. Ray,et al.  Distribution of cost and profit efficiency: Evidence from Indian banking , 2010, Eur. J. Oper. Res..

[30]  Bo Jiang,et al.  Supplier's Efficiency and Performance Evaluation using DEA-SVM Approach , 2013, J. Softw..

[31]  A. Nandy,et al.  Farm efficiency estimation using a hybrid approach of machine-learning and data envelopment analysis: Evidence from rural eastern India , 2020 .

[32]  Michael D. Vose,et al.  The simple genetic algorithm - foundations and theory , 1999, Complex adaptive systems.

[33]  Paul J. Werbos,et al.  Applications of advances in nonlinear sensitivity analysis , 1982 .

[34]  Zongguo Wen,et al.  Determinants of efficiency in anaerobic bio-waste co-digestion facilities: A data envelopment analysis and gradient boosting approach , 2019, Applied Energy.

[35]  Lei Chen,et al.  A data envelopment analysis (DEA)-based method for rule reduction in extended belief-rule-based systems , 2017, Knowl. Based Syst..

[36]  Jooh Lee,et al.  Two-stage production modeling of large U.S. banks: A DEA-neural network approach , 2015, Expert Syst. Appl..

[37]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[38]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[39]  M. Farrell The Measurement of Productive Efficiency , 1957 .

[40]  I. Vlahavas,et al.  Machine Learning and Data Mining Methods in Diabetes Research , 2017, Computational and structural biotechnology journal.

[41]  Tzu-Yu Lin,et al.  Using independent component analysis and network DEA to improve bank performance evaluation , 2013 .

[42]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[43]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[44]  Jian-Bo Yang,et al.  Using Two-layer Minimax Optimization and DEA to Determine Attribute Weights , 2018, Journal of Management Science and Engineering.

[45]  A. Charnes,et al.  Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis , 1984 .

[46]  V. Vapnik Three fundamental concepts of the capacity of learning machines , 1993 .

[47]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[48]  Chuen-Sheng Cheng A multi-layer neural network model for detecting changes in the process mean , 1995 .

[49]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[50]  Hava T. Siegelmann,et al.  Support Vector Clustering , 2002, J. Mach. Learn. Res..

[51]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.