Data Envelopment Analysis of clinics with sparse data: Fuzzy clustering approach

This paper presents a method for utilizing Data Envelopment Analysis (DEA) with sparse input and output data using fuzzy clustering concepts. DEA, a methodology to assess relative technical efficiency of production units is susceptible to missing data, thus, creating a need to supplement sparse data in a reliable and accurate manner. The approach presented is based on a modified fuzzy c-means clustering using optimal completion strategy (OCS) algorithm. This particular algorithm is sensitive to the initial values chosen to substitute missing values and also to the selected number of clusters. Therefore, this paper proposes an approach to estimate the missing values using the OCS algorithm, while considering the issue of initial values and cluster size. This approach is demonstrated on a real and complete dataset of 22 rural clinics in the State of Kansas, assuming varying levels of missing data. Results show the effect of the clustering based approach on the data recovered considering the amount and type of missing data. Moreover, the paper shows the effect that the recovered data has on the DEA scores.

[1]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[2]  Guojun Gan,et al.  Data Clustering: Theory, Algorithms, and Applications (ASA-SIAM Series on Statistics and Applied Probability) , 2007 .

[3]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[4]  Timo Kuosmanen,et al.  Data envelopment analysis with missing data , 2009, J. Oper. Res. Soc..

[5]  A. Charnes,et al.  A multiplicative model for efficiency analysis , 1982 .

[6]  Hung-Tso Lin,et al.  Personnel selection using analytic network process and fuzzy data envelopment analysis approaches , 2010, Comput. Ind. Eng..

[7]  T. R. Nunamaker Measuring routine nursing service efficiency: a comparison of cost per patient day and data envelopment analysis models. , 1983, Health services research.

[8]  T. Butler,et al.  Evaluation of operating room suite efficiency in the Veterans Health Administration system by using data-envelopment analysis. , 2006, American journal of surgery.

[9]  Iain Paterson,et al.  Measuring Hospital Efficiency in Austria – A DEA Approach , 2002, Health care management science.

[10]  Abraham Charnes,et al.  Cone ratio data envelopment analysis and multi-objective programming , 1989 .

[11]  Stefan Conrad,et al.  Fuzzy Clustering of Incomplete Data Based on Cluster Dispersion , 2010, IPMU.

[12]  James C. Bezdek,et al.  Fuzzy c-means clustering of incomplete data , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[13]  Lawrence M. Seiford,et al.  A bibliography for Data Envelopment Analysis (1978-1996) , 1997, Ann. Oper. Res..

[14]  Jianhong Wu,et al.  Data clustering - theory, algorithms, and applications , 2007 .

[15]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[16]  Rolf Färe,et al.  Malmquist Productivity Indexes and Fisher Ideal Indexes , 1992 .

[17]  R Fare,et al.  MALMQUIST INDEXES AND FISHER IDEAL INDEXES , 1992 .

[18]  Tu Bao Ho,et al.  Cluster-Based Algorithms for Dealing with Missing Values , 2002, PAKDD.

[19]  Shiang-Tai Liu,et al.  A fuzzy DEA/AR approach to the selection of flexible manufacturing systems , 2008, Comput. Ind. Eng..

[20]  Ali Emrouznejad,et al.  Evaluation of research in efficiency and productivity: A survey and analysis of the first 30 years , 2008 .

[21]  Barton A. Smith,et al.  Comparative Site Evaluations for Locating a High-Energy Physics Lab in Texas , 1986 .

[22]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[23]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[24]  H. Sherman Hospital Efficiency Measurement and Evaluation: Empirical Test of a New Technique , 1984, Medical care.

[25]  Abraham Charnes,et al.  Measuring the efficiency of decision making units , 1978 .

[26]  Dimitris K. Despotis,et al.  Data envelopment analysis with missing values: An interval DEA approach , 2006, Appl. Math. Comput..

[27]  Y A Ozcan,et al.  Physician benchmarking: measuring variation in practice behavior in treatment of otitis media , 1998, Health care management science.

[28]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[29]  Chiang Kao,et al.  Data envelopment analysis with missing data: an application to University libraries in Taiwan , 2000, J. Oper. Res. Soc..

[30]  Robert Rosenman,et al.  Efficiency of Thai provincial public hospitals during the introduction of universal health coverage using capitation , 2008, Health care management science.

[31]  E. Lettieri,et al.  Efficiency and quality of care in nursing homes: an Italian case study , 2011, Health care management science.

[32]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[33]  James C. Bezdek,et al.  Local convergence of tri-level alternating optimization , 2001, Neural Parallel Sci. Comput..

[34]  篠原 正明,et al.  William W.Cooper,Lawrence M.Seiford,Kaoru Tone 著, DATA ENVELOPMENT ANALYSIS : A Comprehensive Text with Models, Applications, References and DEA-Solver Software, Kluwer Academic Publishers, 2000年, 318頁 , 2002 .

[35]  R. Rosenman,et al.  Data Envelopment Analysis to determine efficiencies of health maintenance organizations , 2000, Health care management science.

[36]  Cláudia S. Sarrico,et al.  Data Envelopment Analysis: A Comprehensive Text with Models, Applications, References and DEA-Solver Software , 2001, J. Oper. Res. Soc..

[37]  R C Durfee,et al.  A METHOD OF CLUSTER ANALYSIS. , 1970, Multivariate behavioral research.

[38]  Timo Kuosmanen,et al.  Modeling Blank Data Entries in Data Envelopment Analysis , 2002 .

[39]  A W EDWARDS,et al.  A METHOD FOR CLUSTER ANALYSIS. , 1965, Biometrics.

[40]  R. Giglio,et al.  An Exploratory Study Using Data Envelopment Analysis to Assess Neurotrauma Patients in the Intensive Care Unit , 2003, Health care management science.

[41]  Boaz Golany,et al.  Foundations of data envelopment analysis for Pareto-Koopmans efficient empirical production functions , 1985 .

[42]  Jos L. T. Blank,et al.  Environmental factors and productivity on Dutch hospitals: a semi-parametric approach , 2009, Health care management science.

[43]  A. Charnes,et al.  Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis , 1984 .

[44]  Peter Congdon Multilevel and Clustering Analysis of Health Outcomes in Small Areas , 1997 .

[45]  B. Helmig,et al.  On the efficiency of public, welfare and private hospitals in Germany over time: a sectoral data envelopment analysis study , 2001, Health services management research.

[46]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[47]  K. Florek,et al.  Sur la liaison et la division des points d'un ensemble fini , 1951 .

[48]  Ning Jackie Zhang,et al.  Explaining the efficiency of local health departments in the U.S.: an exploratory analysis , 2010, Health care management science.

[49]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[50]  P. Andersen,et al.  A procedure for ranking efficient units in data envelopment analysis , 1993 .

[51]  Miika Linna,et al.  Measuring Efficiency of Long-Term Care Units in Finland , 2001, Health care management science.

[52]  Alberto Sanfeliu,et al.  Progress in Pattern Recognition, Speech and Image Analysis , 2003, Lecture Notes in Computer Science.

[53]  Max Chacón,et al.  Patients Classification by Risk Using Cluster Analysis and Genetic Algorithms , 2003, CIARP.

[54]  Rolf Färe,et al.  Two Perspectives on DEA: Unveiling the Link between CCR and Shephard , 2002 .

[55]  Nunamaker Tr,et al.  Measuring routine nursing service efficiency: a comparison of cost per patient day and data envelopment analysis models. , 1983 .