Interpretable knowledge extraction from emergency call data based on fuzzy unsupervised decision tree

Nowadays, call centers are common in different areas of activity providing customer services, medical attention, security services, etc. Each type of call center has its own particularities but all call centers have to plan the availability of resources at each time period to support the incoming calls. The emergency call centers are a special case with extra restrictions. In this context, this work is devoted to providing support for the decision making about resource planning of an emergency call center in order to reach its mandatory quality of service. This is carried out by the extraction of interpretable knowledge from the activity data collected by an emergency call center. A linguistic prediction, categorization and description of the days based on the call center activity and information permits the workload for each category of day to be known. This has been generated by a fuzzy version of an unsupervised decision tree (FUDT), merging decision trees and clustering. This involves quality indexes to reach an adequate trade-off between the tree complexity and the category quality in order to guarantee interpretability and performance. This unsupervised approach deals correctly with the real management of this type of centers generating and preserving expert knowledge.

[1]  Terry P. Harrison,et al.  A stochastic programming model for scheduling call centers with global Service Level Agreements , 2010, Eur. J. Oper. Res..

[2]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Min-You Chen Establishing interpretable fuzzy models from numeric data , 2002, Proceedings of the 4th World Congress on Intelligent Control and Automation (Cat. No.02EX527).

[4]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[5]  Jayanta Basak,et al.  Interpretable hierarchical clustering by constructing an unsupervised decision tree , 2005, IEEE Transactions on Knowledge and Data Engineering.

[6]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[7]  Rudolf Kruse,et al.  Obtaining interpretable fuzzy classification rules from medical data , 1999, Artif. Intell. Medicine.

[8]  J. Ross Quinlan,et al.  Simplifying Decision Trees , 1987, Int. J. Man Mach. Stud..

[9]  Kezhi Mao,et al.  Identifying critical variables of principal components for unsupervised feature selection , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[10]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[11]  Robert X. Gao,et al.  PCA-based feature selection scheme for machine defect classification , 2004, IEEE Transactions on Instrumentation and Measurement.

[12]  Avishai Mandelbaum,et al.  Statistical Analysis of a Telephone Call Center , 2005 .

[13]  Lale Özbakir,et al.  Classification rule discovery for the aviation incidents resulted in fatality , 2009, Knowl. Based Syst..

[14]  Chaitanya K. Baru,et al.  Using 9-1-1 call data and the space-time permutation scan statistic for emergency event detection , 2008, Gov. Inf. Q..

[15]  Avishai Mandelbaum,et al.  Telephone Call Centers: Tutorial, Review, and Research Prospects , 2003, Manuf. Serv. Oper. Manag..

[16]  Shamsud D. Chowdhury,et al.  Centralization as a design consideration for the management of call centers , 2004, Inf. Manag..

[17]  Brigitte Charnomordic,et al.  A new method for inducing a set of interpretable fuzzy partitions and fuzzy inference systems from data , 2003 .

[18]  James C. Bezdek,et al.  Validity-guided (re)clustering with applications to image segmentation , 1996, IEEE Trans. Fuzzy Syst..

[19]  Zeynep Akşin,et al.  The Modern Call Center: A Multi‐Disciplinary Perspective on Operations Management Research , 2007 .

[20]  Yves Dallery,et al.  Queueing models for full-flexible multi-class call centers with real-time anticipated delays , 2009 .

[21]  Brigitte Charnomordic,et al.  Generating an interpretable family of fuzzy partitions from data , 2004, IEEE Transactions on Fuzzy Systems.

[22]  S. Appavu alias Balamurugan,et al.  Data mining based intelligent analysis of threatening e-mail , 2009, Knowl. Based Syst..

[23]  Shichao Zhang,et al.  Cost-sensitive classification with respect to waiting cost , 2010, Knowl. Based Syst..

[24]  Avishai Mandelbaum,et al.  Designing a Call Center with Impatient Customers , 2002, Manuf. Serv. Oper. Manag..

[25]  John Q. Gan,et al.  Low-level interpretability and high-level interpretability: a unified view of data-driven interpretable fuzzy system modelling , 2008, Fuzzy Sets Syst..

[26]  Maurice K. Wong,et al.  Algorithm AS136: A k-means clustering algorithm. , 1979 .

[27]  José Manuel Benítez,et al.  Feature Selection for Time Series Forecasting: A Case Study , 2008, 2008 Eighth International Conference on Hybrid Intelligent Systems.

[28]  Miin-Shen Yang A survey of fuzzy clustering , 1993 .

[29]  Bruno Crémilleux,et al.  A quality index for decision tree pruning , 2002, Knowl. Based Syst..

[30]  Patrick T. Harker,et al.  Capacity sizing in the presence of a common shared resource: Dimensioning an inbound call center , 2003, Eur. J. Oper. Res..

[31]  Jiebo Luo,et al.  Data Mining. Multimedia, Soft Computing, and Bioinformatics , 2005, IEEE Transactions on Neural Networks.