An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models

An important objective of data mining is the development of predictive models. Based on a number of observations, a model is constructed that allows the analysts to provide classifications or predictions for new observations. Currently, most research focuses on improving the accuracy or precision of these models and comparatively little research has been undertaken to increase their comprehensibility to the analyst or end-user. This is mainly due to the subjective nature of 'comprehensibility', which depends on many factors outside the model, such as the user's experience and his/her prior knowledge. Despite this influence of the observer, some representation formats are generally considered to be more easily interpretable than others. In this paper, an empirical study is presented which investigates the suitability of a number of alternative representation formats for classification when interpretability is a key requirement. The formats under consideration are decision tables, (binary) decision trees, propositional rules, and oblique rules. An end-user experiment was designed to test the accuracy, response time, and answer confidence for a set of problem-solving tasks involving the former representations. Analysis of the results reveals that decision tables perform significantly better on all three criteria, while post-test voting also reveals a clear preference of users for decision tables in terms of ease of use.

[1]  Sirkka L. Jarvenpaa,et al.  Graphics and managerial decision making: research-based guidelines , 1988, CACM.

[2]  Michel Parent,et al.  The French Program La Route Automatisée , 2000, IEEE Intell. Syst..

[3]  Bart Baesens,et al.  Decompositional Rule Extraction from Support Vector Machines by Active Learning , 2009, IEEE Transactions on Knowledge and Data Engineering.

[4]  Bart Baesens,et al.  Comprehensible Credit Scoring Models Using Rule Extraction from Support Vector Machines , 2007, Eur. J. Oper. Res..

[5]  J. McGrath Groups: Interaction and Performance , 1984 .

[6]  Joydeep Ghosh,et al.  Symbolic Interpretation of Artificial Neural Networks , 1999, IEEE Trans. Knowl. Data Eng..

[7]  Jan Vanthienen,et al.  An Illustration of Verification and Validation in the Modelling Phase of KBS Development , 1998, Data Knowl. Eng..

[8]  Jerry N. Luftman,et al.  Enablers and Inhibitors of Business-IT Alignment , 1999, Commun. Assoc. Inf. Syst..

[9]  Rudy Setiono,et al.  An Approach To Generate Rules From Neural Networks for Regression Problems , 2004, Eur. J. Oper. Res..

[10]  Johan A. K. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring , 2003, J. Oper. Res. Soc..

[11]  Dale Goodhue,et al.  Task-Technology Fit and Individual Performance , 1995, MIS Q..

[12]  Iris Vessey,et al.  Cognitive Fit: A Theory‐Based Analysis of the Graphs Versus Tables Literature* , 1991 .

[13]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[14]  Zhi-Hua Zhou,et al.  Extracting symbolic rules from trained neural network ensembles , 2003, AI Commun..

[15]  Iris Vessey,et al.  Multiattribute Data Presentation and Human Judgment: A Cognitive Fit Perspective* , 1994 .

[16]  William Remus,et al.  A Study of Graphical and Tabular Displays and Their Interaction with Environmental Complexity , 1987 .

[17]  David P. Tegarden,et al.  Business Information Visualization , 1999, Commun. Assoc. Inf. Syst..

[18]  Wee Kheng Leow,et al.  Pruned Neural Networks for Regression , 2000, PRICAI.

[19]  D. Campbell Task Complexity: A Review and Analysis , 1988 .

[20]  Bart Baesens,et al.  ITER: An Algorithm for Predictive Regression Rule Extraction , 2006, DaWaK.

[21]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[22]  Akhilesh Chandra,et al.  Representational congruence and information retrieval: Towards an extended model of cognitive fit , 1999, Decis. Support Syst..

[23]  Fei Chen,et al.  LEARNING ACCURATE AND UNDERSTANDABLE RULES FROM SVM CLASSIFIERS , 2004 .

[24]  Iris Vessey,et al.  The Role of Cognitive Fit in the Relationship Between Software Comprehension and Modification , 2006, MIS Q..

[25]  Cheri Speier,et al.  The influence of information presentation formats on complex task decision-making performance , 2006, Int. J. Hum. Comput. Stud..

[26]  Girish H. Subramanian,et al.  A comparison of the decision table and tree , 1992, CACM.

[27]  Ron Weber,et al.  Structured tools and conditional logic: an empirical investigation , 1986, CACM.

[28]  Hennie A. M. Daniels,et al.  Methodological and practical aspects of data mining , 2000, Inf. Manag..

[29]  Huan Liu,et al.  NeuroLinear: From neural networks to oblique decision rules , 1997, Neurocomputing.

[30]  R. Wood Task complexity: Definition of the construct , 1986 .

[31]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[32]  Henry C. Lucas,et al.  An Experimental Investigation of the Use of Computer-Based Graphics in Decision Making , 1981 .

[33]  Geert Wets,et al.  From Decision Tables to Expert System Shells , 1994, Data Knowl. Eng..

[34]  Michael J. Darnell,et al.  Empirical evaluation of decision tables for constructing and comprehending expert system rules , 1992 .

[35]  J. Swait,et al.  The Influence of Task Complexity on Consumer Choice: A Latent Class Model of Decision Strategy Switching , 2001 .

[36]  Robert M. Colomb,et al.  Representation of Propositional Expert Systems as Partial Functions , 1999, Artif. Intell..

[37]  David J. Gilmore,et al.  Comprehension and Recall of Miniature Programs , 1984, Int. J. Man Mach. Stud..

[38]  Guido Dedene,et al.  A Comparison of State-of-The-Art Classification Techniques for Expert Automobile Insurance Claim Fraud Detection , 2002 .

[39]  Izak Benbasat,et al.  An experimental program investigating color-enhanced and graphical information presentation: an integration of the findings , 1986, CACM.

[40]  Patrick Brézillon,et al.  Lecture Notes in Artificial Intelligence , 1999 .

[41]  Gerardine DeSanctis,et al.  Understanding the effectiveness of computer graphics for decision support: a cumulative experimental approach , 1986, CACM.

[42]  Gerardine DeSanctis,et al.  COMPUTER GRAPHICS AS DECISION AIDS: DIRECTIONS FOR RESEARCH* , 1984 .

[43]  Chandrika Kamath,et al.  Inducing oblique decision trees with evolutionary algorithms , 2003, IEEE Trans. Evol. Comput..

[44]  J. Ross Quinlan,et al.  Simplifying Decision Trees , 1987, Int. J. Man Mach. Stud..

[45]  Hsing Kenneth Cheng,et al.  An empirical study of mobile commerce in insurance industry: Task-technology fit and individual differences , 2007, Decis. Support Syst..

[46]  Herbert A. Simon,et al.  Why a Diagram is (Sometimes) Worth Ten Thousand Words , 1987 .

[47]  Alan R. Dennis,et al.  Using Geographical Information Systems for Decision Making: Extending Cognitive Fit Theory to Map-Based Presentations , 1998, Inf. Syst. Res..

[48]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[49]  Peter Clark,et al.  The CN2 Induction Algorithm , 1989, Machine Learning.

[50]  Huimin Zhao,et al.  Incorporating domain knowledge into data mining classifiers: An application in indirect lending , 2008, Decis. Support Syst..

[51]  U. Johansson,et al.  International Conference on Information Fusion ( FUSION ) Automatically Balancing Accuracy and Comprehensibility in Predictive Modeling , 2006 .

[52]  Joachim Diederich,et al.  Survey and critique of techniques for extracting rules from trained artificial neural networks , 1995, Knowl. Based Syst..

[53]  R. P. Halverson An empirical investigation comparing IF-THEN rules and decision tables for programming rule-based expert systems , 1993, [1993] Proceedings of the Twenty-sixth Hawaii International Conference on System Sciences.

[54]  John Sweller,et al.  Cognitive Load During Problem Solving: Effects on Learning , 1988, Cogn. Sci..

[55]  Ron Kohavi,et al.  The Power of Decision Tables , 1995, ECML.

[56]  P. Fayers,et al.  The Visual Display of Quantitative Information , 1990 .

[57]  Izak Benbasat,et al.  Behavioral Aspects of Information Processing for the Design of Management Information Systems , 1982, IEEE Transactions on Systems, Man, and Cybernetics.

[58]  David J. Sheskin,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 1997 .

[59]  Michael J. Pazzani,et al.  Knowledge discovery from data? , 2000, IEEE Intell. Syst..

[60]  Jan Vanthienen,et al.  A More General Comparison of the Decision Table and Tree : A Response , 1992 .

[61]  Cynthia Frownfelter-Lohrke,et al.  The effects of differing information presentations of general purpose financial statements on users' decisions , 1998 .

[62]  Iris Vessey,et al.  The effect of information presentation on decision making: A cost-benefit analysis , 1994, Inf. Manag..

[63]  Joan H. Coll,et al.  Graphs and tables: a four-factor experiment , 1994, CACM.