Decision-making framework with double-loop learning through interpretable black-box machine learning models

Purpose The purpose of this paper is to address the problem of weak acceptance of machine learning (ML) models in business. The proposed framework of top-performing ML models coupled with general explanation methods provides additional information to the decision-making process. This builds a foundation for sustainable organizational learning. Design/methodology/approach To address user acceptance, participatory approach of action design research (ADR) was chosen. The proposed framework is demonstrated on a B2B sales forecasting process in an organizational setting, following cross-industry standard process for data mining (CRISP-DM) methodology. Findings The provided ML model explanations efficiently support business decision makers, reduce forecasting error for new sales opportunities, and facilitate discussion about the context of opportunities in the sales team. Research limitations/implications The quality and quantity of available data affect the performance of models and explanations. Practical implications The application in the real-world company demonstrates the utility of the approach and provides evidence that transparent explanations of ML models contribute to individual and organizational learning. Social implications All used methods are available as an open-source software and can improve the acceptance of ML in data-driven decision making. Originality/value The proposed framework incorporates existing ML models and general explanation methodology into a decision-making process. To the authors’ knowledge, this is the first attempt to support organizational learning with a framework combining ML explanations, ADR, and data mining methodology based on the CRISP-DM industry standard.

[1]  Sandeep Purao,et al.  Action Design Research , 2011, MIS Q..

[2]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[3]  Ron Kohavi,et al.  The Case against Accuracy Estimation for Comparing Induction Algorithms , 1998, ICML.

[4]  Miroljub Kljajić,et al.  The relevance of facilitation in group decision making supported by a simulation model , 2011 .

[5]  Erik Strumbelj,et al.  An Efficient Explanation of Individual Classifications using Game Theory , 2010, J. Mach. Learn. Res..

[6]  Henrik Jacobsson,et al.  Rule Extraction from Recurrent Neural Networks: ATaxonomy and Review , 2005, Neural Computation.

[7]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[8]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics (e1071), TU Wien , 2014 .

[9]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[10]  Andreas Holzinger,et al.  Interactive machine learning for health informatics: when do we need the human-in-the-loop? , 2016, Brain Informatics.

[11]  C. Argyris On organizational learning , 1993 .

[12]  Hongyuan Zha,et al.  On Machine Learning towards Predictive Sales Pipeline Analytics , 2015, AAAI.

[13]  Erik Strumbelj,et al.  Explaining instance classifications with interactions of subsets of feature values , 2009, Data Knowl. Eng..

[14]  I. Nonaka,et al.  How Japanese Companies Create the Dynamics of Innovation , 1995 .

[15]  Lorin M. Hitt,et al.  Strength in Numbers: How Does Data-Driven Decisionmaking Affect Firm Performance? , 2011, ICIS 2011.

[16]  Jan Lundberg,et al.  Applying Action Design Research (ADR) to Develop Concept Generation and Selection Methods , 2016 .

[17]  Li Xiu,et al.  Application of data mining techniques in customer relationship management: A literature review and classification , 2009, Expert Syst. Appl..

[18]  Michael Lawrence,et al.  The effects of structural characteristics of explanations on use of a DSS , 2006, Decis. Support Syst..

[19]  Vicky Arnold,et al.  The Differential Use and Effect of Knowledge-Based System Explanations in Novice and Expert Judgement Decisions , 2006, MIS Q..

[20]  Tom Fawcett,et al.  Data Science and its Relationship to Big Data and Data-Driven Decision Making , 2013, Big Data.

[21]  Ivan Bratko,et al.  Nomograms for visualizing support vector machines , 2005, KDD '05.

[22]  Jorge A. Alvarado-Valencia,et al.  Reliance, trust and heuristics in judgmental forecasting , 2014, Comput. Hum. Behav..

[23]  Marko Robnik-Sikonja,et al.  Explaining Classifications For Individual Instances , 2008, IEEE Transactions on Knowledge and Data Engineering.

[24]  Igor Kononenko,et al.  Machine Learning and Data Mining: Introduction to Principles and Algorithms , 2007 .

[25]  G. Lilien The B2B Knowledge Gap , 2016 .

[26]  Jamie P. Monat,et al.  Industrial sales lead conversion modeling , 2011 .

[27]  Marcus Mueller,et al.  A Survey of the Application of Machine Learning in Decision Support Systems , 2015, ECIS.

[28]  Dirk Van den Poel,et al.  Model-supported business-to-business prospect prediction based on an iterative customer acquisition framework , 2013 .

[29]  Antanas Verikas,et al.  Mining data with random forests: A survey and results of new tests , 2011, Pattern Recognit..

[30]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[31]  Marko Bohanec,et al.  Explaining machine learning models in sales predictions , 2017, Expert Syst. Appl..

[32]  W. Edwards Deming,et al.  Out of the Crisis , 1982 .

[33]  Anthony J. DiBella,et al.  How Organizations Learn: An Integrated Strategy for Building Learning Capability , 1997 .

[34]  Kesten C. Green,et al.  Golden Rule of Forecasting: Be Conservative , 2015 .

[35]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[36]  Rich Caruana,et al.  An empirical comparison of supervised learning algorithms , 2006, ICML.