Proactive data mining using decision trees

Most of the existing data mining algorithms are `passive'. That is, they produce models which can describe patterns, but leave the decision on how to react to these patterns in the hands of the user. In contrast, in this work we describe a proactive approach to data mining, and describe an implementation of that approach, using decision trees. We show that the proactive role requires the algorithms to consider additional domain knowledge, which is exogenous to the training set. We also suggest a novel splitting criterion, termed maximal-utility, which is driven by the proactive agenda.

[1]  David C. Yen,et al.  Applying data mining to telecom churn management , 2006, Expert Syst. Appl..

[2]  Zengyou He,et al.  Data Mining for Actionable Knowledge: A Survey , 2005, ArXiv.

[3]  Ronen Feldman,et al.  The Data Mining and Knowledge Discovery Handbook , 2005 .

[4]  Chengqi Zhang,et al.  Domain-Driven Actionable Knowledge Discovery in the Real World , 2006, PAKDD.

[5]  Lior Rokach,et al.  Pessimistic cost-sensitive active learning of decision trees for profit maximizing targeting campaigns , 2008, Data Mining and Knowledge Discovery.

[6]  Philip S. Yu,et al.  Domain Driven Data Mining , 2015 .

[7]  Longbing Cao,et al.  Domain-Driven Data Mining: Challenges and Prospects , 2010, IEEE Transactions on Knowledge and Data Engineering.

[8]  Jean-François Boulicaut,et al.  Constraint-based Data Mining , 2005, Data Mining and Knowledge Discovery Handbook.

[9]  Longbing Cao,et al.  Actionable knowledge discovery and delivery , 2012, WIREs Data Mining Knowl. Discov..

[10]  Bhavana Jamalpur,et al.  Data Mining and Business Intelligence , .