Mining Unexpected Patterns by Decision Trees with Interestingness Measures

We believe that unexpected, interesting patterns may provide researchers with different visions for future research. In this study, we propose an unexpected pattern mining conceptual model that uses decision trees to compare the recovery rates of two different treatments and to find patterns that contrast with the prior knowledge of domain users. In the proposed model, we define interestingness measures to determine whether the patterns found are interesting to the domain. By applying the concept of domain-driven data mining, we repeatedly utilize decision trees and interestingness measures in a closed-loop, in-depth mining process to find unexpected and interesting patterns. We use retrospective data from transvaginal ultrasound-guided aspirations to show that the proposed model can successfully compare different treatments using a decision tree, which is a new usage of decision trees.

[1]  Zbigniew W. Ras,et al.  Action rules discovery: system DEAR2, method and experiments , 2005, J. Exp. Theor. Artif. Intell..

[2]  A. Nap,et al.  Pathogenesis of endometriosis. , 2004, Best practice & research. Clinical obstetrics & gynaecology.

[3]  G. V. Kass An Exploratory Technique for Investigating Large Quantities of Categorical Data , 1980 .

[4]  H. Kanzaki,et al.  Management of transvaginal ultrasound-guided absolute ethanol sclerotherapy for ovarian endometriotic cysts , 2006, Journal of Medical Ultrasonics.

[5]  Liang-Ming Lo,et al.  Effectiveness of ultrasound-guided aspiration and sclerotherapy with 95% ethanol for treatment of recurrent ovarian endometriomas. , 2009, Fertility and sterility.

[6]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[7]  Philip S. Yu,et al.  D 3 M Methodology , 2010 .

[8]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[9]  David Taniar,et al.  Domain-Driven, Actionable Knowledge Discovery , 2007, IEEE Intelligent Systems.

[10]  David H. Glass,et al.  Confirmation measures of association rule interestingness , 2013, Knowl. Based Syst..

[11]  Alex Alves Freitas,et al.  On rule interestingness measures , 1999, Knowl. Based Syst..

[12]  J Noma,et al.  Efficacy of ethanol sclerotherapy for ovarian endometriomas , 2001, International journal of gynaecology and obstetrics: the official organ of the International Federation of Gynaecology and Obstetrics.

[13]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.

[14]  Qiang Yang,et al.  Mining optimal actions for profitable CRM , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[15]  Yiyu Yao,et al.  A Measurement-Theoretic Foundation of Rule Interestingness Evaluation , 2006, Foundations and Novel Approaches in Data Mining.

[16]  Balaji Padmanabhan,et al.  Unexpectedness as a Measure of Interestingness in Knowledge Discovery , 1999, Decis. Support Syst..

[17]  H. Kafalı,et al.  Management of non‐neoplastic ovarian cysts with sclerotherapy , 2003, International journal of gynaecology and obstetrics: the official organ of the International Federation of Gynaecology and Obstetrics.

[18]  Robert Greb,et al.  ESHRE guideline for the diagnosis and treatment of endometriosis. , 2005, Human reproduction.

[19]  Abraham Silberschatz,et al.  On Subjective Measures of Interestingness in Knowledge Discovery , 1995, KDD.

[20]  Tijl De Bie,et al.  Knowledge discovery interestingness measures based on unexpectedness , 2012, WIREs Data Mining Knowl. Discov..

[21]  Tharam S. Dillon,et al.  Interestingness measures for association rules based on statistical validity , 2011, Knowl. Based Syst..

[22]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[23]  Lingling Zhang,et al.  Research on Domain-Driven Actionable Knowledge Discovery , 2009 .

[24]  Philip S. Yu,et al.  Challenges and Trends , 2010 .

[25]  Chengqi Zhang,et al.  Domain-Driven Actionable Knowledge Discovery in the Real World , 2006, PAKDD.

[26]  Jiawei Han,et al.  Profit Mining: From Patterns to Actions , 2002, EDBT.

[27]  J. Donnez,et al.  Minimally invasive gynecologic procedures , 2011, Current opinion in obstetrics & gynecology.

[28]  Kenneth McGarry,et al.  A survey of interestingness measures for knowledge discovery , 2005, The Knowledge Engineering Review.

[29]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .