Inductive discovery of laws using monotonic rules

We are considering knowledge discovery from data describing a piece of real or abstract world. The patterns being induced put in evidence some laws hidden in the data. The most natural representation of patterns-laws is by ''if..., then...'' decision rules relating some conditions with some decisions. The same representation of patterns is used in multi-attribute classification, thus the data searched for discovery of these patterns can be seen as classification data. We adopt the classification perspective to present an original methodology of inducing general laws from data and representing them by so-called monotonic decision rules. Monotonicity concerns relationships between values of condition and decision attributes, e.g. the greater the mass (condition attribute), the greater the gravity (decision attribute), which is a specific feature of decision rules discovered from data using the Dominance-based Rough Set Approach (DRSA). While in DRSA one has to suppose a priori the presence or absence of positive or negative monotonicity relationships which hold in the whole evaluation space, in this paper, we show that DRSA can be adapted to discover rules from any kind of input classification data, exhibiting monotonicity relationships which are unknown a priori and hold in some parts of the evaluation space only. This requires a proper non-invasive transformation of the classification data, permitting representation of both positive and negative monotonicity relationships that are to be discovered by the proposed methodology. Reported results of a computational experiment confirm that the proposed methodology leads to decision rules whose predictive ability is similar to the best classification predictors. It has, however, a unique advantage over all competitors because the monotonic decision rules can be read as laws characterizing the analyzed phenomena in terms of easily understandable ''if..., then...'' decision rules, while other predictor models have no such straightforward interpretation.

[1]  Salvatore Greco,et al.  Rough Sets in Decision Making , 2009, Encyclopedia of Complexity and Systems Science.

[2]  Roman Słowiński,et al.  Sequential covering rule induction algorithm for variable consistency rough set approaches , 2011, Inf. Sci..

[4]  Salvatore Greco,et al.  Variable Consistency Model of Dominance-Based Rough Sets Approach , 2000, Rough Sets and Current Trends in Computing.

[5]  Z. Pawlak,et al.  Rough membership functions , 1994 .

[6]  Salvatore Greco,et al.  Multi-criteria classification - A new scheme for application of dominance-based decision rules , 2007, Eur. J. Oper. Res..

[7]  S. K. Wong,et al.  Comparison of the probabilistic approximate classification and the fuzzy set model , 1987 .

[8]  Robert A. Meyers,et al.  Encyclopedia of Complexity and Systems Science , 2009 .

[9]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[10]  Matthias Ehrgott,et al.  Multiple criteria decision analysis: state of the art surveys , 2005 .

[11]  J. Kacprzyk,et al.  Advances in the Dempster-Shafer theory of evidence , 1994 .

[12]  Sadaaki Miyamoto,et al.  Rough Sets and Current Trends in Computing , 2012, Lecture Notes in Computer Science.

[13]  Igor Kononenko,et al.  Machine Learning and Data Mining: Introduction to Principles and Algorithms , 2007 .

[14]  S. Greco,et al.  Decision Rule Approach , 2005 .

[15]  Salvatore Greco,et al.  Rough sets theory for multicriteria decision analysis , 2001, Eur. J. Oper. Res..

[16]  Andrew McCallum,et al.  Dynamic conditional random fields: factorized probabilistic models for labeling and segmenting sequence data , 2004, J. Mach. Learn. Res..

[17]  C. Zopounidis Operational tools in the management of financial risks , 1997 .

[18]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[19]  Salvatore Greco,et al.  An Algorithm for Induction of Decision Rules Consistent with the Dominance Principle , 2000, Rough Sets and Current Trends in Computing.

[20]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[21]  Roman Słowiński,et al.  The Use of Rough Sets and Fuzzy Sets in MCDM , 1999 .

[22]  Salvatore Greco,et al.  Rough sets methodology for sorting problems in presence of multiple attributes and criteria , 2002, Eur. J. Oper. Res..

[23]  Graham Kendall,et al.  Search Methodologies: Introductory Tutorials in Optimization and Decision Support Techniques , 2013 .

[24]  Salvatore Greco,et al.  Monotonic Variable Consistency Rough Set Approaches , 2009, Int. J. Approx. Reason..

[25]  Roman Słowiński,et al.  A New Rough Set Approach to Evaluation of Bankruptcy Risk , 1998 .

[26]  Salvatore Greco,et al.  Rough Set Based Decision Support , 2005 .

[27]  Salvatore Greco,et al.  Dominance-Based Rough Set Approach as a Proper Way of Handling Graduality in Rough Set Theory , 2007, Trans. Rough Sets.