Rule Induction With Probabilistic Rough Classifiers

We present an algorithm ProbRough for inducing decision rules from data. The algorithm combines all the positive aspects of rule induction systems with the exibility of the probabilistic representation of data and learned knowledge. It accepts imprecise and incomplete data of any mixed qualitative and quantitative type. The probabilistic structure of data is estimated from the learning set using the frequency-based estimators. The results of learning are described in the form of a certain partition of the feature space. The partition elements are of a special geometrical shape, which enables the presentation of the decision rules in a simple form meaningful to humans. Nevertheless, the prediction accuracy is comparable or superior to more sophisticated classiiers. The ProbRough algorithm searches through the set of various partitions using the criterion based on minimizing the misclassiica-tion costs. We illustrate the algorithm behavior on several real-life datasets, acknowledged in the machine learning community.

[1]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[2]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[3]  Peter E. Hart,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[4]  A. Gray,et al.  Retrootting Decision Tree Classiiers Using Kernel Density Estimation , 1995 .

[5]  G. Provan Induction of Selective Bayesian Network Classiiers , 1996 .

[6]  Gregory F. Cooper,et al.  A Bayesian Method for the Induction of Probabilistic Networks from Data , 1992 .

[7]  Herbert A. Simon,et al.  Applications of machine learning and rule induction , 1995, CACM.

[8]  Pat Langley,et al.  Models of Incremental Concept Formation , 1990, Artif. Intell..

[9]  J. Ross Quinlan,et al.  Simplifying Decision Trees , 1987, Int. J. Man Mach. Stud..

[10]  R. Słowiński Intelligent Decision Support: Handbook of Applications and Advances of the Rough Sets Theory , 1992 .

[11]  AlgorithmsCarla E. Brodley The Process of Applying Machine Learning , 1995 .

[12]  Pat Langley,et al.  Induction of Selective Bayesian Classifiers , 1994, UAI.

[13]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[14]  C. J. V. Rijsbergen,et al.  Rough Sets, Fuzzy Sets and Knowledge Discovery , 1994, Workshops in Computing.

[15]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[16]  Alexander G. Gray,et al.  Retrofitting Decision Tree Classifiers Using Kernel Density Estimation , 1995, ICML.

[17]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[18]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.