Exploiting Upper Approximation in the Rough Set Methodology

In this paper, we investigate enhancements to an upper classifier - a decision algorithm generated by an upper classification method, which is one of the classification methods in rough set theory. Specifically, we consider two enhancements. First, we present a stepwise backward feature selection algorithm to preprocess a given set of features. This is important because rough classification methods are incapable of removing superfluous features. We prove that the stepwise backward selection algorithm finds a small subset of relevant features that are ideally sufficient and necessary to define target concepts with respect to a given threshold. This threshold value indicates an acceptable degradation in the quality of an upper classifier. Second, to make an upper classifier adaptive, we associate it with some kind of frequency information, which we call incremental information. An extended decision table is used to represent an adaptive upper classifier. It is also used for interpreting an upper classifier either deterministically or nondeterministically.

[1]  Zdzislaw Pawlak,et al.  On learning - a rough set approach , 1984, Symposium on Computation Theory.

[2]  Peter E. Hart,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[3]  Wojciech Ziarko,et al.  The Discovery, Analysis, and Representation of Data Dependencies in Databases , 1991, Knowledge Discovery in Databases.

[4]  Maciej Modrzejewski,et al.  Feature Selection Using Rough Sets Theory , 1993, ECML.

[5]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[6]  DecisionTablesJitender S. DeogunThe,et al.  Rough Set Based Classiication Methods and Extended Decision Tables , 1994 .

[7]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[8]  Alan J. Miller,et al.  Subset Selection in Regression , 1991 .

[9]  Usama M. Fayyad,et al.  The Attribute Selection Problem in Decision Tree Generation , 1992, AAAI.

[10]  A. Atkinson Subset Selection in Regression , 1992 .

[11]  C. Chang Dynamic programming as applied to feature subset selection in a pattern recognition system , 1972, ACM Annual Conference.

[12]  Basabi Chakraborty,et al.  Fuzzy Set Theoretic Measure for Automatic Feature Evaluation , 1986, IEEE Transactions on Systems, Man, and Cybernetics.

[13]  Roman Słowiński,et al.  Rough Classification with Valued Closeness Relation , 1994 .

[14]  Hayri Sever Knowledge structuring for database mining and text retrieval using past optimal queries , 1995 .

[15]  Steven L. Salzberg,et al.  Improving Classification Methods via Feature Selection , 1992 .

[16]  MiningApplicationsVijay V. RaghavanThe The State of Rough Sets for Database , 1995 .

[17]  Robert N. Oddy,et al.  Information Retrieval Research , 1982 .

[18]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[19]  Anne Lohrli Chapman and Hall , 1985 .

[20]  Philip K. Chan,et al.  Systems for Knowledge Discovery in Databases , 1993, IEEE Trans. Knowl. Data Eng..

[21]  Peter Bollmann-Sdorra,et al.  Measurement-theoretical investigation of the MZ-metric , 1980, SIGIR '80.

[22]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Feature Subset Selection , 1977, IEEE Transactions on Computers.