Determining Feature Weight of Pattern Classification by Using Rough Genetic Algorithm and Fuzzy Similarity Measure

The nearest neighbor (NN) methods solve classification problem by storing examples as points in a feature space, which requires some means of measuring distances between examples. However, it suffers from the existence of noisy attributes. One resolution is to modify the distance of similarity degree using attribute weights, which can not on]y decrease the influence of noisy attributes, but also subset relevant attributes. In this paper, a rough genetic algorithm (RGA) proposed by Lingras and Davies is applied to the classification problem under an undetermined environment, based on a fuzzy distance function by calculating attribute weights, The RGA can complement the existing tools developed in rough computing. Computational experiments are conducted on benchmark problems, downloaded from UCI machine learning databases. Experimental results, compared with a usual GA[1] and the C4.5 algorithms, verify the efficiency of the developed algorithm. Furthermore, the weights learned by the proposed learning method is applicable to not only fuzzy similarity functions but also any similarity functions. As an application, a new distance metric, weighted discretized value difference metric

[1]  Michio Sugeno,et al.  Fuzzy Measure and Fuzzy Integral , 1972 .

[2]  Stefan Wess,et al.  Using k-d Trees to Improve the Retrieval Step in Case-Based Reasoning , 1993, EWCBR.

[3]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[4]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[5]  Francesco Ricci,et al.  Learning a Local Similarity Metric for Case-Based Reasoning , 1995, ICCBR.

[6]  Chen Li Knowledge Discovery in Database , 1999 .

[7]  Cullen Schaffer,et al.  Selecting a classification method by cross-validation , 1993, Machine Learning.

[8]  Stanley Y. W. Su,et al.  Modeling and Management of Temporal Data in Object-Oriented Knowledge Bases , 1998, Int. J. Artif. Intell. Tools.

[9]  N. Ishii,et al.  An online genetic algorithm for dynamic Steiner tree problem , 2000, 2000 26th Annual Conference of the IEEE Industrial Electronics Society. IECON 2000. 2000 IEEE International Conference on Industrial Electronics, Control and Instrumentation. 21st Century Technologies.

[10]  Jihoon Yang,et al.  Feature Subset Selection Using a Genetic Algorithm , 1998, IEEE Intell. Syst..

[11]  Steven Salzberg,et al.  A Weighted Nearest Neighbor Algorithm for Learning with Symbolic Features , 2004, Machine Learning.

[12]  David L. Waltz,et al.  Toward memory-based reasoning , 1986, CACM.

[13]  Yong Wang,et al.  Learning Feature Weights from Similarity Information , 1998, Int. J. Artif. Intell. Tools.