Selecting Prototypes in Mixed Incomplete Data

In this paper we introduce a new method for selecting prototypes from objects described by Mixed Incomplete Data (MID), based on an extension of the Nearest Neighbor rule. The extended rule can work with similarity functions that are not necessarily duals of distance functions. The proposed compact set editing method (CSE) constructs a prototype-consistent subset that is also subclass consistent. Experimental results show that CSE has good computational behavior and is effective: it removes around 50% of the prototypes without appreciable loss of accuracy in almost all databases with more than 300 objects.
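To make the idea of a nearest-neighbor rule driven by a similarity function (rather than a distance) more concrete, the following minimal sketch classifies a query by its most similar prototype over mixed, incomplete descriptions. It is not the CSE method and not the similarity function used in the paper: the names `similarity` and `classify`, the use of `None` for missing values, and the numeric tolerance of 1.0 are all assumptions made only for illustration.

```python
from typing import List, Optional, Sequence, Tuple, Union

# A feature value is numeric, categorical (str), or missing (None).
Value = Optional[Union[float, str]]
Prototype = Tuple[Sequence[Value], str]  # (feature values, class label)


def similarity(a: Sequence[Value], b: Sequence[Value]) -> float:
    """Hypothetical similarity: fraction of features on which a and b agree.

    Numeric features agree when they differ by at most 1.0 (an arbitrary
    tolerance); categorical features agree when equal. A feature that is
    missing in either object contributes no agreement.
    """
    agree = 0
    for x, y in zip(a, b):
        if x is None or y is None:
            continue  # missing value: cannot establish agreement
        if isinstance(x, str) or isinstance(y, str):
            agree += int(x == y)
        else:
            agree += int(abs(x - y) <= 1.0)
    return agree / len(a)


def classify(query: Sequence[Value], prototypes: List[Prototype]) -> str:
    """Return the label of the prototype most similar to the query."""
    _, label = max(prototypes, key=lambda p: similarity(query, p[0]))
    return label


prototypes = [
    ((1.0, "red", None), "A"),
    ((5.0, "blue", 2.0), "B"),
]
label = classify((1.5, "red", 2.0), prototypes)
```

Because the rule only compares similarity values, nothing here requires the function to be symmetric or to satisfy the triangle inequality, which is the property the extended rule relies on. A prototype selection method would then aim to shrink `prototypes` while keeping the labels returned by such a rule unchanged on the training objects.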
