Feature selection for multiple binary classification problems

Abstract The present study proposes an unsupervised method for selection of feature subsets, which retain sufficient information for classification purposes. Multiple alternative physically feasible partitions can be dealt with, using the present method. The method is based on an alternative approach to data representation, in which the axes are the data points instead of the features (a transpose projection). Under this representation, coherent features are located in the vicinity of each other, and hence can be clustered, while noisy features are pointed out and eliminated. The method bypasses the “curse of dimensionality” and demonstrates good results in particular in small data sets.

[1]  Mineichi Kudo,et al.  Feature selection based on the structural indices of categories , 1993, Pattern Recognit..

[2]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[3]  Gerard V. Trunk,et al.  A Problem of Dimensionality: A Simple Example , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Rudolf Hanka,et al.  An interpretation of Mahalanobis distance in the dual space , 1982, Pattern Recognit..

[5]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[6]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[7]  A. Wayne Whitney,et al.  A Direct Method of Nonparametric Measurement Selection , 1971, IEEE Transactions on Computers.

[8]  John A. Daly,et al.  Some dual problems in pattern recognition , 1971, Pattern Recognit..

[9]  G. Hughes,et al.  Number of pattern classifier design samples per class (Corresp.) , 1969, IEEE Trans. Inf. Theory.

[10]  William S. Meisel,et al.  Computer-oriented approaches to pattern recognition , 1972 .

[11]  Josef Kittler,et al.  An analysis of the Max-Min approach to feature selection and ordering , 1993, Pattern Recognit. Lett..

[12]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Feature Subset Selection , 1977, IEEE Transactions on Computers.

[13]  Isak Gath,et al.  Unsupervised Optimal Fuzzy Clustering , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  George Nagy,et al.  Feature Extraction on Binary Patterns , 1969, IEEE Trans. Syst. Sci. Cybern..

[15]  Anil K. Jain,et al.  Algorithms for feature selection: An evaluation , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[16]  N. E. Day Estimating the components of a mixture of normal distributions , 1969 .

[17]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[18]  R. Bellman,et al.  V. Adaptive Control Processes , 1964 .

[19]  J. Friedman Regularized Discriminant Analysis , 1989 .

[20]  Selim G. Akl,et al.  Feature selection by interactive clustering , 1981, Pattern Recognit..

[21]  Baozong Yuan,et al.  A more efficient branch and bound algorithm for feature selection , 1993, Pattern Recognit..