A two-stage feature selection method with its application

A two-stage selection algorithm combining IG and BPSO for foreign fiber data.IG can filter noisy features and the BPSO can select the highly discriminating features.The experiments show that our method can find the subsets with small size and high accuracy.The optimum subsets are significant for online detection based on machine vision of foreign fibers in cotton. Foreign fibers in cotton seriously affect the quality of cotton products. Online detection systems of foreign fibers based on machine vision are the efficient tools to minimize the harmful effects of foreign fibers. The optimum feature set with small size and high accuracy can efficiently improve the performance of online detection systems. To find the optimal feature sets, a two-stage feature selection algorithm combining IG (Information Gain) approach and BPSO (Binary Particle Swarm Optimization) is proposed for foreign fiber data. In the first stage, IG approach is used to filter noisy features, and the BPSO uses the classifier accuracy as a fitness function to select the highly discriminating features in the second stage. The proposed algorithm is tested on foreign fiber dataset. The experimental results show that the proposed algorithm can efficiently find the feature subsets with smaller size and higher accuracy than other algorithms. Display Omitted

[1]  Driss Aboutajdine,et al.  A two-stage gene selection scheme utilizing MRMR filter and GA wrapper , 2011, Knowledge and Information Systems.

[2]  Daoliang Li,et al.  Feature selection based on improved ant colony optimization for online detection of foreign fiber in cotton , 2014, Appl. Soft Comput..

[3]  Daoliang Li,et al.  A new approach for image processing in foreign fiber detection , 2009 .

[4]  Stjepan Oreski,et al.  Genetic algorithm-based heuristic for feature selection in credit risk assessment , 2014, Expert Syst. Appl..

[5]  Daoliang Li,et al.  Feature Selection for Cotton Foreign Fiber Objects Based on PSO Algorithm , 2011, CCTA.

[6]  Daoliang Li,et al.  Classification and identification of foreign fibers in cotton on the basis of a support vector machine , 2010, Math. Comput. Model..

[7]  Daoliang Li,et al.  Fast recognition of foreign fibers in cotton lint using machine vision , 2011, Math. Comput. Model..

[8]  Daoliang Li,et al.  An improved genetic algorithm for optimal feature subset selection from multi-character feature set , 2011, Expert Syst. Appl..

[9]  Jordán Pascual Espada,et al.  Machine learning approach for text and document mining , 2014, ArXiv.

[10]  Jin Li,et al.  Feature evaluation and selection with cooperative game theory , 2012, Pattern Recognit..

[11]  Wei-Pang Yang,et al.  Classifier design with feature selection and feature extraction using layered genetic programming , 2008, Expert Syst. Appl..

[12]  Daoliang Li,et al.  Original paper: Classification of foreign fibers in cotton lint using machine vision and multi-class support vector machine , 2010 .

[13]  Huan Liu,et al.  Advancing feature selection research , 2010 .

[14]  Russell C. Eberhart,et al.  A discrete binary version of the particle swarm algorithm , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[15]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Feature Subset Selection , 1977, IEEE Transactions on Computers.

[16]  Samina Khalid,et al.  A survey of feature selection and feature extraction techniques in machine learning , 2014, 2014 Science and Information Conference.

[17]  Xuehua Zhao,et al.  Feature Selection Based on Ant Colony Optimization for Cotton Foreign Fiber , 2011 .