Research on the Feature Selection Approach Based on IFDS and DPSO With Variable Thresholds in Complex Data Environments

Neighborhood rough model is widely used in feature selection with high dimension, fuzzy, continuous and discrete attributes, incomplete data and so on, and the application of neighborhood rough model depends on neighborhood threshold. In the application of the model, the point-value neighborhood threshold is not adaptive, which leads to low classification accuracy and high time complexity of the algorithm. In order to solve the above problems, a feature selection approach based on IFDS (Incomplete Fuzzy Hybrid Decision System) and DPSO (Discrete Particle Swarm Optimization Algorithm) with variable thresholds is proposed. Firstly, a neighborhood rough model capable of simultaneously processing fuzzy, hybrid and incomplete data was established. The average reachable distance was introduced to construct the attribute neighborhood threshold set and reduce the interference of noise data on classification accuracy. Secondly, we constructed the DPSO particle fitness function using the feature subset length, the significance of the attribute and the negative domain of the neighborhood, and improved the inertia weight computing method, so as to enhance the feature selection speed and the feature subset quality. Finally, the simulation experiment was performed using the real industrial production data. The experiment effect shows that this method has obvious advantages in improving the classification accuracy, optimizing the search speed and the optimal feature subset quality.

[1]  Mateusz Adamczyk Parallel feature selection algorithm based on rough sets and particle swarm optimization , 2014, 2014 Federated Conference on Computer Science and Information Systems.

[2]  Qinghua Hu,et al.  A Fitting Model for Feature Selection With Fuzzy Rough Sets , 2017, IEEE Transactions on Fuzzy Systems.

[3]  Wei-Zhi Wu,et al.  On rule acquisition in incomplete multi-scale decision tables , 2017, Inf. Sci..

[4]  Dazi Li,et al.  A hybrid fault diagnosis method based on fuzzy signed directed graph and neighborhood rough set , 2017, 2017 6th Data Driven Control and Learning Systems (DDCLS).

[5]  Xizhao Wang,et al.  Incremental Perspective for Feature Selection Based on Fuzzy Rough Sets , 2018, IEEE Transactions on Fuzzy Systems.

[6]  Peng Wang,et al.  An improved approach to feature selection , 2010, 2010 International Conference on Machine Learning and Cybernetics.

[7]  Pradipta Maji,et al.  Rough Sets and Stomped Normal Distribution for Simultaneous Segmentation and Bias Field Correction in Brain MR Images , 2015, IEEE Transactions on Image Processing.

[8]  Jianhua Dai,et al.  Rough set approach to incomplete numerical data , 2013, Inf. Sci..

[9]  Tony R. Martinez,et al.  Improved Heterogeneous Distance Functions , 1996, J. Artif. Intell. Res..

[10]  Qinghua Hu,et al.  Neighborhood rough set based heterogeneous feature subset selection , 2008, Inf. Sci..

[11]  Li Na,et al.  Subjective logic application research—Rough set belief rule model and trust evaluation model of the intrusion detection system , 2015 .

[12]  Hua Zhao,et al.  Mixed feature selection in incomplete decision table , 2014, Knowl. Based Syst..

[13]  Yi Pan,et al.  A Parallel Matrix-Based Method for Computing Approximations in Incomplete Information Systems , 2015, IEEE Transactions on Knowledge and Data Engineering.

[14]  Zdzislaw Pawlak,et al.  Rough sets and intelligent data analysis , 2002, Inf. Sci..

[15]  Mengjie Zhang,et al.  Feature selection based on PSO and decision-theoretic rough set model , 2013, 2013 IEEE Congress on Evolutionary Computation.

[16]  Ahmad Taher Azar,et al.  Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis , 2014, Comput. Methods Programs Biomed..

[17]  Wang Guo,et al.  EXTENSION OF ROUGH SET UNDER INCOMPLETE INFORMATION SYSTEMS , 2002 .

[18]  Mingliang Suo,et al.  Fault diagnosis of satellite power system using variable precision fuzzy neighborhood rough set , 2017, 2017 36th Chinese Control Conference (CCC).

[19]  Leslie S. Smith,et al.  Feature subset selection in large dimensionality domains , 2010, Pattern Recognit..

[20]  Xinye Cai,et al.  Neighborhood based decision-theoretic rough set models , 2016, Int. J. Approx. Reason..

[21]  Tao Jing,et al.  Research of Aircraft Integrated Drive Generator fault diagnostic decision based on attribute reduction in rough sets , 2016, 2016 IEEE International Conference on Aircraft Utility Systems (AUS).

[22]  Alexis Tsoukiàs,et al.  On the Extension of Rough Sets under Incomplete Information , 1999, RSFDGrC.

[23]  王璐,et al.  A method for feature selection based on rough sets and ant colonyoptimization algorithm , 2010 .

[24]  Mengjie Zhang,et al.  A Multi-objective Feature Selection Approach Based on Binary PSO and Rough Set Theory , 2013, EvoCOP.

[25]  Guo Wenzhong,et al.  Feature Selection of the Intrusion Detection Data Based on Particle Swarm Optimization and Neighborhood Reduction , 2010 .

[26]  Marzena Kryszkiewicz,et al.  Rough Set Approach to Incomplete Information Systems , 1998, Inf. Sci..

[27]  Ming-Wen Shao,et al.  Feature subset selection based on fuzzy neighborhood rough sets , 2016, Knowl. Based Syst..