A Two-phase Flight Data Feature Selection Method Using both Filter and Wrapper

Feature selection is an important issue in flight data mining. By selecting only relevant features of flight data, higher prediction accuracy can be expected and computational complexity can be reduced. In this paper we propose a novel two-phase flight data feature selection approach using both filter and wrapper. It begins by running artificial neural network weight analysis (ANNWA) as a filter approach to remove irrelevant features, then it runs genetic algorithm as a wrapper approach to remove redundant or useless features. We demonstrate the usefulness of the proposed approach on two real- world datasets based on flight data. Our algorithm reduces the size of flight data feature space significantly without compromising the classification or the prediction performance.

[1]  Andy Neely,et al.  Performance measurement system design: developing and testing a process‐based approach , 2000 .

[2]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[3]  Hua Li,et al.  The study on technologies for feature selection , 2002, Proceedings. International Conference on Machine Learning and Cybernetics.

[4]  Chong-Ho Choi,et al.  Input feature selection for classification problems , 2002, IEEE Trans. Neural Networks.

[5]  Pier Luca Lanzi,et al.  Fast feature selection with genetic algorithms: a filter approach , 1997, Proceedings of 1997 IEEE International Conference on Evolutionary Computation (ICEC '97).

[6]  Chun-Nan Hsu,et al.  The ANNIGMA-wrapper approach to fast feature selection for neural nets , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[7]  Antanas Verikas,et al.  Feature selection with neural networks , 2002, Pattern Recognit. Lett..

[8]  Shian-Shyong Tseng,et al.  A two-phase feature selection method using both filter and wrapper , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[9]  Krzysztof Michalak,et al.  Correlation-based Feature Selection Strategy in Neural Classification , 2006, Sixth International Conference on Intelligent Systems Design and Applications.