Feature Selection for Time Series Modeling

In machine learning, selecting useful features and rejecting redundant features is the prerequisite for better modeling and prediction. In this paper, we first study representative feature selection methods based on correlation analysis, and demonstrate that they do not work well for time series though they can work well for static systems. Then, theoretical analysis for linear time series is carried out to show why they fail. Based on these observations, we propose a new correlation-based feature selection method. Our main idea is that the features highly correlated with progressive response while lowly correlated with other features should be selected, and for groups of selected features with similar residuals, the one with a smaller number of features should be selected. For linear and nonlinear time series, the proposed method yields high accuracy in both feature selection and feature rejection.

[1]  Kashif Javed,et al.  Feature Selection Based on Class-Dependent Densities for High-Dimensional Binary Data , 2012, IEEE Transactions on Knowledge and Data Engineering.

[2]  J. Le Roux,et al.  A fixed point computation of partial correlation coefficients in linear prediction , 1977 .

[3]  李幼升,et al.  Ph , 1989 .

[4]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[5]  Shuzhi Sam Ge,et al.  Chinese Stock Price and Volatility Predictions with Multiple Technical Indicators , 2011 .

[6]  Slobodan Petrovic,et al.  Improving Effectiveness of Intrusion Detection by Correlation Feature Selection , 2010, 2010 International Conference on Availability, Reliability and Security.

[7]  J. L. Roux,et al.  A fixed point computation of partial correlation coefficients , 1977 .

[8]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[9]  Sinisa Todorovic,et al.  Local-Learning-Based Feature Selection for High-Dimensional Data Analysis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  A new bias-compensating least-squares method for identification of stochastic linear systems in presence of coloured noise , 1993, Proceedings of 32nd IEEE Conference on Decision and Control.

[11]  P. A. Blight The Analysis of Time Series: An Introduction , 1991 .

[12]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[13]  Siu-Yeung Cho,et al.  A Spearman correlation coefficient ranking for matching-score fusion on speaker recognition , 2010, TENCON 2010 - 2010 IEEE Region 10 Conference.