论文信息 - Learning from time series: Supervised Aggregative Feature Extraction

Learning from time series: Supervised Aggregative Feature Extraction

Many modeling problems require to estimate a scalar output from one or more time series. Such problems are usually tackled by extracting a fixed number of features from the time series (like their statistical moments), with a consequent loss in information that leads to suboptimal predictive models. Moreover, feature extraction techniques usually make assumptions that are not met by real world settings (e.g. uniformly sampled time series of constant length), and fail to deliver a thorough methodology to deal with noisy data. In this paper a methodology based on functional learning is proposed to overcome the aforementioned problems; the proposed Supervised Aggregative Feature Extraction (SAFE) approach allows to derive continuous, smooth estimates of time series data (yielding aggregate local information), while simultaneously estimating a continuous shape function yielding optimal predictions. The SAFE paradigm enjoys several properties like closed form solution, incorporation of first and second order derivative information into the regressor matrix, interpretability of the generated functional predictor and the possibility to exploit Reproducing Kernel Hilbert Spaces setting to yield nonlinear predictive models. Simulation studies are provided to highlight the strengths of the new methodology w.r.t. standard unsupervised feature selection approaches.

Gian Antonio Susto | Simone Pampuri | Andrea Schirru | Seán F. McLoone

[1] Giuseppe De Nicolao,et al. Bayesian Online Multitask Learning of Gaussian Processes , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2] Wen-Lian Hsu,et al. New Challenges for Biological Text-Mining in the Next Decade , 2010, Journal of Computer Science and Technology.

[3] M. Aizerman,et al. Theoretical Foundations of the Potential Function Method in Pattern Recognition Learning , 1964 .

[4] Robert Tibshirani,et al. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[5] László Monostori. AI and machine learning techniques for managing complexity, changes and uncertainties in manufacturing , 2002 .

[6] J. Gurland. Multidimensional Gaussian Distributions (Kenneth S. Miller) , 1966 .

[7] N. Aronszajn. Theory of Reproducing Kernels. , 1950 .

[8] Trevor Hastie,et al. The Elements of Statistical Learning , 2001 .

[9] László Monostori,et al. AI and machine learning techniques for managing complexity, changes and uncertainties in manufacturing , 2003 .

[10] A. Tikhonov. On the stability of inverse problems , 1943 .

[11] Giuseppe De Nicolao,et al. Multilevel Kernel Methods for Virtual Metrology in Semiconductor Manufacturing , 2011 .

[12] W. Rudin. Real and complex analysis , 1968 .