Transductive Feature Selection Using Clustering-Based Sample Entropy for Temperature Prediction in Weather Forecasting

Entropy measures have been a major interest of researchers to measure the information content of a dynamical system. One of the well-known methodologies is sample entropy, which is a model-free approach and can be deployed to measure the information transfer in time series. Sample entropy is based on the conditional entropy where a major concern is the number of past delays in the conditional term. In this study, we deploy a lag-specific conditional entropy to identify the informative past values. Moreover, considering the seasonality structure of data, we propose a clustering-based sample entropy to exploit the temporal information. Clustering-based sample entropy is based on the sample entropy definition while considering the clustering information of the training data and the membership of the test point to the clusters. In this study, we utilize the proposed method for transductive feature selection in black-box weather forecasting and conduct the experiments on minimum and maximum temperature prediction in Brussels for 1–6 days ahead. The results reveal that considering the local structure of the data can improve the feature selection performance. In addition, despite the large reduction in the number of features, the performance is competitive with the case of using all features.

[1]  R. Shah,et al.  Least Squares Support Vector Machines , 2022 .

[2]  Léon Bottou,et al.  Local Learning Algorithms , 1992, Neural Computation.

[3]  Hao Liao,et al.  An efficient semi-supervised representatives feature selection algorithm based on information theory , 2017, Pattern Recognit..

[4]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[5]  Lotfi Lakhal,et al.  A Causality Based Feature Selection Approach for Multivariate Time Series Forecasting , 2017, DBKDA 2017.

[6]  Guoyin Wang,et al.  Online Streaming Feature Selection Based on Conditional Information Entropy , 2017, 2017 IEEE International Conference on Big Knowledge (ICBK).

[7]  J. Richman,et al.  Physiological time-series analysis using approximate entropy and sample entropy. , 2000, American journal of physiology. Heart and circulatory physiology.

[8]  Johan A. K. Suykens,et al.  Multiway Spectral Clustering with Out-of-Sample Extensions through Weighted Kernel PCA , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Eric R. Ziegel,et al.  Geographically Weighted Regression , 2006, Technometrics.

[10]  Giuseppe Baselli,et al.  Conditional entropy approach for the evaluation of the coupling strength , 1999, Biological Cybernetics.

[11]  Jürgen Kurths,et al.  Escaping the curse of dimensionality in estimating multivariate transfer entropy. , 2012, Physical review letters.

[12]  Verónica Bolón-Canedo,et al.  An Information Theory-Based Feature Selection Framework for Big Data Under Apache Spark , 2018, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[13]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[14]  L. Faes,et al.  Information-based detection of nonlinear Granger causality in multivariate processes via a nonuniform embedding technique. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Johan A. K. Suykens,et al.  Approximate Confidence and Prediction Intervals for Least Squares Support Vector Regression , 2011, IEEE Transactions on Neural Networks.

[16]  J. Detre,et al.  Brain Entropy Mapping Using fMRI , 2014, PloS one.

[17]  Luca Faes,et al.  Entropy measures, entropy estimators, and their performance in quantifying complex dynamics: Effects of artifacts, nonstationarity, and long-range correlations. , 2017, Physical review. E.

[18]  Jürgen Kurths,et al.  Statistical Mechanics and Information-Theoretic Perspectives on Complexity in the Earth System , 2013, Entropy.

[19]  Daniele Marinazzo,et al.  Causal Information Approach to Partial Conditioning in Multivariate Data Sets , 2011, Comput. Math. Methods Medicine.

[20]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[21]  Karsten Keller,et al.  Ordinal Patterns, Entropy, and EEG , 2014, Entropy.

[22]  Johan A. K. Suykens,et al.  LS-SVMlab Toolbox User's Guide , 2010 .

[23]  J. Mercer Functions of Positive and Negative Type, and their Connection with the Theory of Integral Equations , 1909 .

[24]  Luca Faes,et al.  Lag-Specific Transfer Entropy as a Tool to Assess Cardiovascular and Cardiorespiratory Information Transfer , 2014, IEEE Transactions on Biomedical Engineering.

[25]  Johan A. K. Suykens,et al.  Moving Least Squares Support Vector Machines for weather temperature prediction , 2017, ESANN.

[26]  W. Ebeling Entropy, information and predictability of evolutionary systems , 1997 .

[27]  Leonidas Sandoval,et al.  Structure of a Global Network of Financial Companies Based on Transfer Entropy , 2014, Entropy.

[28]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[29]  Ginestra Bianconi,et al.  Entropy measures for networks: toward an information theory of complex topologies. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  C. Granger Investigating causal relations by econometric models and cross-spectral methods , 1969 .

[31]  Peter Bauer,et al.  The quiet revolution of numerical weather prediction , 2015, Nature.

[32]  Olivier J. J. Michel,et al.  The relation between Granger causality and directed information theory: a review , 2012, Entropy.

[33]  Ya. G. Sinai,et al.  On the Notion of Entropy of a Dynamical System , 2010 .

[34]  Li Shuangcheng,et al.  Measurement of climate complexity using sample entropy , 2006 .

[35]  Marc M. Van Hulle,et al.  Speeding Up the Wrapper Feature Subset Selection in Regression by Mutual Information Relevance and Redundancy Analysis , 2006, ICANN.

[36]  Johan A. K. Suykens,et al.  Soft kernel spectral clustering , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[37]  Johan A. K. Suykens,et al.  Clustering-based feature selection for black-box weather temperature prediction , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).