ScaleNet-multiscale neural-network architecture for time series prediction

The effectiveness of a multiscale neural-network (NN) architecture for the time series prediction of nonlinear dynamic systems has been investigated. The prediction task is simplified by decomposing different scales of past windows into different scales of wavelets (local frequencies), and predicting the coefficients of each scale of wavelets by means of a separate multilayer perceptron NN. The short-term history (short past windows) is decomposed into the lower scales of wavelet coefficients (high frequencies) which are utilized for "detailed" analysis and prediction, while the long-term history (long past window) is decomposed into higher scales of wavelet coefficients (low frequencies) that are used for the analysis and prediction of slow trends in the time series. These coordinated scales of time and frequency provides an interpretation of the series structures, and more information about the history of the series, using fewer coefficients than other methods. The prediction's results concerning all the different scales of time and frequencies are combined by another "expert" perceptron NN which learns the weight of each scale in the goal-prediction of the original time series. Each network is trained by the backpropagation algorithm using the Levenberg-Marquadt method. The weights and biases are initialized by a new clustering algorithm of the temporal patterns of the time series, which improves the prediction results as compared to random initialization. Three main sets of data were analyzed: the sunspots' benchmark, fluctuations in a farinfrared laser and a nonlinear numerically generated series. Taking the ultimate goal to be the accuracy of the prediction, we found that the suggested multiscale architecture outperforms the corresponding single-scale architectures. The employment of improved learning methods for each of the ScaleNet networks can further improve the prediction results.

[1]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[2]  David E. Rumelhart,et al.  Predicting the Future: a Connectionist Approach , 1990, Int. J. Neural Syst..

[3]  Qinghua Zhang,et al.  Using wavelet network in nonparametric estimation , 1997, IEEE Trans. Neural Networks.

[4]  J. Andrew Ware,et al.  Layered Neural Networks as Universal Approximators , 1997, Fuzzy Days.

[5]  James D. Hamilton Time Series Analysis , 1994 .

[6]  Jukka Saarinen,et al.  Initializing Weights of a Multilayer Perceptron Network by Using the Orthogonal Least Squares Algorithm , 1995, Neural Computation.

[7]  Emile Fiesler,et al.  High-order and multilayer perceptron initialization , 1997, IEEE Trans. Neural Networks.

[8]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[9]  J. J. Moré,et al.  Levenberg--Marquardt algorithm: implementation and theory , 1977 .

[10]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[11]  Bhavik R. Bakshi,et al.  Reasoning in Time: Modeling, Analysis, and Pattern Recognition of Temporal Process Trends , 1995 .

[12]  H. Tong,et al.  Threshold Autoregression, Limit Cycles and Cyclical Data , 1980 .

[13]  Qinghua Zhang,et al.  Wavelet networks , 1992, IEEE Trans. Neural Networks.

[14]  F. Girosi,et al.  Networks for approximation and learning , 1990, Proc. IEEE.

[15]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[16]  Bernard Delyon,et al.  Accuracy analysis for wavelet approximations , 1995, IEEE Trans. Neural Networks.

[17]  Tomaso A. Poggio,et al.  Extensions of a Theory of Networks for Approximation and Learning , 1990, NIPS.

[18]  Isak Gath,et al.  Decision making in electromyography using wavelet-type analysis and fuzzy clustering , 1996, 1996 IEEE International Conference on Systems, Man and Cybernetics. Information Intelligence and Systems (Cat. No.96CH35929).

[19]  Bhavik R. Bakshi,et al.  Wave‐net: a multiresolution, hierarchical neural network with localized learning , 1993 .

[20]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[21]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Thierry Denoeux,et al.  Initializing back propagation networks with prototypes , 1993, Neural Networks.

[23]  Amir B. Geva Unsupervised hierarchical fuzzy clustering methods in forecasting medical events from biomedical signals , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[24]  Andreas S. Weigend,et al.  Time Series Prediction: Forecasting the Future and Understanding the Past , 1994 .

[25]  Isak Gath,et al.  Unsupervised Optimal Fuzzy Clustering , 1989, IEEE Trans. Pattern Anal. Mach. Intell..