Optimizing the echo state network with a binary particle swarm optimization algorithm

The echo state network (ESN) is a novel and powerful method for the temporal processing of recurrent neural networks. It has tremendous potential for solving a variety of problems, especially real-valued, time-series modeling tasks. However, its complicated topologies and random reservoirs are difficult to implement in practice. For instance, the reservoir must be large enough to capture all data features given that the reservoir is generated randomly. To reduce network complexity and to improve generalization ability, we present a novel optimized ESN (O-ESN) based on binary particle swarm optimization (BPSO). Because the optimization of output weights connection structures is a feature selection problem and PSO has been used as a promising method for feature selection problems, BPSO is employed to determine the optimal connection structures for output weights in the O-ESN. First, we establish and train an ESN with sufficient internal units using training data. The connection structure of output weights, i.e., connection or disconnection, is then optimized through BPSO with validation data. Finally, the performance of the O-ESN is evaluated through test data. This performance is demonstrated in three different types of problems, namely, a system identification and two time-series benchmark tasks. Results show that the O-ESN outperforms the classical feature selection method, least angle regression (LAR) method in that its architecture is simpler than that of LAR.

[1]  H Nezamabadi Pour,et al.  BINARY PARTICLE SWARM OPTIMIZATION: CHALLENGES AND NEW SOLUTIONS , 2008 .

[2]  Meng Joo Er,et al.  NARMAX time series model prediction: feedforward and recurrent fuzzy neural network approaches , 2005, Fuzzy Sets Syst..

[3]  Andreas S. Weigend,et al.  Time Series Prediction: Forecasting the Future and Understanding the Past , 1994 .

[4]  Chia-Feng Juang,et al.  Temperature control by chip-implemented adaptive recurrent fuzzy controller designed by evolutionary algorithm , 2005, IEEE Trans. Circuits Syst. I Regul. Pap..

[5]  Harald Haas,et al.  Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication , 2004, Science.

[6]  Walter Cedeño,et al.  A comparison of particle swarms techniques for the development of quantitative structure-activity relationship models for drug design , 2005, 2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05).

[7]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[8]  Hui-Ming Wee,et al.  Particle swarm optimization for bi-level pricing problems in supply chains , 2011, J. Glob. Optim..

[9]  Andrew Lewis,et al.  Grey Wolf Optimizer , 2014, Adv. Eng. Softw..

[10]  Hans-Ulrich Kobialka,et al.  Echo State Networks with Sparse Output Connections , 2010, ICANN.

[11]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[12]  Leandro dos Santos Coelho,et al.  Binary optimization using hybrid particle swarm optimization and gravitational search algorithm , 2014, Neural Computing and Applications.

[13]  Yunhe Pan,et al.  Particle Swarm Algorithm for Minimal Attribute Reduction of Decision Data Tables , 2006, First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS'06).

[14]  Seyed Mohammad Mirjalili How effective is the Grey Wolf optimizer in training multi-layer perceptrons , 2014, Applied Intelligence.

[15]  Andrew Lewis,et al.  S-shaped versus V-shaped transfer functions for binary Particle Swarm Optimization , 2013, Swarm Evol. Comput..

[16]  Li-Yeh Chuang,et al.  Chaotic maps based on binary particle swarm optimization for feature selection , 2011, Appl. Soft Comput..

[17]  Hendrik Van Brussel,et al.  Pruning and regularization in reservoir computing , 2009, Neurocomputing.

[18]  Li-Yeh Chuang,et al.  Boolean binary particle swarm optimization for feature selection , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[19]  Caihong Li,et al.  Multi-steps prediction of chaotic time series based on echo state network , 2010, 2010 IEEE Fifth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA).

[20]  Min Han,et al.  Support Vector Echo-State Machine for Chaotic Time-Series Prediction , 2007, IEEE Transactions on Neural Networks.

[21]  Hubert Cardot,et al.  A new boosting algorithm for improved time-series forecasting with recurrent neural networks , 2008, Inf. Fusion.

[22]  Leandro N. de Castro,et al.  Data Clustering with Particle Swarms , 2006, 2006 IEEE International Conference on Evolutionary Computation.

[23]  Mahdi Vasighi,et al.  Genetic Algorithms for architecture optimisation of Counter-Propagation Artificial Neural Networks , 2011 .

[24]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[25]  Jochen J. Steil,et al.  Improving reservoirs using intrinsic plasticity , 2008, Neurocomputing.

[26]  J. Kennedy,et al.  Matching algorithms to problems: an experimental test of the particle swarm and some genetic algorithms on the multimodal problem generator , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[27]  Amir F. Atiya,et al.  New results on recurrent network training: unifying the algorithms and accelerating convergence , 2000, IEEE Trans. Neural Networks Learn. Syst..

[28]  Alper Ekrem Murat,et al.  A discrete particle swarm optimization method for feature selection in binary classification problems , 2010, Eur. J. Oper. Res..

[29]  Herbert Jaeger,et al.  Optimization and applications of echo state networks with leaky- integrator neurons , 2007, Neural Networks.

[30]  Andrew Lewis,et al.  How important is a transfer function in discrete heuristic algorithms , 2015, Neural Computing and Applications.

[31]  A. Massa,et al.  An innovative computational approach based on a particle swarm strategy for adaptive phased-arrays control , 2006, IEEE Transactions on Antennas and Propagation.

[32]  Jochen J. Steil,et al.  Online reservoir adaptation by intrinsic plasticity for backpropagation-decorrelation and echo state learning , 2007, Neural Networks.

[33]  L. Glass,et al.  Oscillation and chaos in physiological control systems. , 1977, Science.

[34]  Jang Myung Lee,et al.  Fuzzy Echo State Neural Networks and Funnel Dynamic Surface Control for Prescribed Performance of a Nonlinear Dynamic System , 2014, IEEE Transactions on Industrial Electronics.

[35]  Zehong Yang,et al.  Short-term stock price prediction based on echo state networks , 2009, Expert Syst. Appl..

[36]  Herbert Jaeger,et al.  Reservoir computing approaches to recurrent neural network training , 2009, Comput. Sci. Rev..

[37]  S. Ramachandran,et al.  Entropy based Binary Particle Swarm Optimization and classification for ear detection , 2014, Eng. Appl. Artif. Intell..

[38]  Antonio Luchetta Automatic generation of the optimum threshold for parameter weighted pruning in multiple heterogeneous output neural networks , 2008, Neurocomputing.

[39]  John G. Harris,et al.  Minimum mean squared error time series classification using an echo state network prediction model , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[40]  Seyed Mohammad Mirjalili,et al.  Multi-Verse Optimizer: a nature-inspired algorithm for global optimization , 2015, Neural Computing and Applications.

[41]  Mengjie Zhang,et al.  Single Feature Ranking and Binary Particle Swarm Optimisation Based Feature Subset Ranking for Feature Selection , 2012, ACSC.

[42]  Walter Cedeñto,et al.  Particle swarms for drug design , 2005, 2005 IEEE Congress on Evolutionary Computation.

[43]  Seyed Mohammad Mirjalili,et al.  The Ant Lion Optimizer , 2015, Adv. Eng. Softw..

[44]  Andrew Lewis,et al.  Let a biogeography-based optimizer train your Multi-Layer Perceptron , 2014, Inf. Sci..

[45]  Tao Chen,et al.  Condition prediction of flue gas turbine based on Echo State Network , 2010, 2010 Sixth International Conference on Natural Computation.

[46]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[47]  Benjamin Schrauwen,et al.  Stable Output Feedback in Reservoir Computing Using Ridge Regression , 2008, ICANN.

[48]  Christopher MacLeod,et al.  Incremental growth in modular neural networks , 2009, Eng. Appl. Artif. Intell..

[49]  Amaury Lendasse,et al.  Methodology for long-term prediction of time series , 2007, Neurocomputing.

[50]  Shih-Hung Yang,et al.  An evolutionary constructive and pruning algorithm for artificial neural networks and its prediction applications , 2012, Neurocomputing.

[51]  Seyed Mohammad Mirjalili,et al.  Ions motion algorithm for solving optimization problems , 2015, Appl. Soft Comput..

[52]  Garrison W. Cottrell,et al.  2007 Special Issue: Learning grammatical structure with Echo State Networks , 2007 .

[53]  Shian-Shyong Tseng,et al.  A two-phase feature selection method using both filter and wrapper , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[54]  Christian Jacob,et al.  Making soccer kicks better: a study in particle swarm optimization and evolution strategies , 2005, Congress on Evolutionary Computation.

[55]  Russell C. Eberhart,et al.  A discrete binary version of the particle swarm algorithm , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[56]  Danilo P. Mandic,et al.  An Augmented Echo State Network for Nonlinear Adaptive Filtering of Complex Noncircular Signals , 2011, IEEE Transactions on Neural Networks.

[57]  José Carlos Príncipe,et al.  Analysis and Design of Echo State Networks , 2007, Neural Computation.

[58]  Jürgen Schmidhuber,et al.  Training Recurrent Networks by Evolino , 2007, Neural Computation.

[59]  F. Grimaccia,et al.  Genetical Swarm Optimization: Self-Adaptive Hybrid Evolutionary Algorithm for Electromagnetics , 2007, IEEE Transactions on Antennas and Propagation.

[60]  Zhidong Deng,et al.  Collective Behavior of a Small-World Recurrent Neural System With Scale-Free Distribution , 2007, IEEE Transactions on Neural Networks.

[61]  Xin-She Yang,et al.  Binary bat algorithm , 2013, Neural Computing and Applications.

[62]  Xue-feng Yan,et al.  Reservoir Computing with Sensitivity Analysis Input Scaling Regulation and Redundant Unit Pruning for Modeling Fed-Batch Bioprocesses , 2014 .

[63]  Henry Markram,et al.  Real-Time Computing Without Stable States: A New Framework for Neural Computation Based on Perturbations , 2002, Neural Computation.

[64]  Siti Zaiton Mohd Hashim,et al.  Training feedforward neural networks using hybrid particle swarm optimization and gravitational search algorithm , 2012, Appl. Math. Comput..

[65]  J.J. Steil,et al.  Backpropagation-decorrelation: online recurrent learning with O(N) complexity , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[66]  A. Massa,et al.  PSO-Based Real-Time Control of Planar Uniform Circular Arrays , 2006, IEEE Antennas and Wireless Propagation Letters.

[67]  Alex N. Kalos Modeling MIDI Music as Multivariate Time Series , 2006, 2006 IEEE International Conference on Evolutionary Computation.

[68]  Peng Yu,et al.  Clustered complex echo state networks for traffic forecasting with prior knowledge , 2011, 2011 IEEE International Instrumentation and Measurement Technology Conference.

[69]  Leandro dos Santos Coelho,et al.  Nonlinear System Identification Based on B-Spline Neural Network and Modified Particle Swarm Optimization , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[70]  Zhenfeng He,et al.  Instance selection for time series classification based on immune binary particle swarm optimization , 2013, Knowledge-Based Systems.

[71]  Herbert Jaeger,et al.  Adaptive Nonlinear System Identification with Echo State Networks , 2002, NIPS.

[72]  B. Mozafari,et al.  An improved model for optimal under voltage load shedding: particle swarm approach , 2006, 2006 IEEE Power India Conference.

[73]  Benjamin Schrauwen,et al.  An experimental unification of reservoir computing methods , 2007, Neural Networks.