Stacked Autoencoder Based Deep Random Vector Functional Link Neural Network for Classification

Extreme learning machine (ELM), which can be viewed as a variant of the Random Vector Functional Link (RVFL) network without the direct input-output connections, has been extensively used to build multi-layer (deep) neural networks. Such networks employ randomization-based autoencoders (AEs) for unsupervised feature extraction, followed by an ELM classifier for the final decision. Each randomization-based AE acts as an independent feature extractor, and a deep network is obtained by stacking several such AEs. Inspired by the better performance of RVFL over ELM, in this paper we propose several deep RVFL variants built on the stacked-autoencoder framework. Specifically, we introduce direct connections (feature reuse) from preceding layers to subsequent layers of the network, as in the original RVFL network. Such connections help regularize the randomization and also reduce model complexity. Furthermore, we introduce a denoising criterion, recovering clean inputs from their corrupted versions, into the autoencoders to learn better higher-level representations than ordinary autoencoders. Extensive experiments on several classification datasets show that our proposed deep networks achieve overall better and faster generalization than other relevant state-of-the-art deep neural networks.
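To make the architecture concrete, the following is a minimal NumPy sketch, not the authors' implementation, of the two building blocks the abstract describes: a randomization-based denoising autoencoder whose output weights are solved in closed form by ridge regression, and an RVFL-style readout that concatenates the raw input with every stacked feature layer (the direct connections / feature reuse). All names (`dae_layer`, `deep_rvfl_fit`), the ReLU activation, the Gaussian corruption noise, and the regularization strength are illustrative assumptions.

```python
# Minimal sketch of a stacked denoising AE + RVFL readout with direct links.
# Hypothetical names and hyperparameters; not the paper's reference code.
import numpy as np

rng = np.random.default_rng(0)

def relu(z):
    return np.maximum(z, 0.0)

def ridge(A, T, lam):
    """Closed-form ridge regression: solve (A^T A + lam*I) W = A^T T."""
    d = A.shape[1]
    return np.linalg.solve(A.T @ A + lam * np.eye(d), A.T @ T)

def dae_layer(X, n_hidden, lam=1e-2, noise=0.1):
    """One randomization-based denoising AE: corrupt X, map the corrupted
    input through a random hidden layer, then recover the clean X by ridge
    regression. The transposed decoder weights act as the learned encoder."""
    X_noisy = X + noise * rng.standard_normal(X.shape)   # corruption
    W = rng.standard_normal((X.shape[1], n_hidden))      # random, untrained
    b = rng.standard_normal(n_hidden)
    H = relu(X_noisy @ W + b)
    beta = ridge(H, X, lam)        # decode hidden features back to clean X
    return relu(X @ beta.T)        # encode X with the learned weights

def deep_rvfl_fit(X, Y, layer_sizes, lam=1e-2):
    """Stack denoising AEs, then solve an RVFL readout over the
    concatenation of the raw input and all stacked features
    (the direct connections / feature reuse from the abstract)."""
    feats, H = [X], X
    for n in layer_sizes:
        H = dae_layer(H, n, lam)
        feats.append(H)
    D = np.hstack(feats)           # direct links: reuse X and every layer
    return ridge(D, Y, lam)        # closed-form output weights

# Toy usage: a 3-class problem with one-hot targets.
X = rng.standard_normal((200, 10))
Y = np.eye(3)[rng.integers(0, 3, 200)]
W_out = deep_rvfl_fit(X, Y, layer_sizes=[64, 64])
```

The closed-form ridge solution is what makes the whole pipeline non-iterative: only the AE decoder weights and the final readout are learned, each in a single linear solve, while the hidden projections stay random.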
