Unsupervised Pre-training on Improving the Performance of Neural Network in Regression

This paper empirically analyses the effect of unsupervised pre-training on the predictive performance of an Artificial Neural Network (ANN) in regression tasks. The pre-training used here is the same as the training of a Deep Belief Network, in which the network is formed by stacking Restricted Boltzmann Machines one above the other. A set of experiments is performed to identify the scenarios in which a pre-trained ANN outperforms a randomly initialised one. The results show that the pre-trained model performs better than the randomly initialised ANN in terms of generalisation error and the number of computational units required, and, most importantly, is more robust to changes in hyperparameters such as the learning rate and the model architecture. The only cost is the additional time spent in the pre-training phase. Further, the knowledge acquired during pre-training, which is stored as the weights of the ANN, is analysed using Hinton diagrams. This analysis gives a clear picture of how pre-training captures some of the hidden characteristics of the data.
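For concreteness, the sketch below illustrates the DBN-style procedure described above: greedy layer-wise pre-training with stacked Restricted Boltzmann Machines, followed by supervised fine-tuning of a regression network initialised with the pre-trained weights. This is not the authors' code; the layer sizes, hyperparameters, and toy data are illustrative assumptions, and copying RBM weights into an MLPRegressor via warm start is a workaround rather than an official scikit-learn API.

```python
# Minimal sketch of DBN-style pre-training for regression (illustrative only).
# Assumes inputs scaled to [0, 1] so sklearn's BernoulliRBM applies.
import numpy as np
from sklearn.neural_network import BernoulliRBM, MLPRegressor
from sklearn.preprocessing import MinMaxScaler

rng = np.random.RandomState(0)
X = rng.rand(500, 20)                         # toy inputs (hypothetical data)
y = X @ rng.rand(20) + 0.1 * rng.randn(500)   # toy regression target

X = MinMaxScaler().fit_transform(X)           # RBM visible units expect [0, 1]

# Greedy layer-wise pre-training: each RBM is trained on the hidden-unit
# activations produced by the RBM below it.
rbm1 = BernoulliRBM(n_components=64, learning_rate=0.05, n_iter=20,
                    random_state=0).fit(X)
h1 = rbm1.transform(X)
rbm2 = BernoulliRBM(n_components=32, learning_rate=0.05, n_iter=20,
                    random_state=0).fit(h1)

# Fine-tuning: build an MLP with matching architecture, overwrite its
# randomly initialised weights with the pre-trained RBM weights, then
# continue training with backpropagation on the regression target.
# (The one-iteration fit only allocates the weight matrices; it will
# raise a harmless ConvergenceWarning.)
mlp = MLPRegressor(hidden_layer_sizes=(64, 32), activation='logistic',
                   max_iter=1, warm_start=True, random_state=0)
mlp.fit(X, y)
mlp.coefs_[0] = rbm1.components_.T            # visible-to-hidden weights
mlp.coefs_[1] = rbm2.components_.T            # hidden-to-hidden weights
mlp.intercepts_[0] = rbm1.intercept_hidden_
mlp.intercepts_[1] = rbm2.intercept_hidden_
mlp.max_iter = 200
mlp.fit(X, y)                                 # fine-tune end-to-end
print("train R^2:", mlp.score(X, y))
```

The logistic activation is chosen so that the fine-tuned network matches the sigmoid hidden units of the RBMs; comparing this model against the same MLPRegressor trained from random initialisation would reproduce the kind of comparison the experiments describe.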