Optimization of Convolutional Neural Network Using the Linearly Decreasing Weight Particle Swarm Optimization

Convolutional neural network (CNN) is one of the most frequently used deep learning techniques. Various forms of models have been proposed and improved for learning at CNN. When learning with CNN, it is necessary to determine the optimal hyperparameters. However, the number of hyperparameters is so large that it is difficult to do it manually, so much research has been done on automation. A method that uses metaheuristic algorithms is attracting attention in research on hyperparameter optimization. Metaheuristic algorithms are naturally inspired and include evolution strategies, genetic algorithms, antcolony optimization and particle swarm optimization. In particular, particle swarm optimization converges faster than genetic algorithms, and various models have been proposed. In this paper, we propose CNN hyperparameter optimization with linearly decreasing weight particle swarm optimization (LDWPSO). In the experiment, the MNIST data set and CIFAR-10 data set, which are often used as benchmark data sets, are used. By optimizing CNN hyperparameters with LDWPSO, learning the MNIST and CIFAR-10 datasets, we compare the accuracy with a standard CNN based on LeNet-5. As a result, when using the MNIST dataset, the baseline CNN is 94.02% at the 5th epoch, compared to 98.95% for LDWPSO CNN, which improves accuracy. When using the CIFAR-10 dataset, the Baseline CNN is 28.07% at the 10th epoch, compared to 69.37% for the LDWPSO CNN, which greatly improves accuracy.

[1]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2]  Dong Yu,et al.  Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..

[3]  R. Eberhart,et al.  Empirical study of particle swarm optimization , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[4]  R. Eberhart,et al.  Comparing inertia weights and constriction factors in particle swarm optimization , 2000, Proceedings of the 2000 Congress on Evolutionary Computation. CEC00 (Cat. No.00TH8512).

[5]  Yong Feng,et al.  Chaotic Inertia Weight in Particle Swarm Optimization , 2007, Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007).

[6]  Yue Shi,et al.  A modified particle swarm optimizer , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[7]  Mohamad Ivan Fanany,et al.  Metaheuristic Algorithms for Convolution Neural Network , 2016, Comput. Intell. Neurosci..

[8]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[9]  Kotaro Hirasawa,et al.  Genetic algorithm optimization of a convolutional neural network for autonomous crack detection , 2004, Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753).

[10]  João Paulo Papa,et al.  Fine-Tuning Convolutional Neural Networks Using Harmony Search , 2015, CIARP.

[11]  Yong Feng,et al.  Comparing with Chaotic Inertia Weights in Particle Swarm Optimization , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[12]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[13]  Mohamad Ivan Fanany,et al.  Simulated Annealing Algorithm for Deep Learning , 2015 .

[14]  Sabyasachi Pattnaik,et al.  Fast Convergence Particle Swarm Optimization for Functions Optimization , 2012 .

[15]  Wei-Chang Yeh,et al.  New Parameter-Free Simplified Swarm Optimization for Artificial Neural Network Training and its Application in the Prediction of Time Series , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[16]  Wang Hu,et al.  Adaptive Multiobjective Particle Swarm Optimization Based on Parallel Cell Coordinate System , 2015, IEEE Transactions on Evolutionary Computation.

[17]  Anselmo Cardoso de Paiva,et al.  Convolutional neural network-based PSO for lung nodule false positive reduction on CT images , 2018, Comput. Methods Programs Biomed..

[18]  Pu Yunming,et al.  The Genetic Convolutional Neural Network Model Based on Random Sample , 2015 .

[19]  Russell C. Eberhart,et al.  Tracking and optimizing dynamic systems with particle swarms , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).