Dropout Probability Estimation in Convolutional Neural Networks by the Enhanced Bat Algorithm

In recent years, deep learning has reached exceptional accomplishment in diverse applications, such as visual and speech recognition, natural language processing. The convolutional neural network represents a particular type of neural network commonly used for the task of digital image classification. A common issue in deep neural network models is the high variance problem, or also called over-fitting. Over-fitting occurs when the model fits well with the training data and fails to generalize on new data. To prevent over-fitting, several regularization methods can be used; one such powerful method is the dropout regularization. To find the optimal value of the dropout rate is a very time-consuming process; hence, we propose a model to find the optimal value by utilizing a metaheuristic algorithm instead of a manual search. In this paper, we propose a hybridized bat algorithm to find the optimal dropout probability rate in a convolutional neural network and compare the results to similar techniques. The experimental results show that the proposed hybrid method overperforms other metaheuristic techniques.

[1]  Xin-She Yang,et al.  Handling dropout probability estimation in convolution neural networks using meta-heuristics , 2018, Soft Comput..

[2]  Milan Tuba,et al.  Convolutional Neural Network Architecture Design by the Tree Growth Algorithm Framework , 2019, 2019 International Joint Conference on Neural Networks (IJCNN).

[3]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[4]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Oscar Castillo,et al.  Comparative Study of Type-2 Fuzzy Particle Swarm, Bee Colony and Bat Algorithms in Optimization of Fuzzy Controllers , 2017, Algorithms.

[6]  Milan Tuba,et al.  Artificial Flora Optimization Algorithm for Task Scheduling in Cloud Computing Environment , 2019, IDEAL.

[7]  Christian Szegedy,et al.  DeepPose: Human Pose Estimation via Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[10]  Milan Tuba,et al.  Resource Scheduling in Cloud Computing Based on a Hybridized Whale Optimization Algorithm , 2019, Applied Sciences.

[11]  Dervis Karaboga,et al.  A modified Artificial Bee Colony (ABC) algorithm for constrained optimization problems , 2011, Appl. Soft Comput..

[12]  Marko Beko,et al.  Designing Convolutional Neural Network Architecture by the Firefly Algorithm , 2019, 2019 International Young Engineers Forum (YEF-ECE).

[13]  Xin-She Yang,et al.  Firefly Algorithms for Multimodal Optimization , 2009, SAGA.

[14]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Milan Tuba,et al.  Optimal Path Planning in Environments with Static Obstacles by Harmony Search Algorithm , 2019 .

[16]  Xin-She Yang,et al.  A New Metaheuristic Bat-Inspired Algorithm , 2010, NICSO.

[17]  Yann LeCun,et al.  Regularization of Neural Networks using DropConnect , 2013, ICML.

[18]  Amir Hossein Gandomi,et al.  Cuckoo search algorithm: a metaheuristic approach to solve structural optimization problems , 2011, Engineering with Computers.

[19]  Milan Tuba,et al.  Hybridized bat algorithm for multi-objective radio frequency identification (RFID) network planning , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).

[20]  Milan Tuba,et al.  Hybridized Elephant Herding Optimization Algorithm for Constrained Optimization , 2017, HIS.

[21]  Marko Beko,et al.  Modified and Hybridized Monarch Butterfly Algorithms for Multi-Objective Optimization , 2018, HIS.

[22]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[23]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[25]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[26]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[28]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[29]  Nebojsa Bacanin,et al.  Artificial Bee Colony Algorithm Hybridized with Firefly Algorithm for Cardinality Constrained Mean-Variance Portfolio Selection Problem , 2014 .

[30]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[31]  A. Ng Feature selection, L1 vs. L2 regularization, and rotational invariance , 2004, Twenty-first international conference on Machine learning - ICML '04.

[32]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Milan Tuba,et al.  Artificial Bee Colony (ABC) Algorithm for Constrained Optimization Improved with Genetic Operators , 2012 .

[34]  Milan Tuba,et al.  Hybridized Particle Swarm Optimization for Constrained Problems , 2019 .

[35]  Milan Tuba,et al.  JPEG quantization tables selection by the firefly algorithm , 2014, 2014 International Conference on Multimedia Computing and Systems (ICMCS).

[36]  Milan Tuba,et al.  Firefly Algorithm for Cardinality Constrained Mean-Variance Portfolio Optimization Problem with Entropy Diversity Constraint , 2014, TheScientificWorldJournal.

[37]  Oscar Castillo,et al.  Modification of the Bat Algorithm using fuzzy logic for dynamical parameter adaptation , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).

[38]  Milan Tuba,et al.  RFID Network Planning by ABC Algorithm Hybridized with Heuristic for Initial Number and Locations of Readers , 2015, 2015 17th UKSim-AMSS International Conference on Modelling and Simulation (UKSim).

[39]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[40]  Oscar Castillo,et al.  Interval type-2 fuzzy logic for dynamic parameter adaptation in the bat algorithm , 2017, Soft Comput..

[41]  Milan Tuba,et al.  Mobile Robot Path Planning by Improved Brain Storm Optimization Algorithm , 2018, 2018 IEEE Congress on Evolutionary Computation (CEC).

[42]  Fei-Fei Li,et al.  Deep visual-semantic alignments for generating image descriptions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[44]  Milan Tuba,et al.  Performance of Elephant Herding Optimization and Tree Growth Algorithm Adapted for Node Localization in Wireless Sensor Networks , 2019, Sensors.

[45]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Milan Tuba,et al.  Constrained Portfolio Optimization by Hybridized Bat Algorithm , 2016, 2016 7th International Conference on Intelligent Systems, Modelling and Simulation (ISMS).