Optimal Convolutional Neural Network Architecture Design Using Clonal Selection Algorithm

In this paper, the Clonal Selection Algorithm (CSA) is implemented to determine the optimal architecture of a convolutional neural network (CNN) for the classification of handwritten character digits. The efficacy of CNN in image recognition is one of the central motives why the world has woken up to the effectiveness of deep learning. During training, an optimal CNN architecture can extract complex features from the data that is being trained; however, the ideal architecture of a CNN for a specific problem cannot be determined by some standard procedure. In practice, CNN architectures are generally designed using human expertise and domain knowledge. By using CSA, optimal architecture of CNN can be determined autonomously through evolution of hyperparameters of the architecture for a given dataset. In this work, proposed methodology is tested on EMNIST dataset which is an enhanced version of MNIST dataset. The results have proven that the CSA based tuning is capable of generating optimal CNN architectures. Through this proposed technique, the best architecture of CNN for a given problem can be selfdetermined without any human intervention.

[1]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[2]  I. Guyon,et al.  Handwritten digit recognition: applications of neural network chips and automatic learning , 1989, IEEE Communications Magazine.

[3]  Ali Al Bataineh,et al.  A Comparative Analysis of Nonlinear Machine Learning Algorithms for Breast Cancer Detection , 2019, International Journal of Machine Learning and Computing.

[4]  Halife Kodaz,et al.  Optimization of job shop scheduling problems using modified clonal selection algorithm , 2014 .

[6]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[7]  Ramesh Raskar,et al.  Designing Neural Network Architectures using Reinforcement Learning , 2016, ICLR.

[8]  J.J. Hopfield,et al.  Artificial neural networks , 1988, IEEE Circuits and Devices Magazine.

[10]  D. Kaur,et al.  A Comparative Study of Different Curve Fitting Algorithms in Artificial Neural Network using Housing Dataset , 2018, NAECON 2018 - IEEE National Aerospace and Electronics Conference.

[11]  Patrick J. Grother,et al.  NIST Special Database 19 Handprinted Forms and Characters Database , 1995 .

[12]  Ahmad Y. Javaid,et al.  Evolving Rule-Based Explainable Artificial Intelligence for Unmanned Aerial Vehicles , 2019, IEEE Access.

[13]  Fernando José Von Zuben,et al.  Immune and Neural Network Models: Theoretical and Empirical Comparisons , 2001, Int. J. Comput. Intell. Appl..

[14]  Jason Brownlee,et al.  Clever Algorithms: Nature-Inspired Programming Recipes , 2012 .

[15]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[16]  Devinder Kaur,et al.  Designing Convolutional Neural Network Architecture Using Genetic Algorithms , 2021, International Journal of Advanced Network, Monitoring and Controls.

[17]  David E. Goldberg,et al.  Genetic algorithms and Machine Learning , 1988, Machine Learning.

[18]  Devinder Kaur,et al.  Enhancing the Parallelization of Backpropagation Neural Network Algorithm for Implementation on FPGA Platform , 2018, NAECON 2018 - IEEE National Aerospace and Electronics Conference.

[19]  Amin Jarrah,et al.  High-Speed FPGA-based of the Particle Swarm Optimization using HLS Tool , 2019, International Journal of Advanced Computer Science and Applications.

[20]  Ali Al Bataineh,et al.  Immuno-Computing-based Neural Learning for Data Classification , 2019, International Journal of Advanced Computer Science and Applications.

[21]  Gregory Cohen,et al.  EMNIST: an extension of MNIST to handwritten letters , 2017, CVPR 2017.