论文信息 - Evaluating Performance of Deep Learning Architectures for Image Classification

Evaluating Performance of Deep Learning Architectures for Image Classification

VGGNet is a significantly more accurate CNN architecture that is more recently introduced. There has always been a question of which neural network architecture performs well in which scenario. Thus, using the Fashion MNIST dataset, the performance of VGGNet and CNN deep learning architectures is reviewed, and the metrics are compared. A 3 Layer CNN architecture was used in this work to achieve 98.92% training accuracy and 0.02 training loss and a maximum test accuracy of 90.77% in classifying 10000 images of 10 different types.

[1] K. R. Bindu,et al. An Algorithm for Text Prediction Using Neural Networks , 2018 .

[2] Xiaofeng Wang,et al. Optimizing Filter Size in Convolutional Neural Networks for Facial Action Unit Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[4] R. Karthika,et al. Study of Gabor Wavelet for Face Recognition Invariant to Pose and Orientation , 2016 .

[5] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..

[6] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[7] Roland Vollgraf,et al. Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms , 2017, ArXiv.

[8] Gregory Cohen,et al. EMNIST: Extending MNIST to handwritten letters , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[9] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[10] Maheshkumar H. Kolekar,et al. Classification of fashion article images using convolutional neural networks , 2017, 2017 Fourth International Conference on Image Information Processing (ICIIP).

[11] Hyperparameter Optimization and Regularization on Fashion-MNIST Classification , 2019, International Journal of Recent Technology and Engineering.

[12] Wei Xu,et al. CNN-RNN: A Unified Framework for Multi-label Image Classification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[14] Abien Fred Agarap. Deep Learning using Rectified Linear Units (ReLU) , 2018, ArXiv.

[15] Gaël Varoquaux,et al. The NumPy Array: A Structure for Efficient Numerical Computation , 2011, Computing in Science & Engineering.

[16] S. N. Sivanandam,et al. An Improved Approach for Detecting Car in Video using Neural Network Model , 2012 .