论文信息 - Dual Cross-Entropy Loss for Small-Sample Fine-Grained Vehicle Classification

Dual Cross-Entropy Loss for Small-Sample Fine-Grained Vehicle Classification

Fine-grained vehicle classification is a challenging topic in computer vision due to the high intraclass variance and low interclass variance. Recently, considerable progress has been made in fine-grained vehicle classification due to the huge success of deep neural networks. Most studies of fine-grained vehicle classification based on neural networks, focus on the neural network structure to improve the classification performance. In contrast to existing works on fine-grained vehicle classification, we focus on the loss function of the neural network. We add a regularization term to the cross-entropy loss and propose a new loss function, Dual Cross-Entropy Loss. The regularization term places a constraint on the probability that a data point is assigned to a class other than its ground-truth class, which can alleviate the vanishing of the gradient when the value of the cross-entropy loss is close to zero. To demonstrate the effectiveness of our loss function, we perform two sets of experiments. The first set is conducted on a small-sample fine-grained vehicle classification dataset, the Stanford Cars-196 dataset. The second set is conducted on two small-sample datasets, the LabelMe dataset and the UIUC-Sports dataset, as well as on one large-sample dataset, the CIFAR-10 dataset. The experimental results show that the proposed loss function improves the fine-grained vehicle classification performance and has good performance on three other general image classification tasks.

[1] Thierry Chateau,et al. Deep MANTA: A Coarse-to-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis from Monocular Image , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Yu Qiao,et al. A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[3] Jun Guo,et al. Variational Bayesian Learning for Dirichlet Process Mixture of Inverted Dirichlet Distributions in Non-Gaussian Image Feature Modeling , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[4] Nei Kato,et al. State-of-the-Art Deep Learning: Evolving Machine Intelligence Toward Tomorrow’s Intelligent Network Traffic Control Systems , 2017, IEEE Communications Surveys & Tutorials.

[5] Xiaoou Tang,et al. A large-scale car dataset for fine-grained categorization and verification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6] Fei-Yue Wang,et al. MFR-CNN: Incorporating Multi-Scale Features and Global Information for Traffic Object Detection , 2018, IEEE Transactions on Vehicular Technology.

[7] Subhransu Maji,et al. Bilinear CNN Models for Fine-Grained Visual Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[8] Hamid Hassanpour,et al. A Cascaded Part-Based System for Fine-Grained Vehicle Classification , 2018, IEEE Transactions on Intelligent Transportation Systems.

[9] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..

[11] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[12] Adam Herout,et al. BoxCars: 3D Boxes as CNN Input for Improved Fine-Grained Vehicle Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Tim Salimans,et al. Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks , 2016, NIPS.

[14] Bo Zhao,et al. Diversified Visual Attention Networks for Fine-Grained Object Classification , 2016, IEEE Transactions on Multimedia.

[15] Meng Yang,et al. Large-Margin Softmax Loss for Convolutional Neural Networks , 2016, ICML.

[16] Fei-Fei Li,et al. What, where and who? Classifying events by scene and object recognition , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[17] Xiaojun Wu,et al. Selective Multi-Convolutional Region Feature Extraction based Iterative Discrimination CNN for Fine-Grained Vehicle Model Recognition , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[18] Kaiming He,et al. Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[19] Derrick Liu. Monza : Image Classification of Vehicle Make and Model Using Convolutional Neural Networks and Transfer Learning , .

[20] Xiaogang Wang,et al. Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-identification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[21] Gang Wang,et al. Video tracking using learned hierarchical features. , 2015, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.

[22] Muhammed Gökhan Cinsdikici,et al. Vehicle-Classification Algorithm Based on Component Analysis for Single-Loop Inductive Detector , 2010, IEEE Transactions on Vehicular Technology.

[23] Antonio Torralba,et al. Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[24] Jun Guo,et al. Cross-modal subspace learning for sketch-based image retrieval: A comparative study , 2016, 2016 IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC).

[25] Xiaogang Wang,et al. Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-Temporal Path Proposals , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[26] Doo-Kwon Baik,et al. Vehicle-Classification Algorithm for Single-Loop Detectors Using Neural Networks , 2006, IEEE Transactions on Vehicular Technology.

[27] Honggang Zhang,et al. Variational Bayesian Matrix Factorization for Bounded Support Data , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28] Nei Kato,et al. A Handwritten Character Recognition System Using Directional Element Feature and Asymmetric Mahalanobis Distance , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[29] Jürgen Beyerer,et al. A systematic evaluation of recent deep learning architectures for fine-grained vehicle classification , 2018, Defense + Security.

[30] Nei Kato,et al. The Deep Learning Vision for Heterogeneous Network Traffic Control: Proposal, Challenges, and Future Perspective , 2017, IEEE Wireless Communications.

[31] Yoshua Bengio,et al. How transferable are features in deep neural networks? , 2014, NIPS.

[32] Junmo Kim,et al. Active Convolution: Learning the Shape of Convolution for Image Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33] Matthieu Cord,et al. WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34] René Vidal,et al. Car Segmentation and Pose Estimation using 3D Object Models , 2015, ArXiv.

[35] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[36] Ning Wang,et al. Vehicle Distributions in Large and Small Cities: Spatial Models and Applications , 2018, IEEE Transactions on Vehicular Technology.

[37] Huibing Wang,et al. Deep CNNs With Spatially Weighted Pooling for Fine-Grained Car Recognition , 2017, IEEE Transactions on Intelligent Transportation Systems.

[38] Xu Chen,et al. Car detection and classification using cascade model , 2018, IET Intelligent Transport Systems.

[39] Jae Wook Jeon,et al. A Fast Evolutionary Algorithm for Real-Time Vehicle Detection , 2013, IEEE Transactions on Vehicular Technology.

[40] Jun Guo,et al. The Role of Data Analysis in the Development of Intelligent Energy Networks , 2017, IEEE Network.

[41] Sergio L. Toral Marín,et al. An Enhanced Background Estimation Algorithm for Vehicle Detection in Urban Traffic Scenes , 2010, IEEE Transactions on Vehicular Technology.

[42] Markus Flierl,et al. Bayesian estimation of Dirichlet mixture model with variational inference , 2014, Pattern Recognit..

[43] Tiejun Huang,et al. Deep Relative Distance Learning: Tell the Difference between Similar Vehicles , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[45] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[46] Ngai-Man Cheung,et al. Image-based vehicle analysis using deep neural network: A systematic study , 2016, 2016 IEEE International Conference on Digital Signal Processing (DSP).

[47] Arne Leijon,et al. Bayesian Estimation of Beta Mixture Models with Variational Inference , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.