Paper information - Automatic identification of commodity label images using lightweight attention network

Recent research has raised interest in applying image classification techniques to automatically identify commodity label images for the business automation of retail enterprises. These techniques can help enterprises improve their service efficiency and realize digital transformation. In this work, we developed a lightweight attention network, MS-DenseNet, that combines a small model size with comparable precision for identifying commodity label images. MS-DenseNet is based on the well-known DenseNet architecture; we replaced the regular planar convolution in the dense blocks with depthwise separable convolution to compress the model size. Further, SE (squeeze-and-excitation) modules were incorporated into the proposed network to highlight useful feature channels while suppressing useless ones, making good use of the interdependencies between channels and maximizing the reuse of inter-channel relations. In addition, a two-stage progressive strategy was adopted in model training. The proposed procedure achieved a significant performance gain, with an average accuracy of 97.60% on the commodity label image identification task and an average accuracy of 94.90% on public datasets. The experimental findings show a substantial performance improvement over existing methods and demonstrate the effectiveness and extensibility of the proposed procedure. Our code is available at https://github.com/xtu502/Automatic-identification-of-commodity-label-images.
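The two building blocks named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not taken from the authors' repository: the function names, the layer sizes (128 input channels, 32 output channels, 3 x 3 kernel), and the toy weights are all assumptions chosen for illustration. The first two functions count weights to show why swapping a regular convolution for a depthwise separable one shrinks a layer; the third applies the standard squeeze-and-excitation recipe (global average pooling, two fully connected layers, sigmoid gate) to a tiny feature map.

```python
import math


def standard_conv_params(c_in, c_out, k):
    """Weights in a regular k x k convolution (bias omitted)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in, c_out, k):
    """Weights in a k x k depthwise conv followed by a 1 x 1 pointwise conv."""
    return k * k * c_in + c_in * c_out


def se_gate(feature_map, w1, w2):
    """Squeeze-and-excitation over a feature map stored as [C][H][W] lists.

    w1 has shape [C_reduced][C]; w2 has shape [C][C_reduced].
    Returns the rescaled feature map and the per-channel scales.
    """
    # Squeeze: global average pooling, one scalar per channel.
    squeezed = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0]))
        for ch in feature_map
    ]
    # Excitation: fully connected -> ReLU -> fully connected -> sigmoid.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    scales = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Rescale: multiply every value in a channel by that channel's scale.
    scaled = [
        [[v * s for v in row] for row in ch]
        for ch, s in zip(feature_map, scales)
    ]
    return scaled, scales


if __name__ == "__main__":
    # Hypothetical layer sizes, not the ones used in MS-DenseNet.
    regular = standard_conv_params(128, 32, 3)          # 36864 weights
    separable = depthwise_separable_params(128, 32, 3)  # 5248 weights
    print(regular, separable, regular / separable)

    # Toy 2-channel, 2 x 2 feature map with hand-picked (hypothetical) weights.
    fm = [[[1.0, 1.0], [1.0, 1.0]], [[2.0, 2.0], [2.0, 2.0]]]
    _, scales = se_gate(fm, w1=[[1.0, 0.0]], w2=[[0.0], [0.0]])
    print(scales)
```

With these assumed sizes the separable layer needs roughly one seventh of the weights of the regular layer. In a trained network the SE weights are learned, so informative channels receive scales near 1 and uninformative ones are pushed toward 0.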
