A Unified Object Detection Framework for Intelligent Retail Container Commodities

Motivated by commodity detection in retail, this paper studies the problem of generating object bounding boxes in retail images. The existing object detection algorithms are not suitable due to the characteristics of the intensive distribution of commodities in intelligent retail containers. Therefore, we present a unified object detection framework used for dense scenarios in retail images, which consists of the hierarchical labeling pattern, the Similarity Recognition Network (SRN) subnetwork and the optimized NMS algorithm on the basis of YOLOv3. Three datasets are built to verify the effectiveness of the proposed framework, two of which are used for training the object detection network, and the other one is used for training the SRN subnetwork. The experimental results show that the proposed framework achieves a significant improvement for dense scenario in retail images compared with the traditional object detection algorithms.

[1]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[2]  Yuning Jiang,et al.  Repulsion Loss: Detecting Pedestrians in a Crowd , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Bernt Schiele,et al.  What Makes for Effective Detection Proposals? , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[6]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[8]  Larry S. Davis,et al.  Soft-NMS — Improving Object Detection with One Line of Code , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[9]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Chenxi Liu,et al.  ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[11]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[15]  Bernt Schiele,et al.  How good are detection proposals, really? , 2014, BMVC.

[16]  Emanuele Frontoni,et al.  Customers’ activity recognition in intelligent retail environments , 2013 .

[17]  Bernt Schiele,et al.  A Convnet for Non-maximum Suppression , 2015, GCPR.

[18]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[20]  Kaiming He,et al.  Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[21]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[22]  Bernt Schiele,et al.  Learning Non-maximum Suppression , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Samuel Fosso Wamba,et al.  Enabling intelligent B-to-B eCommerce supply chain management using RFID and the EPC network: a case study in the retail industry , 2006, ICEC '06.

[24]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[25]  Vittorio Ferrari,et al.  End-to-End Training of Object Class Detectors for Mean Average Precision , 2016, ACCV.