Classification System with Capability to Reject Unknowns

In this paper, we propose a novel method for object classification with capability to reject unknown inputs. In the real world application such as an image-recognition-based checkout system, it is crucial to reject unknown inputs while correctly classifying registered objects. Conventional deep-learning-based classification systems with softmax output suffer from overconfident score on unknown objects. We tackled the problem by the following two approaches. First, we incorporated a metric-learning-based method proposed for face verification into object classification. Second, we utilize available unregistered objects (known unknowns) in the training phase by proposing a novel “Margined Unknown Loss”. In the experiment, we showed the effectiveness of the proposed method by confirming that it outperformed conventional softmax-based approaches which also use the known unknowns, on two datasets, MNIST dataset and a retail product dataset, in terms of Recall at a low false positive rate.

[1]  Asaf Tzadok,et al.  Fine-Grained Recognition of Thousands of Object Categories with Single-Example Training , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Stefanos Zafeiriou,et al.  ArcFace: Additive Angular Margin Loss for Deep Face Recognition , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Soma Shiraishi,et al.  Robust Multi-Object Detection Based on Data Augmentation with Realistic Image Synthesis for Point-of-Sale Automation , 2019, AAAI.

[4]  Bhiksha Raj,et al.  SphereFace: Deep Hypersphere Embedding for Face Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[6]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Yann LeCun,et al.  Learning a similarity metric discriminatively, with application to face verification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  Jian Cheng,et al.  Additive Margin Softmax for Face Verification , 2018, IEEE Signal Processing Letters.

[9]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Nir Ailon,et al.  Deep Metric Learning Using Triplet Network , 2014, SIMBAD.

[11]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[13]  G. Griffin,et al.  Caltech-256 Object Category Dataset , 2007 .

[14]  Xing Ji,et al.  CosFace: Large Margin Cosine Loss for Deep Face Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[15]  Kota Iwamoto,et al.  Robust Shelf Monitoring Using Supervised Learning for Improving On-Shelf Availability in Retail Stores † , 2019, Sensors.

[16]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[17]  Kibok Lee,et al.  Training Confidence-calibrated Classifiers for Detecting Out-of-Distribution Samples , 2017, ICLR.

[18]  Terrance E. Boult,et al.  Reducing Network Agnostophobia , 2018, NeurIPS.