Adapting New Categories for Food Recognition with Deep Representation

Learning to classify new (target) data in a different domain is always an interesting and challenging task in data mining. The classifier could suffer the dataset bias when predicting the new categories from target domain. Many adaptation methods have been proposed to adjust this bias but are limited to using data either from similar categories or requiring a large number of labeled examples from the target domain. Automatically adapting and recognizing new food categories is a very practical task in daily life. In this paper, we propose a new method that can alleviate the dataset bias for food image recognition. To obtain less biased feature representation from the food images, we fine-tuned GoogLeNet as our deep feature extractor and achieve state-of-the-art performance on the Food-101 dataset. Using the deep representation, our method can learn efficient classifiers with fewer labeled examples. More specifically, our method employs an external classifier for adaptation, called "negative classifier".Experiment results show that utilizing the parameters of the negative classifier, our method can achieve better performance and converge faster to adapt the new categories.

[1]  Bolei Zhou,et al.  Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[2]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[3]  Koby Crammer,et al.  Learning Bounds for Domain Adaptation , 2007, NIPS.

[4]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[5]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[6]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[7]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[8]  William-Chandra Tjhi,et al.  Dual Fuzzy-Possibilistic Co-clustering for Document Categorization , 2007 .

[9]  Keiji Yanai,et al.  Automatic Expansion of a Food Image Dataset Leveraging Existing Categories with Domain Adaptation , 2014, ECCV Workshops.

[10]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Bo-Yu Chu,et al.  Warm Start for Parameter Selection of Linear Classifiers , 2015, KDD.

[12]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[13]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[14]  LeCunYann,et al.  Learning Hierarchical Features for Scene Labeling , 2013 .

[15]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[16]  E. Alper Yildirim,et al.  Implementation of warm-start strategies in interior-point methods for linear programming in fixed dimension , 2008, Comput. Optim. Appl..

[17]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[18]  Jitendra Malik,et al.  Analyzing the Performance of Multilayer Neural Networks for Object Recognition , 2014, ECCV.

[19]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[20]  Stephen J. Wright,et al.  Warm-Start Strategies in Interior-Point Methods for Linear Programming , 2002, SIAM J. Optim..

[21]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[22]  Andrew Zisserman,et al.  Tabula rasa: Model transfer for object category detection , 2011, 2011 International Conference on Computer Vision.

[23]  Rong Yan,et al.  Adapting SVM Classifiers to Data with Shifted Distributions , 2007, Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007).

[24]  Koby Crammer,et al.  Analysis of Representations for Domain Adaptation , 2006, NIPS.

[25]  Matthieu Guillaumin,et al.  Food-101 - Mining Discriminative Components with Random Forests , 2014, ECCV.

[26]  Andrew Zisserman,et al.  Return of the Devil in the Details: Delving Deep into Convolutional Nets , 2014, BMVC.

[27]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Alexei A. Efros,et al.  Unsupervised Discovery of Mid-Level Discriminative Patches , 2012, ECCV.

[29]  Manfred Morari,et al.  Real-time suboptimal model predictive control using a combination of explicit MPC and online optimization , 2008, 2008 47th IEEE Conference on Decision and Control.

[30]  Trevor Darrell,et al.  One-Shot Adaptation of Supervised Deep Convolutional Models , 2013, ICLR.

[31]  Trevor Darrell,et al.  Part-Based R-CNNs for Fine-Grained Category Detection , 2014, ECCV.