Uncertainty-Aware Data Augmentation for Food Recognition

Food recognition has recently attracted the attention of many researchers. However, high food ambiguity, intra-class variability, and inter-class similarity pose a real challenge for deep learning and computer vision algorithms. To improve their performance, it is necessary to better understand what the model learns and, from this, to determine which additional data would be most beneficial to the training procedure. In this paper, we propose a new data augmentation strategy that estimates the epistemic uncertainty and uses it to guide model training. The method follows an active learning framework, in which new synthetic images are generated from the real training images that are hard to classify, as identified by their epistemic uncertainty. Hence, it allows the food recognition algorithm to focus on difficult images in order to learn their discriminative features. At the same time, avoiding data generation from images that do not contribute to recognition makes the procedure faster and more efficient. We show that the proposed method improves food recognition and provides a better trade-off between micro- and macro-recall measures.
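
The selection step described above can be illustrated with a minimal sketch. The abstract does not say how the epistemic uncertainty is estimated, so the code assumes one common choice: the class probabilities come from several stochastic forward passes of the same model (for example with dropout kept active), and the mutual information between prediction and model parameters is used as the score. The function names, the `fraction` parameter, and the toy probabilities are hypothetical and serve illustration only.

```python
import numpy as np


def epistemic_uncertainty(mc_probs):
    """Mutual-information score per image (an assumed uncertainty measure).

    mc_probs: array of shape (T, N, C) holding softmax outputs from
    T stochastic forward passes over N images and C classes.
    """
    eps = 1e-12  # avoids log(0)
    mean_probs = mc_probs.mean(axis=0)  # (N, C)
    # Entropy of the averaged prediction: total predictive uncertainty.
    predictive_entropy = -(mean_probs * np.log(mean_probs + eps)).sum(axis=1)
    # Average entropy of each single pass: the part not due to the model.
    expected_entropy = -(mc_probs * np.log(mc_probs + eps)).sum(axis=2).mean(axis=0)
    # The difference is high when the passes disagree with each other.
    return predictive_entropy - expected_entropy


def select_hard_samples(mc_probs, fraction=0.25):
    """Return indices of the most uncertain images and all scores."""
    scores = epistemic_uncertainty(mc_probs)
    k = max(1, int(round(fraction * scores.shape[0])))
    order = np.argsort(-scores)  # most uncertain first
    return order[:k], scores


# Toy example: 2 passes, 3 images, 2 classes (made-up numbers).
toy_probs = np.array([
    [[0.9, 0.1], [0.9, 0.1], [0.5, 0.5]],  # pass 1
    [[0.9, 0.1], [0.1, 0.9], [0.5, 0.5]],  # pass 2
])
hard_idx, scores = select_hard_samples(toy_probs, fraction=1 / 3)
print(hard_idx, scores)
```

In this toy input only the second image receives a high score, because the two passes disagree on it. The third image is ambiguous in every pass, which this score does not count as epistemic uncertainty. Under the assumptions above, only the selected images would be passed on to the synthetic image generator.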
