Benchmarking algorithms for food localization and semantic segmentation

The problem of food segmentation is quite challenging since food is characterized by intrinsic high intra-class variability. Also, segmentation of food images taken in-the-wild may be characterized by acquisition artifacts, and that could be problematic for the segmentation algorithms. A proper evaluating of segmentation algorithms is of paramount importance for the design and improvement of food analysis systems that can work in less-than-ideal real scenarios. In this paper, we evaluate the performance of different deep learning-based segmentation algorithms in the context of food. Due to the lack of large-scale food segmentation datasets, we initially create a new dataset composed of 5000 images of 50 diverse food categories. The images are accurately annotated with pixel-wise annotations. In order to test the algorithms under different conditions, the dataset is augmented with the same images but rendered under different acquisition distortions that comprise illuminant change, JPEG compression, Gaussian noise, and Gaussian blur. The final dataset is composed of 120,000 images. Using standard benchmark measures, we conducted extensive experiments to evaluate ten state-of-the-art segmentation algorithms on two tasks: food localization and semantic food segmentation.

[1]  Michele Merler,et al.  Learning to Make Better Mistakes: Semantics-aware Visual Food Recognition , 2016, ACM Multimedia.

[2]  Wen Tang,et al.  MUSEFood: Multi-Sensor-Based Food Volume Estimation on Smartphones , 2019, 2019 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI).

[3]  Matthieu Guillaumin,et al.  Food-101 - Mining Discriminative Components with Random Forests , 2014, ECCV.

[4]  Paolo Napoletano,et al.  Food Recognition: A New Dataset, Experiments, and Results , 2017, IEEE Journal of Biomedical and Health Informatics.

[5]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[6]  Pritee Khanna,et al.  Classification of Food Images through Interactive Image Segmentation , 2018, ACIIDS.

[7]  Camille Roth,et al.  Natural Scales in Geographical Patterns , 2017, Scientific Reports.

[8]  Raimondo Schettini,et al.  Robust smile detection using convolutional neural networks , 2016, J. Electronic Imaging.

[9]  Paolo Napoletano,et al.  Learning CNN-based Features for Retrieval of Food Images , 2017, ICIAP Workshops.

[10]  Petia Radeva,et al.  Simultaneous food localization and recognition , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[11]  Keiji Yanai,et al.  DeepFoodCam: A DCNN-based Real-time Mobile Food Recognition System , 2016, MADiMa @ ACM Multimedia.

[12]  Amaia Salvador,et al.  Learning Cross-Modal Embeddings for Cooking Recipes and Food Images , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Marios Anthimopoulos,et al.  Segmentation and recognition of multi-food meal images for carbohydrate counting , 2013, 13th IEEE International Conference on BioInformatics and BioEngineering.

[14]  Silvia Corchs,et al.  A Multidistortion Database for Image Quality , 2017, CCIW.

[15]  Luis Herranz,et al.  Being a Supercook: Joint Food Attributes and Multimodal Content Modeling for Recipe Retrieval and Exploration , 2017, IEEE Transactions on Multimedia.

[16]  Shuqiang Jiang,et al.  Ingredient-Guided Cascaded Multi-Attention Network for Food Recognition , 2019, ACM Multimedia.

[17]  Marios Anthimopoulos,et al.  Food Image Segmentation for Dietary Assessment , 2016, MADiMa @ ACM Multimedia.

[18]  Siyao Wang,et al.  Mining Discriminative Food Regions for Accurate Food Recognition , 2019, BMVC.

[19]  Vinod Vokkarane,et al.  A New Deep Learning-Based Food Recognition System for Dietary Assessment on An Edge Computing Service Infrastructure , 2018, IEEE Transactions on Services Computing.

[20]  Edward J. Delp,et al.  Multiple Hypotheses Image Segmentation and Classification With Application to Dietary Assessment , 2015, IEEE Journal of Biomedical and Health Informatics.

[21]  Raimondo Schettini,et al.  Semantic Food Segmentation for Automatic Dietary Monitoring , 2018, 2018 IEEE 8th International Conference on Consumer Electronics - Berlin (ICCE-Berlin).

[22]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Linda G. Shapiro,et al.  ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation , 2018, ECCV.

[24]  David S. Ebert,et al.  Personal dietary assessment using mobile devices , 2009, Electronic Imaging.

[25]  Edward J. Delp,et al.  Weakly supervised food image segmentation using class activation maps , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[26]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[27]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[28]  Keiji Yanai,et al.  Automatic Expansion of a Food Image Dataset Leveraging Existing Categories with Domain Adaptation , 2014, ECCV Workshops.

[29]  Raimondo Schettini,et al.  Spatial Sampling Network for Fast Scene Understanding , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[30]  Shuqiang Jiang,et al.  Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition , 2020, IEEE Transactions on Image Processing.

[31]  Paolo Napoletano,et al.  On the Robustness of Color Texture Descriptors across Illuminants , 2013, ICIAP.

[32]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[33]  Sergio Guadarrama,et al.  Im2Calories: Towards an Automated Mobile Vision Food Diary , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[34]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Paolo Napoletano,et al.  IAT - Image Annotation Tool: Manual , 2015, ArXiv.

[36]  Alan C. Bovik,et al.  Massive Online Crowdsourced Study of Subjective and Objective Picture Quality , 2015, IEEE Transactions on Image Processing.

[37]  Chong-Wah Ngo,et al.  Food Photo Recognition for Dietary Tracking: System and Experiment , 2018, MMM.

[38]  Wataru Shimoda,et al.  CNN-Based Food Image Segmentation Without Pixel-Wise Annotation , 2015, ICIAP Workshops.

[39]  Ajay Divakaran,et al.  FoodX-251: A Dataset for Fine-grained Food Classification , 2019, ArXiv.

[40]  Chong-Wah Ngo,et al.  Deep-based Ingredient Recognition for Cooking Recipe Retrieval , 2016, ACM Multimedia.

[41]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[42]  Charless C. Fowlkes,et al.  Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Paolo Napoletano,et al.  Combining local binary patterns and local color contrast for texture classification under varying illumination. , 2014, Journal of the Optical Society of America. A, Optics, image science, and vision.

[44]  Lei Yang,et al.  PFID: Pittsburgh fast-food image dataset , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[45]  Keiji Yanai,et al.  A food image recognition system with Multiple Kernel Learning , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[46]  Davide Mazzini,et al.  Guided Upsampling Network for Real-Time Semantic Segmentation , 2018, BMVC.

[47]  Giovanni Maria Farinella,et al.  Classifying food images represented as Bag of Textons , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[48]  B. Koroušić Seljak,et al.  NutriNet: A Deep Learning Food and Drink Image Recognition System for Dietary Assessment , 2017, Nutrients.

[49]  Iasonas Kokkinos,et al.  Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.

[50]  Gian Luca Foresti,et al.  Wide-Slice Residual Networks for Food Recognition , 2016, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[51]  Marina Meila,et al.  Comparing clusterings: an axiomatic view , 2005, ICML.

[52]  Marios Anthimopoulos,et al.  A Food Recognition System for Diabetic Patients Based on an Optimized Bag-of-Features Model , 2014, IEEE Journal of Biomedical and Health Informatics.

[53]  Raimondo Schettini,et al.  On Comparing Color Spaces for Food Segmentation , 2017, ICIAP Workshops.

[54]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55]  Hongyu Li,et al.  ChinFood1000: A Large Benchmark Dataset for Chinese Food Recognition , 2017, ICIC.

[56]  Ming Ouhyoung,et al.  Automatic Chinese food identification and quantity estimation , 2012, SIGGRAPH Asia Technical Briefs.

[57]  Keiji Yanai,et al.  Image Recognition of 85 Food Categories by Feature Fusion , 2010, 2010 IEEE International Symposium on Multimedia.

[58]  Keiji Yanai,et al.  Recognition of Multiple-Food Images by Detecting Candidate Regions , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[59]  Beatriz Remeseiro,et al.  Grab, Pay, and Eat: Semantic Food Detection for Smart Restaurants , 2018, IEEE Transactions on Multimedia.

[60]  Raimondo Schettini,et al.  Semantic segmentation of food images for automatic dietary monitoring , 2018, 2018 26th Signal Processing and Communications Applications Conference (SIU).

[61]  Edward J. Delp,et al.  cTADA: The Design of a Crowdsourcing Tool for Online Food Image Identification and Segmentation , 2018, 2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI).

[62]  Boguslaw Cyganek,et al.  Image recognition with deep neural networks in presence of noise - Dealing with and taking advantage of distortions , 2017, Integr. Comput. Aided Eng..

[63]  Giovanni Maria Farinella,et al.  A multi-task learning approach for meal assessment , 2018, MADiMa@IJCAI.

[64]  Raimondo Schettini,et al.  How to assess image quality within a workflow chain: an overview , 2014, International Journal on Digital Libraries.

[65]  Yong Rui,et al.  You Are What You Eat: Exploring Rich Recipe Information for Cross-Region Food Analysis , 2018, IEEE Transactions on Multimedia.

[66]  Alan C. Bovik,et al.  A Statistical Evaluation of Recent Full Reference Image Quality Assessment Algorithms , 2006, IEEE Transactions on Image Processing.

[67]  Paolo Napoletano,et al.  CNN-based features for retrieval and classification of food images , 2018, Comput. Vis. Image Underst..

[68]  Petia Radeva,et al.  Regularized uncertainty-based multi-task learning model for food analysis , 2019, J. Vis. Commun. Image Represent..

[69]  Mohammed Ahmed Subhi,et al.  Vision-Based Approaches for Automatic Food Recognition and Dietary Assessment: A Survey , 2019, IEEE Access.

[70]  Makoto Ogawa,et al.  Food Detection and Recognition Using Convolutional Neural Network , 2014, ACM Multimedia.

[71]  Gregory D. Abowd,et al.  Leveraging Context to Support Automated Food Recognition in Restaurants , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[72]  Eduardo Romera,et al.  ERFNet: Efficient Residual Factorized ConvNet for Real-Time Semantic Segmentation , 2018, IEEE Transactions on Intelligent Transportation Systems.

[73]  Ramesh C. Jain,et al.  A Survey on Food Computing , 2018, ACM Comput. Surv..

[74]  Giovanni Maria Farinella,et al.  Retrieval and classification of food images , 2016, Comput. Biol. Medicine.

[75]  Paolo Napoletano,et al.  Food Recognition and Leftover Estimation for Daily Diet Monitoring , 2015, ICIAP Workshops.

[76]  Keiji Yanai,et al.  Multi-task learning of dish detection and calorie estimation , 2018, MADiMa@IJCAI.

[77]  Xin Chen,et al.  ChineseFoodNet: A large-scale Image Dataset for Chinese Food Recognition , 2017, ArXiv.

[78]  Keiji Yanai,et al.  Food image recognition using deep convolutional network with pre-training and fine-tuning , 2015, 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).