Long-Tailed Food Classification

Food classification serves as the basic step of image-based dietary assessment to predict the types of foods in each input image. However, food image predictions in a real world scenario are usually long-tail distributed among different food classes, which cause heavy class-imbalance problems and a restricted performance. In addition, none of the existing long-tailed classification methods focus on food data, which can be more challenging due to the lower inter-class and higher intra-class similarity among foods. In this work, we first introduce two new benchmark datasets for long-tailed food classification including Food101-LT and VFN-LT where the number of samples in VFN-LT exhibits the real world long-tailed food distribution. Then we propose a novel 2-Phase framework to address the problem of class-imbalance by (1) undersampling the head classes to remove redundant samples along with maintaining the learned information through knowledge distillation, and (2) oversampling the tail classes by performing visual-aware data augmentation. We show the effectiveness of our method by comparing with existing state-of-the-art long-tailed classification methods and show improved performance on both Food101-LT and VFN-LT benchmarks. The results demonstrate the potential to apply our method to related real life applications.

[1]  E. Delp,et al.  Differences in Dietary Intake Exist among U.S. Adults by Diabetic Status Using NHANES 2009–2016 , 2022, medRxiv.

[2]  F. Zhu,et al.  Exemplar-Free Online Continual Learning , 2022, 2022 IEEE International Conference on Image Processing (ICIP).

[3]  Fengqing M Zhu,et al.  Towards the Creation of a Nutrition and Food Group Based Image Database , 2022, Electronic Imaging.

[4]  J. Choi,et al.  The Majority Can Help the Minority: Context-rich Minority Oversampling for Long-tailed Classification , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Yu-Gang Jiang,et al.  Dynamic Mixup for Multi-Label Long-Tailed Food Ingredient Recognition , 2022, IEEE Transactions on Multimedia.

[6]  Zeman Shao,et al.  An Integrated System for Mobile Image-Based Dietary Assessment , 2021, AIxFood @ ACM Multimedia.

[7]  Younghan Jeon,et al.  Influence-Balanced Loss for Imbalanced Visual Classification , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[8]  Fengqing Zhu,et al.  Improving Dietary Assessment Via Integrated Hierarchy Food Classification , 2021, 2021 IEEE 23rd International Workshop on Multimedia Signal Processing (MMSP).

[9]  Fengqing Zhu,et al.  Online Continual Learning For Visual Food Classification , 2021, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[10]  Fengqing Zhu,et al.  An End-to-End Food Image Analysis System , 2021, Electronic Imaging.

[11]  Fengqing Zhu,et al.  Visual Aware Hierarchy Based Food Recognition , 2020, ICPR Workshops.

[12]  Hongsheng Li,et al.  Balanced Meta-Softmax for Long-Tailed Visual Recognition , 2020, NeurIPS.

[13]  Fengqing Zhu,et al.  Multi-task Image-Based Dietary Assessment for Food Recognition and Portion Size Estimation , 2020, 2020 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[14]  Fengqing Zhu,et al.  Incremental Learning in Online Scenario , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Colin Wei,et al.  Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss , 2019, NeurIPS.

[17]  Seong Joon Oh,et al.  CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[18]  Stella X. Yu,et al.  Large-Scale Long-Tailed Recognition in an Open World , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Atsuto Maki,et al.  A systematic study of the class imbalance problem in convolutional neural networks , 2017, Neural Networks.

[20]  Martial Hebert,et al.  Learning to Model the Tail , 2017, NIPS.

[21]  C. J. Boushey,et al.  New mobile methods for dietary assessment: review of image-assisted and image-based dietary assessment methods , 2016, Proceedings of the Nutrition Society.

[22]  Chen Huang,et al.  Learning Deep Representation for Imbalanced Classification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[25]  Matthieu Guillaumin,et al.  Food-101 - Mining Discriminative Components with Random Forests , 2014, ECCV.

[26]  Max Welling,et al.  Herding dynamical weights to learn , 2009, ICML '09.

[27]  Taghi M. Khoshgoftaar,et al.  Experimental perspectives on learning from imbalanced data , 2007, ICML '07.