Revisiting Long-tailed Image Classification: Survey and Benchmarks with New Evaluation Metrics

Recently, long-tailed image classification harvests lots of research attention, since the data distribution is long-tailed in many real-world situations. Piles of algorithms are devised to address the data imbalance problem by biasing the training process towards less frequent classes. However, they usually evaluate the performance on a balanced testing set or multiple independent testing sets having distinct distributions with the training data. Considering the testing data may have arbitrary distributions, existing evaluation strategies are unable to reflect the actual classification performance objectively. We set up novel evaluation benchmarks based on a series of testing sets with evolving distributions. A corpus of metrics are designed for measuring the accuracy, robustness, and bounds of algorithms for learning with long-tailed distribution. Based on our benchmarks, we re-evaluate the performance of existing methods on CIFAR10 and CIFAR100 datasets, which is valuable for guiding the selection of data rebalancing techniques. We also revisit existing methods and categorize them into four types including data balancing, feature balancing, loss balancing, and prediction balancing, according the focused procedure during the training pipeline.

[1]  Guanbin Li,et al.  Compound Batch Normalization for Long-tailed Image Classification , 2022, ACM Multimedia.

[2]  Dingwen Zhang,et al.  Generalized Weakly Supervised Object Localization , 2022, IEEE Transactions on Neural Networks and Learning Systems.

[3]  Zhenguang Liu,et al.  Invariant Feature Learning for Generalized Long-Tailed Classification , 2022, ECCV.

[4]  Liang Lin,et al.  Double-Check Soft Teacher for Semi-Supervised Object Detection , 2022, IJCAI.

[5]  Dingwen Zhang,et al.  Computer-aided Tuberculosis Diagnosis with Attribute Reasoning Assistance , 2022, MICCAI.

[6]  D. Ramanan,et al.  Long- Tailed Recognition via Weight Balancing , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Q. Hu,et al.  Learning Self-supervised Low-Rank Network for Single-Stage Weakly and Semi-supervised Semantic Segmentation , 2022, International Journal of Computer Vision.

[8]  Xu-tao Lin,et al.  Cross-Level Contrastive Learning and Consistency Constraint for Semi-Supervised Medical Image Segmentation , 2022, 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI).

[9]  Junwei Han,et al.  Scribble-Supervised Video Object Segmentation , 2022, IEEE/CAA Journal of Automatica Sinica.

[10]  P. Indyk,et al.  Targeted Supervised Contrastive Learning for Long-Tailed Recognition , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Yiu-ming Cheung,et al.  Long-tailed Visual Recognition via Gaussian Clouded Logit Adjustment , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Changqing Zhang,et al.  Trustworthy Long-Tailed Classification , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Ming-Hsuan Yang,et al.  Weakly Supervised Object Localization and Detection: A Survey , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Xu Tan,et al.  Adaptive Logit Adjustment Loss for Long-Tailed Visual Recognition , 2021, AAAI.

[15]  Qiang Zhang,et al.  Onfocus detection: identifying individual-camera eye contact from unconstrained images , 2021, Science China Information Sciences.

[16]  Junwei Han,et al.  Weakly Supervised Object Detection Using Proposal- and Semantic-Level Relationships , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Zhengzhuo Xu,et al.  Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective , 2021, NeurIPS.

[18]  Younghan Jeon,et al.  Influence-Balanced Loss for Imbalanced Visual Classification , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[19]  Jenq-Neng Hwang,et al.  ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[20]  Jiaya Jia,et al.  Parametric Contrastive Learning , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[21]  Chaowei Fang,et al.  Deep Transformers For Fast Small Intestine Grounding In Capsule Endoscope Video , 2021, 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI).

[22]  Nick Barnes,et al.  Weakly Supervised Video Salient Object Detection , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Jungong Han,et al.  Cross-modality deep feature learning for brain tumor segmentation , 2021, Pattern Recognit..

[24]  Seungju Han,et al.  Disentangling Label Distribution for Long-tailed Visual Recognition , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Shu Zhang,et al.  Automatic pancreas segmentation based on lightweight DCNN modules and spatial prior propagation , 2020, Pattern Recognit..

[26]  Yizhou Yu,et al.  Contralaterally Enhanced Networks for Thoracic Disease Detection , 2020, IEEE Transactions on Medical Imaging.

[27]  Stella X. Yu,et al.  Long-tailed Recognition by Routing Diverse Distribution-Aware Experts , 2020, ICLR.

[28]  Ankit Singh Rawat,et al.  Long-tail learning via logit adjustment , 2020, ICLR.

[29]  Bryan Hooi,et al.  Test-Agnostic Long-Tailed Recognition by Test-Time Aggregating Diverse Experts with Self-Supervision , 2021, ArXiv.

[30]  Hanwang Zhang,et al.  Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect , 2020, Neural Information Processing Systems.

[31]  Junwei Han,et al.  Exploring Task Structure for Brain Tumor Segmentation From Multi-Modality MR Images , 2020, IEEE Transactions on Image Processing.

[32]  Hongsheng Li,et al.  Balanced Meta-Softmax for Long-Tailed Visual Recognition , 2020, NeurIPS.

[33]  Junwei Han,et al.  SPFTN: A Joint Learning Framework for Localizing and Segmenting Objects in Weakly Labeled Videos , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Xiu-Shen Wei,et al.  BBN: Bilateral-Branch Network With Cumulative Learning for Long-Tailed Visual Recognition , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Saining Xie,et al.  Decoupling Representation and Classifier for Long-Tailed Recognition , 2019, ICLR.

[36]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Colin Wei,et al.  Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss , 2019, NeurIPS.

[38]  Seong Joon Oh,et al.  CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[39]  Stella X. Yu,et al.  Large-Scale Long-Tailed Recognition in an Open World , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Deyu Meng,et al.  Leveraging Prior-Knowledge for Weakly Supervised Object Detection Under a Collaborative Self-Paced Curriculum Learning Framework , 2018, International Journal of Computer Vision.

[41]  Hongyi Zhang,et al.  mixup: Beyond Empirical Risk Minimization , 2017, ICLR.

[42]  Yang Song,et al.  The iNaturalist Species Classification and Detection Dataset , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[43]  Kaiming He,et al.  Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[44]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[46]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[47]  Robert C. Holte,et al.  C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling beats Over-Sampling , 2003 .

[48]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..