论文信息 - Feature learning network with transformer for multi-label image classification - 字舞流文

Feature learning network with transformer for multi-label image classification

Hai Hu | Wei Zhou | Tao Su | Pengli Dou | Zhijie Zheng

[1] Wei Zhou,et al. Aligning Image Semantics and Label Concepts for Image Multi-Label Classification , 2022, ACM Transactions on Multimedia Computing, Communications, and Applications.

[2] Nenggan Zheng,et al. HAM: Hybrid attention module in deep convolutional neural networks for image classification , 2022, Pattern Recognit..

[3] Dongdong Li,et al. Semantic Supplementary Network With Prior Information for Multi-Label Image Classification , 2022, IEEE Transactions on Circuits and Systems for Video Technology.

[4] Xiaoqin Zhang,et al. SST: Spatial and Semantic Transformers for Multi-Label Image Recognition , 2022, IEEE Transactions on Image Processing.

[5] Songsen Yu,et al. A multi-scale semantic attention representation for multi-label image recognition with graph networks , 2022, Neurocomputing.

[6] B. Baets,et al. Class-specific discriminative metric learning for scene recognition , 2022, Pattern Recognit..

[7] Qi Zhao,et al. A Feature Consistency Driven Attention Erasing Network for Fine-Grained Image Retrieval , 2021, Pattern Recognit..

[8] Jianxin Wu,et al. Residual Attention: A Simple but Effective Method for Multi-Label Recognition , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[9] Jun Zhu,et al. Query2Label: A Simple Transformer Way to Multi-Label Classification , 2021, ArXiv.

[10] Xiangyang Xue,et al. Distance Restricted Transformer Encoder for Multi-Label Classification , 2021, 2021 IEEE International Conference on Multimedia and Expo (ICME).

[11] Tianjiang Wang,et al. CE-FPN: enhancing channel information for object detection , 2021, Multimedia Tools and Applications.

[12] Sheng Huang,et al. Deep Semantic Dictionary Learning for Multi-label Image Classification , 2020, AAAI.

[13] Yu Qiao,et al. Attention-Driven Dynamic Graph Convolutional Network for Multi-label Image Recognition , 2020, ECCV.

[14] Yanjun Qi,et al. General Multi-label Image Classification with Transformers , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Ke Zhou,et al. Fast Graph Convolution Network Based Multi-label Image Recognition via Cross-modal Fusion , 2020, CIKM.

[16] Hefeng Wu,et al. Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17] C. V. Jawahar,et al. Recurrent Image Annotation with Explicit Inter-label Dependencies , 2020, ECCV.

[18] Bin-Bin Gao,et al. Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition , 2020, IEEE Transactions on Image Processing.

[19] Shuzhi Sam Ge,et al. ADCM: attention dropout convolutional module , 2020, Neurocomputing.

[20] Itamar Friedman,et al. TResNet: High Performance GPU-Dedicated Architecture , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[21] Zhanyu Ma,et al. Dual-attention Guided Dropblock Module for Weakly Supervised Object Localization , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[22] Bin Fan,et al. Deep Attention Aware Feature Learning for Person Re-Identification , 2020, Pattern Recognit..

[23] Piotr Koniusz,et al. Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors , 2020, ACM Multimedia.

[24] Sid Ying-Ze Bao,et al. Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification , 2019, AAAI.

[25] Weigang Zhang,et al. Multi-Label Image Classification with Attention Mechanism and Graph Convolutional Networks , 2019, MMAsia.

[26] Joost van de Weijer,et al. Orderless Recurrent Models for Multi-Label Classification , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27] Xiang Bai,et al. Asymmetric Non-Local Neural Networks for Semantic Segmentation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[28] Hefeng Wu,et al. Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[29] Wu Liu,et al. DELTA: A deep dual-stream network for multi-label image classification , 2019, Pattern Recognit..

[30] Xiu-Shen Wei,et al. Multi-Label Image Recognition with Joint Class-Aware Map Disentangling and Label Correlation Embedding , 2019, 2019 IEEE International Conference on Multimedia and Expo (ICME).

[31] Du Q. Huynh,et al. Hallucinating IDT Descriptors and I3D Optical Flow Features for Action Recognition With CNNs , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[32] Hyunjung Shim,et al. Attention-Based Dropout Layer for Weakly Supervised Object Localization , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[33] Hao Guo,et al. Visual Attention Consistency Under Image Transforms for Multi-Label Image Classification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Xiu-Shen Wei,et al. Multi-Label Image Recognition With Graph Convolutional Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35] Matthieu Cord,et al. Exploiting Negative Evidence for Deep Latent Structured Models , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36] Qi Wu,et al. Attend and Imagine: Multi-Label Image Classification With Visual Attention and Recurrent Neural Networks , 2019, IEEE Transactions on Multimedia.

[37] Xiaodong Gu,et al. Batch DropBlock Network for Person Re-Identification and Beyond , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[38] Yunchao Wei,et al. Self-Erasing Network for Integral Object Attention , 2018, NeurIPS.

[39] Qi Tian,et al. Beyond Part Models: Person Retrieval with Refined Part Pooling , 2017, ECCV.

[40] Abhinav Gupta,et al. Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[41] Gang Sun,et al. Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42] Yi Yang,et al. Random Erasing Data Augmentation , 2017, AAAI.

[43] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[44] Nenghai Yu,et al. Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45] Serge J. Belongie,et al. Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46] Qi Wu,et al. Multilabel Image Classification With Regional Latent Semantic Dependencies , 2016, IEEE Transactions on Multimedia.

[47] Xiaogang Wang,et al. Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48] Abhishek Das,et al. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[49] Wei Xu,et al. CNN-RNN: A Unified Framework for Multi-label Image Classification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[52] Tat-Seng Chua,et al. NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[53] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[54] Yang Wang,et al. Joint Input and Output Space Learning for Multi-Label Image Classification , 2021, IEEE Transactions on Multimedia.

[55] Dongjoo Yun,et al. Dual aggregated feature pyramid network for multi label classification , 2021, Pattern Recognit. Lett..

[56] Stephen Lin,et al. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[57] Zhuang Miao,et al. Complemental Attention Multi-Feature Fusion Network for Fine-Grained Classification , 2021, IEEE Signal Processing Letters.

[58] Changqing Zhang,et al. Multi-Scale Cross-Modal Spatial Attention Fusion for Multi-label Image Recognition , 2020, ICANN.

[59] Wei Zhou,et al. Double Attention for Multi-Label Image Classification , 2020, IEEE Access.

[60] Weiwei Liu,et al. Multi-Label Image Classification by Feature Attention Network , 2019, IEEE Access.

[61] Christopher K. I. Williams,et al. International Journal of Computer Vision manuscript No. (will be inserted by the editor) The PASCAL Visual Object Classes (VOC) Challenge , 2022 .