Feature learning network with transformer for multi-label image classification

[1]  Wei Zhou,et al.  Aligning Image Semantics and Label Concepts for Image Multi-Label Classification , 2022, ACM Transactions on Multimedia Computing, Communications, and Applications.

[2]  Nenggan Zheng,et al.  HAM: Hybrid attention module in deep convolutional neural networks for image classification , 2022, Pattern Recognit..

[3]  Dongdong Li,et al.  Semantic Supplementary Network With Prior Information for Multi-Label Image Classification , 2022, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Xiaoqin Zhang,et al.  SST: Spatial and Semantic Transformers for Multi-Label Image Recognition , 2022, IEEE Transactions on Image Processing.

[5]  Songsen Yu,et al.  A multi-scale semantic attention representation for multi-label image recognition with graph networks , 2022, Neurocomputing.

[6]  B. Baets,et al.  Class-specific discriminative metric learning for scene recognition , 2022, Pattern Recognit..

[7]  Qi Zhao,et al.  A Feature Consistency Driven Attention Erasing Network for Fine-Grained Image Retrieval , 2021, Pattern Recognit..

[8]  Jianxin Wu,et al.  Residual Attention: A Simple but Effective Method for Multi-Label Recognition , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[9]  Jun Zhu,et al.  Query2Label: A Simple Transformer Way to Multi-Label Classification , 2021, ArXiv.

[10]  Xiangyang Xue,et al.  Distance Restricted Transformer Encoder for Multi-Label Classification , 2021, 2021 IEEE International Conference on Multimedia and Expo (ICME).

[11]  Tianjiang Wang,et al.  CE-FPN: enhancing channel information for object detection , 2021, Multimedia Tools and Applications.

[12]  Sheng Huang,et al.  Deep Semantic Dictionary Learning for Multi-label Image Classification , 2020, AAAI.

[13]  Yu Qiao,et al.  Attention-Driven Dynamic Graph Convolutional Network for Multi-label Image Recognition , 2020, ECCV.

[14]  Yanjun Qi,et al.  General Multi-label Image Classification with Transformers , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Ke Zhou,et al.  Fast Graph Convolution Network Based Multi-label Image Recognition via Cross-modal Fusion , 2020, CIKM.

[16]  Hefeng Wu,et al.  Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  C. V. Jawahar,et al.  Recurrent Image Annotation with Explicit Inter-label Dependencies , 2020, ECCV.

[18]  Bin-Bin Gao,et al.  Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition , 2020, IEEE Transactions on Image Processing.

[19]  Shuzhi Sam Ge,et al.  ADCM: attention dropout convolutional module , 2020, Neurocomputing.

[20]  Itamar Friedman,et al.  TResNet: High Performance GPU-Dedicated Architecture , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[21]  Zhanyu Ma,et al.  Dual-attention Guided Dropblock Module for Weakly Supervised Object Localization , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[22]  Bin Fan,et al.  Deep Attention Aware Feature Learning for Person Re-Identification , 2020, Pattern Recognit..

[23]  Piotr Koniusz,et al.  Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors , 2020, ACM Multimedia.

[24]  Sid Ying-Ze Bao,et al.  Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification , 2019, AAAI.

[25]  Weigang Zhang,et al.  Multi-Label Image Classification with Attention Mechanism and Graph Convolutional Networks , 2019, MMAsia.

[26]  Joost van de Weijer,et al.  Orderless Recurrent Models for Multi-Label Classification , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Xiang Bai,et al.  Asymmetric Non-Local Neural Networks for Semantic Segmentation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[28]  Hefeng Wu,et al.  Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[29]  Wu Liu,et al.  DELTA: A deep dual-stream network for multi-label image classification , 2019, Pattern Recognit..

[30]  Xiu-Shen Wei,et al.  Multi-Label Image Recognition with Joint Class-Aware Map Disentangling and Label Correlation Embedding , 2019, 2019 IEEE International Conference on Multimedia and Expo (ICME).

[31]  Du Q. Huynh,et al.  Hallucinating IDT Descriptors and I3D Optical Flow Features for Action Recognition With CNNs , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[32]  Hyunjung Shim,et al.  Attention-Based Dropout Layer for Weakly Supervised Object Localization , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Hao Guo,et al.  Visual Attention Consistency Under Image Transforms for Multi-Label Image Classification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Xiu-Shen Wei,et al.  Multi-Label Image Recognition With Graph Convolutional Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Matthieu Cord,et al.  Exploiting Negative Evidence for Deep Latent Structured Models , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Qi Wu,et al.  Attend and Imagine: Multi-Label Image Classification With Visual Attention and Recurrent Neural Networks , 2019, IEEE Transactions on Multimedia.

[37]  Xiaodong Gu,et al.  Batch DropBlock Network for Person Re-Identification and Beyond , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[38]  Yunchao Wei,et al.  Self-Erasing Network for Integral Object Attention , 2018, NeurIPS.

[39]  Qi Tian,et al.  Beyond Part Models: Person Retrieval with Refined Part Pooling , 2017, ECCV.

[40]  Abhinav Gupta,et al.  Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[41]  Gang Sun,et al.  Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42]  Yi Yang,et al.  Random Erasing Data Augmentation , 2017, AAAI.

[43]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[44]  Nenghai Yu,et al.  Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Serge J. Belongie,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Qi Wu,et al.  Multilabel Image Classification With Regional Latent Semantic Dependencies , 2016, IEEE Transactions on Multimedia.

[47]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[49]  Wei Xu,et al.  CNN-RNN: A Unified Framework for Multi-label Image Classification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[52]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[53]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[54]  Yang Wang,et al.  Joint Input and Output Space Learning for Multi-Label Image Classification , 2021, IEEE Transactions on Multimedia.

[55]  Dongjoo Yun,et al.  Dual aggregated feature pyramid network for multi label classification , 2021, Pattern Recognit. Lett..

[56]  Stephen Lin,et al.  Swin Transformer: Hierarchical Vision Transformer using Shifted Windows , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[57]  Zhuang Miao,et al.  Complemental Attention Multi-Feature Fusion Network for Fine-Grained Classification , 2021, IEEE Signal Processing Letters.

[58]  Changqing Zhang,et al.  Multi-Scale Cross-Modal Spatial Attention Fusion for Multi-label Image Recognition , 2020, ICANN.

[59]  Wei Zhou,et al.  Double Attention for Multi-Label Image Classification , 2020, IEEE Access.

[60]  Weiwei Liu,et al.  Multi-Label Image Classification by Feature Attention Network , 2019, IEEE Access.

[61]  Christopher K. I. Williams,et al.  International Journal of Computer Vision manuscript No. (will be inserted by the editor) The PASCAL Visual Object Classes (VOC) Challenge , 2022 .