Multi-level Alignment Network for Domain Adaptive Cross-modal Retrieval
暂无分享,去创建一个
Shouling Ji | Jianfeng Dong | Yuan He | Zhongzi Long | Changting Lin | Xiaofeng Mao | S. Ji | Jianfeng Dong | Yuan He | Changting Lin | Xiaofeng Mao | Zhongzi Long
[1] Biyao Shao,et al. 3D Room Layout Estimation From a Single RGB Image , 2020, IEEE Transactions on Multimedia.
[2] Ioannis Patras,et al. Iti - Certh In Trecvid 2016 Ad - Hoc Video Search (Avs) , 2016 .
[3] Yiannis Kompatsiaris,et al. ITI-CERTH participation in TRECVID 2018 , 2017, TRECVID.
[4] Ivan Laptev,et al. HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[5] Tao Mei,et al. MSR-VTT: A Large Video Description Dataset for Bridging Video and Language , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Alexandros Iosifidis,et al. Supervised Domain Adaptation using Graph Embedding , 2020, ArXiv.
[7] Xirong Li,et al. SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries , 2020, IEEE Transactions on Multimedia.
[8] Diane J. Cook,et al. A Survey of Unsupervised Deep Domain Adaptation , 2018, ACM Trans. Intell. Syst. Technol..
[9] Xirong Li,et al. Cross-Media Similarity Evaluation for Web Image Retrieval in the Wild , 2017, IEEE Transactions on Multimedia.
[10] Donald A. Adjeroh,et al. Unified Deep Supervised Domain Adaptation and Generalization , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[11] Duy-Dinh Le,et al. NII-HITACHI-UIT at TRECVID 2017 , 2016, TRECVID.
[12] Xiaogang Wang,et al. DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Michael I. Jordan,et al. Deep Transfer Learning with Joint Adaptation Networks , 2016, ICML.
[14] Bo Wang,et al. Moment Matching for Multi-Source Domain Adaptation , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[15] Rita Cucchiara,et al. Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach , 2018, ECCV Workshops.
[16] Yang Liu,et al. Use What You Have: Video retrieval using representations from collaborative experts , 2019, BMVC.
[17] Yue Gao,et al. Deep Multi-View Enhancement Hashing for Image Retrieval , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[18] Zhimin Zeng,et al. Exploiting Visual Semantic Reasoning for Video-Text Retrieval , 2020, IJCAI.
[19] Tatsuya Harada,et al. Maximum Classifier Discrepancy for Unsupervised Domain Adaptation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[20] Xin Wang,et al. VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[21] Michael I. Jordan,et al. Learning Transferable Features with Deep Adaptation Networks , 2015, ICML.
[22] Michael I. Jordan,et al. Unsupervised Domain Adaptation with Residual Transfer Networks , 2016, NIPS.
[23] Tat-Seng Chua,et al. Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval , 2020, SIGIR.
[24] Wei Wang,et al. Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval , 2020, IEEE Transactions on Multimedia.
[25] Wei Liu,et al. Pairwise Relationship Guided Deep Hashing for Cross-Modal Retrieval , 2017, AAAI.
[26] Huchuan Lu,et al. Deep Cross-Modal Projection Learning for Image-Text Matching , 2018, ECCV.
[27] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[28] Liang Lin,et al. Deep Cocktail Network: Multi-source Unsupervised Domain Adaptation with Category Shift , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[29] Sanja Fidler,et al. Be Your Own Prada: Fashion Synthesis with Structural Coherence , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[30] Ivan Laptev,et al. Learning a Text-Video Embedding from Incomplete and Heterogeneous Data , 2018, ArXiv.
[31] Bo Chen,et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.
[32] Xirong Li,et al. University of Amsterdam and Renmin University at TRECVID 2016: Searching Video, Detecting Events and Describing Video , 2016, TRECVID.
[33] Ruslan Salakhutdinov,et al. Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models , 2014, ArXiv.
[34] Mei Wang,et al. Deep Visual Domain Adaptation: A Survey , 2018, Neurocomputing.
[35] Ruxin Chen,et al. Temporal Attentive Alignment for Large-Scale Video Domain Adaptation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[36] G. G. Stokes. "J." , 1890, The New Yale Book of Quotations.
[37] Marc'Aurelio Ranzato,et al. DeViSE: A Deep Visual-Semantic Embedding Model , 2013, NIPS.
[38] Chong-Wah Ngo,et al. Interpretable Embedding for Ad-Hoc Video Search , 2020, ACM Multimedia.
[39] Heng Tao Shen,et al. Universal Weighting Metric Learning for Cross-Modal Matching , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[40] Amit K. Roy-Chowdhury,et al. Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval , 2018, ICMR.
[41] Kate Saenko,et al. Return of Frustratingly Easy Domain Adaptation , 2015, AAAI.
[42] Xirong Li,et al. Dual Encoding for Zero-Example Video Retrieval , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[43] David J. Fleet,et al. VSE++: Improving Visual-Semantic Embeddings with Hard Negatives , 2017, BMVC.
[44] Xiaogang Wang,et al. Person Search with Natural Language Description , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[45] Nicolas Courty,et al. DeepJDOT: Deep Joint distribution optimal transport for unsupervised domain adaptation , 2018, ECCV.
[46] Jonathan G. Fiscus,et al. TRECVID 2018: Benchmarking Video Activity Detection, Video Captioning and Matching, Video Storytelling Linking and Video Search , 2018, TRECVID.
[47] Victor S. Lempitsky,et al. Unsupervised Domain Adaptation by Backpropagation , 2014, ICML.
[48] José M. F. Moura,et al. Adversarial Multiple Source Domain Adaptation , 2018, NeurIPS.
[49] Yuxin Peng,et al. Deep Cross-Media Knowledge Transfer , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[50] Larry S. Davis,et al. Automatic Spatially-Aware Fashion Concept Discovery , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[51] Muhammet Bastan,et al. NTU ROSE Lab at TRECVID 2018: Ad-hoc Video Search and Video to Text , 2018, TRECVID.
[52] Shiliang Sun,et al. A survey of multi-source domain adaptation , 2015, Inf. Fusion.
[53] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[54] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.
[55] Yin Li,et al. Learning Deep Structure-Preserving Image-Text Embeddings , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[56] Yang Yang,et al. Adversarial Cross-Modal Retrieval , 2017, ACM Multimedia.
[57] Heng Tao Shen,et al. Learning Cross-Modal Common Representations by Private–Shared Subspaces Separation , 2020, IEEE Transactions on Cybernetics.
[58] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[59] Xiaogang Wang,et al. CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[60] Qing Li,et al. VIREO @ TRECVID 2017: Video-to-Text, Ad-hoc Video Search, and Video hyperlinking , 2017, TRECVID.
[61] Shizhe Chen,et al. Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[62] Tsuyoshi Murata,et al. {m , 1934, ACML.
[63] Xirong Li,et al. Predicting Visual Features From Text for Image and Video Caption Retrieval , 2017, IEEE Transactions on Multimedia.
[64] Kate Saenko,et al. Deep CORAL: Correlation Alignment for Deep Domain Adaptation , 2016, ECCV Workshops.
[65] François Laviolette,et al. Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..
[66] Xirong Li,et al. Renmin University of China and Zhejiang Gongshang University at TRECVID 2018: Deep Cross-Modal Embeddings for Video-Text Retrieval , 2018, TRECVID.
[67] Xin Yao,et al. Evolutionary Generative Adversarial Networks , 2018, IEEE Transactions on Evolutionary Computation.
[68] Tetsuji Ogawa,et al. Waseda_meisei at TRECVID 2017 Ad-hoc video search(AVS) , 2017 .
[69] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[70] Ioannis A. Kakadiaris,et al. Adversarial Representation Learning for Text-to-Image Matching , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[71] Yale Song,et al. TGIF: A New Dataset and Benchmark on Animated GIF Description , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[72] Chen Sun,et al. Multi-modal Transformer for Video Retrieval , 2020, ECCV.
[73] Cees Snoek,et al. Video2vec Embeddings Recognize Events When Examples Are Scarce , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[74] Chao Li,et al. Shared Predictive Cross-Modal Deep Quantization , 2018, IEEE Transactions on Neural Networks and Learning Systems.
[75] Yongdong Zhang,et al. Depth Image Denoising Using Nuclear Norm and Learning Graph Model , 2020, ACM Trans. Multim. Comput. Commun. Appl..
[76] Xirong Li,et al. W2VV++: Fully Deep Learning for Ad-hoc Video Search , 2019, ACM Multimedia.
[77] Tetsuji Ogawa,et al. Waseda_Meisei at TRECVID 2018: Ad-hoc Video Search , 2018, TRECVID.
[78] Xin Huang,et al. An Overview of Cross-Media Retrieval: Concepts, Methodologies, Benchmarks, and Challenges , 2017, IEEE Transactions on Circuits and Systems for Video Technology.
[79] Ioannis Patras,et al. Query and Keyframe Representations for Ad-hoc Video Search , 2017, ICMR.