Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization
Richang Hong | Liqiang Nie | Zheng Qin | Da Cao | Xiaochi Wei | Yawen Zeng
[1] Bernt Schiele, et al. Grounding Action Descriptions in Videos, 2013, TACL.
[2] Lars Schmidt-Thieme, et al. BPR: Bayesian Personalized Ranking from Implicit Feedback, 2009, UAI.
[3] Jian Sun, et al. Deep Residual Learning for Image Recognition, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[4] Larry S. Davis, et al. MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment, 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Xiaochun Cao, et al. Adversarial Preference Learning with Pairwise Comparisons, 2019, ACM Multimedia.
[6] Qi Tian, et al. Video-Based Cross-Modal Recipe Retrieval, 2019, ACM Multimedia.
[7] Yuval Tassa, et al. Continuous control with deep reinforcement learning, 2015, ICLR.
[8] Ali Farhadi, et al. Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding, 2016, ECCV.
[9] Meng Liu, et al. Attentive Moment Retrieval in Videos, 2018, SIGIR.
[10] Chong-Wah Ngo, et al. Deep Understanding of Cooking Procedure for Cross-modal Recipe Retrieval, 2018, ACM Multimedia.
[11] Liang Wang, et al. Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model, 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[12] Guangyi Xiao, et al. Social-Enhanced Attentive Group Recommendation, 2019, IEEE Transactions on Knowledge and Data Engineering.
[13] Sanja Fidler, et al. Skip-Thought Vectors, 2015, NIPS.
[14] Peng Zhang, et al. IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models, 2017, SIGIR.
[15] Jimmy Ba, et al. Adam: A Method for Stochastic Optimization, 2014, ICLR.
[16] Hanqing Lu, et al. Sketch-based Image Retrieval using Generative Adversarial Networks, 2017, ACM Multimedia.
[17] Philip Bachman, et al. Deep Reinforcement Learning that Matters, 2017, AAAI.
[18] Xiaoyu Du, et al. Adversarial Personalized Ranking for Recommendation, 2018, SIGIR.
[19] Matthew J. Hausknecht, et al. Beyond short snippets: Deep networks for video classification, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[20] Yang Yang, et al. Adversarial Cross-Modal Retrieval, 2017, ACM Multimedia.
[21] Yixin Cao, et al. Reinforced Negative Sampling over Knowledge Graph for Recommendation, 2020, WWW.
[22] Depeng Jin, et al. Reinforced Negative Sampling for Recommendation with Exposure Data, 2019, IJCAI.
[23] Qi Tian, et al. Cross-modal Moment Localization in Videos, 2018, ACM Multimedia.
[24] Yoshua Bengio, et al. Generative Adversarial Nets, 2014, NIPS.
[25] Trevor Darrell, et al. Localizing Moments in Video with Natural Language, 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[26] Bin Jiang, et al. Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention, 2019, ICMR.
[27] Jingkuan Song, et al. Binary Generative Adversarial Networks for Image Retrieval, 2017, AAAI.
[28] Alex Graves, et al. Asynchronous Methods for Deep Reinforcement Learning, 2016, ICML.
[29] Amit K. Roy-Chowdhury, et al. Weakly Supervised Video Moment Retrieval From Text Queries, 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[30] Jiebo Luo, et al. Localizing Natural Language in Videos, 2019, AAAI.
[31] Tao Chen, et al. Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression, 2019, ACM Multimedia.
[32] Xiao Liu, et al. Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos, 2019, AAAI.
[33] Ramakant Nevatia, et al. TALL: Temporal Activity Localization via Language Query, 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[34] Amaia Salvador, et al. Learning Cross-Modal Embeddings for Cooking Recipes and Food Images, 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[35] Kate Saenko, et al. Multilevel Language and Vision Integration for Text-to-Clip Retrieval, 2018, AAAI.
[36] Jiebo Luo, et al. Exploiting Temporal Relationships in Video Moment Localization with Natural Language, 2019, ACM Multimedia.
[37] Kunpeng Zhang, et al. Adversarial Point-of-Interest Recommendation, 2019, WWW.
[38] Zhou Zhao, et al. Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos, 2019, SIGIR.
[39] Shangsong Liang, et al. Unsupervised Semantic Generative Adversarial Networks for Expert Retrieval, 2019, WWW.
[40] Chao Yang, et al. Attentive Group Recommendation, 2018, SIGIR.
[41] Bernt Schiele, et al. Script Data for Attribute-Based Recognition of Composite Activities, 2012, ECCV.
[42] Yu-Gang Jiang, et al. TC-GAN: Triangle Cycle-Consistent GANs for Face Frontalization with Facial Features Preserved, 2019, ACM Multimedia.
[43] Trevor Darrell, et al. Natural Language Object Retrieval, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).