暂无分享,去创建一个
[1] Rogério Schmidt Feris,et al. Dialog-based Interactive Image Retrieval , 2018, NeurIPS.
[2] Mitesh M. Khapra,et al. Towards Building Large Scale Multimodal Domain-Aware Conversation Systems , 2017, AAAI.
[3] Zhedong Zheng,et al. Dual-path Convolutional Image-Text Embeddings with Instance Loss , 2017, ACM Trans. Multim. Comput. Commun. Appl..
[4] Xiaogang Wang,et al. DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Nathan Srebro,et al. Exploring Generalization in Deep Learning , 2017, NIPS.
[6] Edward H. Adelson,et al. Discovering states and transformations in image collections , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Arif Mahmood,et al. Do Cross Modal Systems Leverage Semantic Relationships? , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).
[8] Xi Chen,et al. Stacked Cross Attention for Image-Text Matching , 2018, ECCV.
[9] Kristen Grauman,et al. Attributes as Operators , 2018, ECCV.
[10] Razvan Pascanu,et al. A simple neural network module for relational reasoning , 2017, NIPS.
[11] Xiaoxiao Guo,et al. The Fashion IQ Dataset: Retrieving Images by Combining Side Information and Relative Natural Language Feedback , 2019, ArXiv.
[12] Vicente Ordonez,et al. Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries , 2019, NeurIPS.
[13] Bo Zhao,et al. Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Yin Li,et al. Learning Deep Structure-Preserving Image-Text Embeddings , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[15] Li Fei-Fei,et al. Composing Text and Image for Image Retrieval - an Empirical Odyssey , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[16] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[17] Yair Movshovitz-Attias,et al. No Fuss Distance Metric Learning Using Proxies , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[18] Bohyung Han,et al. Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[19] Ryota Tomioka,et al. Norm-Based Capacity Control in Neural Networks , 2015, COLT.
[20] David J. Fleet,et al. VSE++: Improving Visual-Semantic Embeddings with Hard Negatives , 2017, BMVC.
[21] Aaron C. Courville,et al. FiLM: Visual Reasoning with a General Conditioning Layer , 2017, AAAI.
[22] Larry S. Davis,et al. Automatic Spatially-Aware Fashion Concept Discovery , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[23] Geoffrey E. Hinton,et al. Neighbourhood Components Analysis , 2004, NIPS.
[24] Yupeng Gao,et al. Fashion IQ: A New Dataset towards Retrieving Images by Natural Language Feedback , 2019 .
[25] 知秀 柴田. 5分で分かる!? 有名論文ナナメ読み:Jacob Devlin et al. : BERT : Pre-training of Deep Bidirectional Transformers for Language Understanding , 2020 .
[26] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[27] Ser-Nam Lim,et al. A Metric Learning Reality Check , 2020, ECCV.
[28] Huchuan Lu,et al. Deep Cross-Modal Projection Learning for Image-Text Matching , 2018, ECCV.
[29] Richard S. Zemel,et al. Prototypical Networks for Few-shot Learning , 2017, NIPS.
[30] Ryota Tomioka,et al. In Search of the Real Inductive Bias: On the Role of Implicit Regularization in Deep Learning , 2014, ICLR.