暂无分享,去创建一个
Erwin M. Bakker | Michael S. Lew | Wei Chen | Yu Liu | M. Lew | E. Bakker | Yu Liu | Wei Chen
[1] Lior Wolf,et al. RNN Fisher Vectors for Action Recognition and Image Annotation , 2015, ECCV.
[2] Zhiyong Wu,et al. Modality-specific and shared generative adversarial network for cross-modal retrieval , 2020, Pattern Recognit..
[3] Feng Xia,et al. A novel strategy to balance the results of cross-modal hashing , 2020, Pattern Recognit..
[4] Di Wang,et al. Joint and individual matrix factorization hashing for large-scale cross-modal retrieval , 2020, Pattern Recognit..
[5] Sang Joon Kim,et al. A Mathematical Theory of Communication , 2006 .
[6] Xiaogang Wang,et al. Person Search with Natural Language Description , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[8] Lior Wolf,et al. Associating neural word embeddings with deep image representations using Fisher Vectors , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[9] Liang Wang,et al. Improving Description-Based Person Re-Identification by Multi-Granularity Image-Text Alignments , 2019, IEEE Transactions on Image Processing.
[10] Ioannis A. Kakadiaris,et al. Adversarial Representation Learning for Text-to-Image Matching , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[11] Xirong Li,et al. Predicting Visual Features From Text for Image and Video Caption Retrieval , 2017, IEEE Transactions on Multimedia.
[12] Wei Wang,et al. A Comprehensive Survey on Cross-modal Retrieval , 2016, ArXiv.
[13] Erwin M. Bakker,et al. CycleMatch: A cycle-consistent embedding network for image-text matching , 2019, Pattern Recognit..
[14] Xin Xu,et al. Cross-Modality Retrieval by Joint Correlation Learning , 2019, ACM Trans. Multim. Comput. Commun. Appl..
[15] Yang Yang,et al. Adversarial Cross-Modal Retrieval , 2017, ACM Multimedia.
[16] Petros Daras,et al. Graph-based multimodal fusion with metric learning for multimodal classification , 2019, Pattern Recognit..
[17] David J. Fleet,et al. VSE++: Improving Visual-Semantic Embeddings with Hard Negatives , 2017, BMVC.
[18] Victor S. Lempitsky,et al. Unsupervised Domain Adaptation by Backpropagation , 2014, ICML.
[19] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[20] Alberto Del Bimbo,et al. Socializing the Semantic Gap , 2015, ACM Comput. Surv..
[21] Jung-Woo Ha,et al. Dual Attention Networks for Multimodal Reasoning and Matching , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[22] Xiaogang Wang,et al. Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association , 2018, ECCV.
[23] Yu Liu,et al. Learning a Recurrent Residual Fusion Network for Multimodal Matching , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[24] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[25] Patrick Pérez,et al. ADVENT: Adversarial Entropy Minimization for Domain Adaptation in Semantic Segmentation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Peter Young,et al. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions , 2014, TACL.
[27] Kilian Q. Weinberger,et al. On Calibration of Modern Neural Networks , 2017, ICML.
[28] Xiaogang Wang,et al. Identity-Aware Textual-Visual Matching with Latent Co-attention , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[29] Huchuan Lu,et al. Deep Cross-Modal Projection Learning for Image-Text Matching , 2018, ECCV.
[30] Alberto Del Bimbo,et al. Image Tag Assignment, Refinement and Retrieval , 2015, ACM Multimedia.
[31] Yin Li,et al. Learning Deep Structure-Preserving Image-Text Embeddings , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[32] Mohamed Cheriet,et al. Word spotting and recognition via a joint deep embedding of image and text , 2019, Pattern Recognit..
[33] Louis-Philippe Morency,et al. Multimodal Machine Learning: A Survey and Taxonomy , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[34] Wei Xu,et al. Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN) , 2014, ICLR.
[35] Peter Young,et al. Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics , 2013, J. Artif. Intell. Res..
[36] Wei Wang,et al. Instance-Aware Image and Sentence Matching with Selective Multimodal LSTM , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[37] Bo Chen,et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.
[38] Zhedong Zheng,et al. Dual-path Convolutional Image-Text Embeddings with Instance Loss , 2017, ACM Trans. Multim. Comput. Commun. Appl..
[39] Wei Chen,et al. Domain Uncertainty Based On Information Theory for Cross-Modal Hash Retrieval , 2019, 2019 IEEE International Conference on Multimedia and Expo (ICME).
[40] Jürgen Schmidhuber,et al. Bidirectional LSTM Networks for Improved Phoneme Classification and Recognition , 2005, ICANN.