Commands 4 Autonomous Vehicles (C4AV) Workshop Summary
Luc Van Gool | Tinne Tuytelaars | Marie-Francine Moens | Thierry Deruyttere | Simon Vandenhende | Dusan Grujicic | Yu Liu | Matthew Blaschko
[1] Yoav Artzi, et al. TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments, 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[2] Trevor Darrell, et al. Explainable Neural Computation via Stack Neural Module Networks, 2018, ECCV.
[3] Jianxiong Xiao, et al. R-CNN for Small Object Detection, 2016, ACCV.
[4] Trevor Darrell, et al. Natural Language Object Retrieval, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Jitendra Malik, et al. Habitat: A Platform for Embodied AI Research, 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[6] Liwei Wang, et al. Learning Two-Branch Neural Networks for Image-Text Matching Tasks, 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[7] Jesse Thomason, et al. Vision-and-Dialog Navigation, 2019, CoRL.
[8] Christopher D. Manning, et al. Compositional Attention Networks for Machine Reasoning, 2018, ICLR.
[9] Marie-Francine Moens, et al. Giving Commands to a Self-driving Car: A Multimodal Reasoner for Visual Grounding, 2020, ArXiv.
[10] Alan L. Yuille, et al. Generation and Comprehension of Unambiguous Object Descriptions, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Dengxin Dai, et al. Talk2Nav: Long-Range Vision-and-Language Navigation with Dual Attention and Spatial Memory, 2019, Int. J. Comput. Vis.
[12] Vivek Mittal. AttnGrounder: Talking to Cars with Attention, 2020, ECCV Workshops.
[13] Qiang Xu, et al. nuScenes: A Multimodal Dataset for Autonomous Driving, 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Peter Young, et al. Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics, 2013, J. Artif. Intell. Res.
[15] Hang Dai, et al. Commands for Autonomous Vehicles by Progressively Stacking Visual-Linguistic Representations, 2020, ECCV Workshops.
[16] Furu Wei, et al. VL-BERT: Pre-training of Generic Visual-Linguistic Representations, 2019, ICLR.
[17] David Berthelot, et al. FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence, 2020, NeurIPS.
[18] Thierry Deruyttere, et al. A Baseline for the Commands For Autonomous Vehicles Challenge, 2020, ArXiv.
[19] Margaret Mitchell, et al. VQA: Visual Question Answering, 2015, International Journal of Computer Vision.
[20] Luc Van Gool, et al. Talk2Nav: Long-Range Vision-and-Language Navigation in Cities, 2019, ArXiv.
[21] Vicente Ordonez, et al. ReferItGame: Referring to Objects in Photographs of Natural Scenes, 2014, EMNLP.
[22] Luc Van Gool, et al. Object Referring in Videos with Language and Human Gaze, 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[23] Qi Wu, et al. Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments, 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[24] Marie-Francine Moens, et al. Talk2Car: Taking Control of Your Self-Driving Car, 2019, EMNLP.
[25] Justin Johnson, et al. DDRprog: A CLEVR Differentiable Dynamic Reasoning Programmer, 2018, ArXiv.
[26] Trevor Darrell, et al. Grounding of Textual Phrases in Images by Reconstruction, 2015, ECCV.
[27] Ramakant Nevatia, et al. Query-Guided Regression Network with Context Policy for Phrase Grounding, 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[28] Lukasz Kaiser, et al. Attention is All you Need, 2017, NIPS.
[29] Peter Young, et al. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions, 2014, TACL.
[30] Lei Zhang, et al. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering, 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[31] Licheng Yu, et al. MAttNet: Modular Attention Network for Referring Expression Comprehension, 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[32] Yoshua Bengio, et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention, 2015, ICML.
[33] Ramakant Nevatia, et al. PIRC Net: Using Proposal Indexing, Relationships and Context for Phrase Grounding, 2018, ACCV.
[34] Licheng Yu, et al. Modeling Context in Referring Expressions, 2016, ECCV.
[35] K. Madhava Krishna, et al. Cosine meets Softmax: A tough-to-beat baseline for visual grounding, 2020, ECCV Workshops.
[36] Natalia Gimelshein, et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library, 2019, NeurIPS.
[37] Kan Chen, et al. Zero-Shot Grounding of Objects From Natural Language Queries, 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[38] Iryna Gurevych, et al. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks, 2019, EMNLP.
[39] Armand Joulin, et al. Deep Fragment Embeddings for Bidirectional Image Sentence Mapping, 2014, NIPS.
[40] Xingyi Zhou, et al. Objects as Points, 2019, ArXiv.
[41] Samy Bengio, et al. Show and tell: A neural image caption generator, 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[42] Ali Farhadi, et al. YOLOv3: An Incremental Improvement, 2018, ArXiv.
[43] Luc Van Gool, et al. Revisiting Multi-Task Learning in the Deep Learning Era, 2020, ArXiv.
[44] Kaiming He, et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[45] Quoc V. Le, et al. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks, 2019, ICML.
[46] Yunchao Wei, et al. Perceptual Generative Adversarial Networks for Small Object Detection, 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[47] Li Fei-Fei, et al. Inferring and Executing Programs for Visual Reasoning, 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[48] Jian Sun, et al. Deep Residual Learning for Image Recognition, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[49] Jie Ou, et al. Attention Enhanced Single Stage Multimodal Reasoner, 2020, ECCV Workshops.
[50] Pietro Perona, et al. Microsoft COCO: Common Objects in Context, 2014, ECCV.
[51] Luc Van Gool, et al. SCAN: Learning to Classify Images Without Labels, 2020, ECCV.
[52] Jitendra Malik, et al. Learning Rich Features from RGB-D Images for Object Detection and Segmentation, 2014, ECCV.