Ta-Chung Chi | Yi-Ting Yeh | Yau-Shian Wang | Ting-Rui Chiang
[1] Silvio Savarese,et al. 3D Semantic Parsing of Large-Scale Indoor Spaces , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[2] Dieter Fox,et al. A Persistent Spatial Semantic Representation for High-level Natural Language Instruction Execution , 2021, CoRL.
[3] Trevor Darrell,et al. Modularity Improves Out-of-Domain Instruction Following , 2020, ArXiv.
[4] Raia Hadsell,et al. Learning To Follow Directions in Street View , 2019, AAAI.
[5] Yichi Zhang,et al. Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-Monitoring , 2021, FINDINGS.
[6] Licheng Yu,et al. Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout , 2019, NAACL.
[7] Ali Farhadi,et al. IQA: Visual Question Answering in Interactive Environments , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[8] Ghassan Al-Regib,et al. Self-Monitoring Navigation Agent via Auxiliary Progress Estimation , 2019, ICLR.
[9] Yuandong Tian,et al. Building Generalizable Agents with a Realistic and Rich 3D Environment , 2018, ICLR.
[10] Dan Klein,et al. Are You Looking? Grounding to Multiple Modalities in Vision-and-Language Navigation , 2019, ACL.
[11] Gökhan Tür,et al. Are We There Yet? Learning to Localize in Embodied Instruction Following , 2021, ArXiv.
[12] Kunal Pratap Singh,et al. Agent with the Big Picture: Perceiving Surroundings for Interactive Instruction Following , 2021.
[13] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[14] Yoav Artzi,et al. TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[15] Andrew Bennett,et al. Mapping Instructions to Actions in 3D Environments with Visual Goal Prediction , 2018, EMNLP.
[16] Ghassan Al-Regib,et al. The Regretful Agent: Heuristic-Aided Navigation Through Progress Estimation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[17] Dan Klein,et al. Speaker-Follower Models for Vision-and-Language Navigation , 2018, NeurIPS.
[18] Qi Wu,et al. Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[19] Rahul Sukthankar,et al. Cognitive Mapping and Planning for Visual Navigation , 2017, International Journal of Computer Vision.
[20] Dan Klein,et al. Alignment-Based Compositional Semantics for Instruction Following , 2015, EMNLP.
[21] Luke S. Zettlemoyer,et al. Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions , 2013, TACL.
[22] Roozbeh Mottaghi,et al. ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Jacob Krantz,et al. Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments , 2020, ECCV.
[24] Ross A. Knepper,et al. Few-shot Object Grounding and Mapping for Natural Language Robot Instruction Following , 2020, CoRL.
[25] Razvan Pascanu,et al. Learning to Navigate in Complex Environments , 2016, ICLR.
[26] Dorsa Sadigh,et al. Learning Adaptive Language Interfaces through Decomposition , 2020, INTEXSEMPAR.