3D-Aware Scene Change Captioning From Multiview Images
Yue Qiu | Yutaka Satoh | Ryota Suzuki | Kenji Iwata | Hirokatsu Kataoka