Holistic++ Scene Understanding: Single-View 3D Holistic Scene Parsing and Human Pose Estimation With Human-Object Interaction and Physical Commonsense
暂无分享,去创建一个
Song-Chun Zhu | Yixin Zhu | Yixin Chen | Siyuan Huang | Siyuan Qi | Tao Yuan | Song-Chun Zhu | Siyuan Qi | Yixin Zhu | Siyuan Huang | Tao Yuan | Yixin Chen
[1] Pat Hanrahan,et al. SceneGrok: inferring action maps in 3D environments , 2014, ACM Trans. Graph..
[2] Yaser Sheikh,et al. OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[3] Danica Kragic,et al. Visual object-action recognition: Inferring object affordances from human demonstration , 2011, Comput. Vis. Image Underst..
[4] Francesc Moreno-Noguer,et al. Single image 3D human pose estimation from noisy observations , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[5] T. Kanade,et al. Reconstructing 3D Human Pose from 2D Image Landmarks , 2012, ECCV.
[6] Aimee E. Stahl,et al. Observing the unexpected enhances infants’ learning and exploration , 2015, Science.
[7] Leonidas J. Guibas,et al. Human action recognition by learning bases of action attributes and parts , 2011, 2011 International Conference on Computer Vision.
[8] Leonidas J. Guibas,et al. Frustum PointNets for 3D Object Detection from RGB-D Data , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[9] Bolei Zhou,et al. Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.
[10] W. Davis. The Ecological Approach to Visual Perception , 2012 .
[11] Katherine D. Kinzler,et al. Core knowledge. , 2007, Developmental science.
[12] R. Baillargeon. Infants' Physical World , 2004 .
[13] W. K. Hastings,et al. Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .
[14] E. Reed. The Ecological Approach to Visual Perception , 1989 .
[15] Derek Hoiem,et al. LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[16] Silvio Savarese,et al. Understanding Indoor Scenes Using 3D Geometric Phrases , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[17] Katsushi Ikeuchi,et al. Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[18] Lourdes Agapito,et al. Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[19] Song-Chun Zhu,et al. Learning Human-Object Interactions by Graph Parsing Neural Networks , 2018, ECCV.
[20] Rui Ma,et al. Action-driven 3D indoor scene evolution , 2016, ACM Trans. Graph..
[21] Jiajun Wu,et al. Galileo: Perceiving Physical Object Properties by Integrating a Physics Engine with Deep Learning , 2015, NIPS.
[22] Yuandong Tian,et al. Single Image 3D Interpreter Network , 2016, ECCV.
[23] Nanning Zheng,et al. Modeling 4D Human-Object Interactions for Event and Object Recognition , 2013, 2013 IEEE International Conference on Computer Vision.
[24] Chenfanfu Jiang,et al. Human-Centric Indoor Scene Synthesis Using Stochastic Grammar , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[25] Hans-Peter Seidel,et al. VNect , 2017, ACM Trans. Graph..
[26] Takeo Kanade,et al. Panoptic Studio: A Massively Multiview System for Social Interaction Capture , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[27] H. Bekkering,et al. Developmental psychology: Rational imitation in preverbal infants , 2002, Nature.
[28] James R. Kubricht,et al. Intuitive Physics: Current Research and Controversies , 2017, Trends in Cognitive Sciences.
[29] E. Spelke,et al. Perception of partly occluded objects in infancy , 1983, Cognitive Psychology.
[30] Song-Chun Zhu,et al. Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation , 2018, NeurIPS.
[31] Lisa Feigenson,et al. Tracking individuals via object-files: evidence from infants' manual search , 2003 .
[32] Ersin Yumer,et al. Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[33] Silvio Savarese,et al. Watch-n-patch: Unsupervised understanding of actions and relations , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[34] Jianxiong Xiao,et al. SUN RGB-D: A RGB-D scene understanding benchmark suite , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[35] Song-Chun Zhu,et al. Learning Pose Grammar to Encode Human Body Configuration for 3D Pose Estimation , 2017, AAAI.
[36] Thomas A. Funkhouser,et al. Semantic Scene Completion from a Single Depth Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[37] Song-Chun Zhu,et al. Scene Parsing by Integrating Function, Geometry and Appearance Models , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[38] Zhijian Liu,et al. Learning to Exploit Stability for 3D Scene Parsing , 2018, NeurIPS.
[39] Jia Deng,et al. Learning to Detect Human-Object Interactions , 2017, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).
[40] Abhinav Gupta,et al. Marr Revisited: 2D-3D Alignment via Surface Normal Prediction , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[41] A. Woodward. Infants' ability to distinguish between purposeful and non-purposeful behaviors , 1999 .
[42] Svetlana Lazebnik,et al. Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering , 2016, ECCV.
[43] Chenfanfu Jiang,et al. Configurable 3D Scene Synthesis and 2D Image Rendering with Per-pixel Ground Truth Using Stochastic Grammars , 2017, International Journal of Computer Vision.
[44] E. Spelke,et al. Perceptual completion of surfaces in infancy. , 1987, Journal of experimental psychology. Human perception and performance.
[45] Cristian Sminchisescu,et al. Monocular 3D Pose and Shape Estimation of Multiple People in Natural Scenes: The Importance of Multiple Scene Constraints , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[46] Ali Farhadi,et al. Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[47] Katsushi Ikeuchi,et al. Scene Understanding by Reasoning Stability and Safety , 2015, International Journal of Computer Vision.
[48] Antoni B. Chan,et al. 3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network , 2014, ACCV.
[49] Yan Wang,et al. A Simple, Fast and Highly-Accurate Algorithm to Recover 3D Shape from 2D Landmarks on a Single Image , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[50] S. Carey,et al. First-person action experience reveals sensitivity to action efficiency in prereaching infants , 2013, Proceedings of the National Academy of Sciences.
[51] Derek Hoiem,et al. Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.
[52] Song-Chun Zhu,et al. Understanding tools: Task-oriented object modeling, learning and recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[53] M. McCloskey,et al. Intuitive physics: the straight-down belief and its origin. , 1983, Journal of experimental psychology. Learning, memory, and cognition.
[54] Song-Chun Zhu,et al. Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image , 2018, ECCV.
[55] A. Needham. Factors Affecting Infants' Use of Featural Information in Object Segregation , 1997 .
[56] Larry S. Davis,et al. Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[57] Jianxiong Xiao,et al. Sliding Shapes for 3D Object Detection in Depth Images , 2014, ECCV.
[58] H. Furth. Object permanence in five-month-old infants. , 1987, Cognition.
[59] Songhwai Oh,et al. Complex Non-rigid 3D Shape Recovery Using a Procrustean Normal Distribution Mixture Model , 2015, International Journal of Computer Vision.
[60] Matthias Nießner,et al. PiGraphs , 2016, ACM Trans. Graph..
[61] Derek Hoiem,et al. Complete 3D Scene Parsing from Single RGBD Image , 2017, ArXiv.
[62] Chenfanfu Jiang,et al. Inferring Forces and Learning Human Utilities from Videos , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).