Chuang Gan | Joshua B. Tenenbaum | Jonas Kubilius | Martin Schrimpf | Aidan Curtis | James J. DiCarlo | Dan Gutfreund | Nick Haber | Damian Mrowca | Julian De Freitas | Jeremy Schwartz | Kuno Kim | Abhishek Bhandwaldar | Josh McDermott | Seth Alter | James Traer | Elias Wang | Michael Lingelbach | Kevin Feigelis | Daniel M. Bear | Daniel L.K. Yamins | David Cox | Megumi Sano
[1] Sergey Levine,et al. Unsupervised Learning for Physical Interaction through Video Prediction , 2016, NIPS.
[2] Fei-Fei Li,et al. Novel Dataset for Fine-Grained Image Categorization : Stanford Dogs , 2012 .
[3] Rob Fergus,et al. Learning Physical Intuition of Block Towers by Example , 2016, ICML.
[4] Subhransu Maji,et al. Fine-Grained Visual Classification of Aircraft , 2013, ArXiv.
[5] Jitendra Malik,et al. Gibson Env: Real-World Perception for Embodied Agents , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[6] Chuang Gan,et al. Music Gesture for Visual Sound Separation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Silvio Savarese,et al. Interactive Gibson Benchmark: A Benchmark for Interactive Navigation in Cluttered Environments , 2020, IEEE Robotics and Automation Letters.
[8] Jiajun Wu,et al. Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids , 2018, ICLR.
[9] Jessica B. Hamrick,et al. Simulation as an engine of physical scene understanding , 2013, Proceedings of the National Academy of Sciences.
[10] Shane Legg,et al. DeepMind Lab , 2016, ArXiv.
[11] Kristen Grauman,et al. Audio-Visual Embodied Navigation , 2019, ArXiv.
[12] Ali Farhadi,et al. AI2-THOR: An Interactive 3D Environment for Visual AI , 2017, ArXiv.
[13] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[14] Elizabeth S. Spelke,et al. Principles of Object Perception , 1990, Cogn. Sci..
[15] Josh McDermott,et al. Object-based synthesis of scraping and rolling sounds based on non-linear physical constraints , 2021, ArXiv.
[16] A. Gupta,et al. See, Hear, Explore: Curiosity via Audio-Visual Association , 2020, NeurIPS.
[17] Susan C. Johnson. Detecting agents. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.
[18] Andrew Zisserman,et al. A Visual Vocabulary for Flower Classification , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).
[19] Nancy Kanwisher,et al. Physion: Evaluating Physical Prediction from Vision in Humans and Machines , 2021, ArXiv.
[20] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[21] Scott P. Johnson,et al. Infants’ perception of chasing , 2013, Cognition.
[22] Leonidas J. Guibas,et al. SAPIEN: A SimulAted Part-Based Interactive ENvironment , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Chuang Gan,et al. Look, Listen, and Act: Towards Audio-Visual Embodied Navigation , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).
[24] Aren Jansen,et al. Audio Set: An ontology and human-labeled dataset for audio events , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[25] Razvan Pascanu,et al. Interaction Networks for Learning about Objects, Relations and Physics , 2016, NIPS.
[26] James Traer,et al. A Perceptually Inspired Generative Model of Rigid-Body Contact Sounds , 2019 .
[27] Jitendra Malik,et al. Learning to Poke by Poking: Experiential Learning of Intuitive Physics , 2016, NIPS.
[28] Georg Heigold,et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2021, ICLR.
[29] Pietro Perona,et al. Building a bird recognition app and large scale dataset with citizen scientists: The fine print in fine-grained dataset collection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[30] Markus Kuhlo,et al. Architectural Rendering with 3ds Max and V-Ray: Photorealistic Visualization , 2010 .
[31] Jonathan Krause,et al. 3D Object Representations for Fine-Grained Categorization , 2013, 2013 IEEE International Conference on Computer Vision Workshops.
[32] Ali Farhadi,et al. "What Happens If..." Learning to Predict the Effect of Forces in Images , 2016, ECCV.
[33] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[34] Joshua B. Tenenbaum,et al. A Compositional Object-Based Approach to Learning Physical Dynamics , 2016, ICLR.
[35] Thomas A. Funkhouser,et al. Semantic Scene Completion from a Single Depth Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[36] Chuang Gan,et al. A Computational Model for Combinatorial Generalization in Physical Auditory Perception , 2019 .
[37] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[38] Matthieu Guillaumin,et al. Food-101 - Mining Discriminative Components with Random Forests , 2014, ECCV.
[39] Chuang Gan,et al. The Sound of Pixels , 2018, ECCV.
[40] Sanja Fidler,et al. VirtualHome: Simulating Household Activities Via Programs , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[41] Roberto Cipolla,et al. Understanding Real World Indoor Scenes with Synthetic Data , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[42] Jiajun Wu,et al. Shape and Material from Sound , 2017, NIPS.
[43] Dinesh K. Pai,et al. Precomputed acoustic transfer: output-sensitive, accurate sound generation for geometrically complex vibration sources , 2006, SIGGRAPH 2006.
[44] Yuandong Tian,et al. Building Generalizable Agents with a Realistic and Rich 3D Environment , 2018, ICLR.
[45] Jitendra Malik,et al. Habitat: A Platform for Embodied AI Research , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[46] Chuang Gan,et al. Self-Supervised Moving Vehicle Tracking With Stereo Sound , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[47] Jitendra Malik,et al. Learning Visual Predictive Models of Physics for Playing Billiards , 2015, ICLR.
[48] Daniel L. K. Yamins,et al. Flexible Neural Representation for Physics Prediction , 2018, NeurIPS.
[49] Emmanuel Dupoux,et al. IntPhys: A Framework and Benchmark for Visual Intuitive Physics Reasoning , 2018, ArXiv.
[50] Song-Chun Zhu,et al. Learning Perceptual Causality from Video , 2013, AAAI Workshop: Learning Rich Representations from Low-Level Sensors.
[51] Chuang Gan,et al. CLEVRER: CoLlision Events for Video REpresentation and Reasoning , 2020, ICLR.
[52] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[53] Pierre-Yves Oudeyer,et al. Active learning of inverse models with intrinsically motivated goal exploration in robots , 2013, Robotics Auton. Syst..
[54] Faouzi Ghorbel,et al. A simple and efficient approach for 3D mesh approximate convex decomposition , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).
[55] Daniel Yamins,et al. Active World Model Learning in Agent-rich Environments with Progress Curiosity , 2020, ICML 2020.
[56] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .
[57] Abhinav Gupta,et al. Interpretable Intuitive Physics Model , 2018, ECCV.