Learning from THEODORE: A Synthetic Omnidirectional Top-View Indoor Dataset for Deep Transfer Learning
暂无分享,去创建一个
[1] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[2] Wojciech Zaremba,et al. Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[3] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[4] Ning Qian,et al. On the momentum term in gradient descent learning algorithms , 1999, Neural Networks.
[5] Sergio Guadarrama,et al. Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Simon Brodeur,et al. HoME: a Household Multimodal Environment , 2017, ICLR.
[7] Emanuele Frontoni,et al. Person Re-identification Dataset with RGB-D Camera in a Top-View Configuration , 2016, VAAM/FFER@ICPR.
[8] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[9] Paul Bourke. iDome: Immersive gaming with the Unity3D game engine , 2009 .
[10] Paul Bourke,et al. Blender and Immersive Gaming in a Hemispherical Dome , 2010, CGAMES 2010.
[11] Jianguo Zhang,et al. The PASCAL Visual Object Classes Challenge , 2006 .
[12] Kaiming He,et al. Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Luc Van Gool,et al. The 2005 PASCAL Visual Object Classes Challenge , 2005, MLCW.
[14] Alexandre Bernardino,et al. The HDA+ Data Set for Research on Fully Automated Re-identification Systems , 2014, ECCV Workshops.
[15] Takashi Matsubara,et al. RICAP: Random Image Cropping and Patching Data Augmentation for Deep CNNs , 2018, ACML.
[16] Michael J. Black,et al. SMPL: A Skinned Multi-Person Linear Model , 2023 .
[17] Yi Li,et al. R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.
[18] Albert Ali Salah,et al. Feature-based tracking on a multi-omnidirectional camera dataset , 2012, 2012 5th International Symposium on Communications, Control and Signal Processing.
[19] Yalin Bastanlar,et al. A Direct approach for human detection with catadioptric omnidirectional cameras , 2014, 2014 22nd Signal Processing and Communications Applications Conference (SIU).
[20] Yuandong Tian,et al. Building Generalizable Agents with a Realistic and Rich 3D Environment , 2018, ICLR.
[21] Ali Farhadi,et al. YOLOv3: An Incremental Improvement , 2018, ArXiv.
[22] Qiao Wang,et al. VirtualWorlds as Proxy for Multi-object Tracking Analysis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Wenbin Li,et al. InteriorNet: Mega-scale Multi-sensor Photo-realistic Indoor Scenes Dataset , 2018, BMVC.
[25] Sanja Fidler,et al. VirtualHome: Simulating Household Activities Via Programs , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[26] Frank Hutter,et al. SGDR: Stochastic Gradient Descent with Warm Restarts , 2016, ICLR.
[27] Matthias Nießner,et al. Matterport3D: Learning from RGB-D Data in Indoor Environments , 2017, 2017 International Conference on 3D Vision (3DV).
[28] Andrew J. Chosak,et al. OVVV: Using Virtual Worlds to Design and Evaluate Surveillance Systems , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.
[29] Alexandre Bernardino,et al. A multi-camera video dataset for research on high-definition surveillance , 2014 .
[30] Silvio Savarese,et al. Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[31] Barbara Caputo,et al. Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..
[32] Max Welling,et al. Spherical CNNs , 2018, ICLR.
[33] Thomas A. Funkhouser,et al. Semantic Scene Completion from a Single Depth Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[34] Andreas Geiger,et al. SphereNet: Learning Spherical Representations for Detection and Classification in Omnidirectional Images , 2018, ECCV.
[35] Xiaogang Wang,et al. Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[36] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[37] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[38] Varun Jampani,et al. Training Deep Networks with Synthetic Data: Bridging the Reality Gap by Domain Randomization , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[39] André Kaup,et al. A data set providing synthetic and real-world fisheye video sequences , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[40] Cristian Sminchisescu,et al. Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[41] Cristian Sminchisescu,et al. Latent structured models for human pose estimation , 2011, 2011 International Conference on Computer Vision.
[42] Roberto Cipolla,et al. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[43] Ali Farhadi,et al. AI2-THOR: An Interactive 3D Environment for Visual AI , 2017, ArXiv.
[44] Cordelia Schmid,et al. Learning from Synthetic Humans , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[45] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.
[46] Thomas A. Funkhouser,et al. MINOS: Multimodal Indoor Simulator for Navigation in Complex Environments , 2017, ArXiv.
[47] Roberto Cipolla,et al. Understanding RealWorld Indoor Scenes with Synthetic Data , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[48] Thomas A. Funkhouser,et al. Learning Where to Look: Data-Driven Viewpoint Set Selection for 3D Scenes , 2017, ArXiv.
[49] Pietro Perona,et al. Describing Common Human Visual Actions in Images , 2015, BMVC.