How Transferable are Video Representations Based on Synthetic Data?
暂无分享,去创建一个
A. Oliva | Kate Saenko | Hilde Kuehne | R. Feris | Venkatesh Saligrama | R. Panda | Hilde Kuehne | Leonid Karlinsky | Samarth Mishra | Yo-whan Kim | SouYoung Jin
[1] Victor G. Turrisi da Costa,et al. Dual-Head Contrastive Domain Adaptation for Video Action Recognition , 2022, 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
[2] Multi-level Attentive Adversarial Learning with Temporal Dilation for Unsupervised Video Domain Adaptation , 2022, 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
[3] Alan Yuille,et al. Masked Feature Prediction for Self-Supervised Visual Pre-Training , 2021, ArXiv.
[4] Cheng Perng Phoo,et al. Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Alexander Kolesnikov,et al. Scaling Vision Transformers , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Vladlen Koltun,et al. Enhancing Photorealism Enhancement , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[7] Lu Yuan,et al. Florence: A New Foundation Model for Computer Vision , 2021, ArXiv.
[8] A. Torralba,et al. Learning to See by Looking at Noise , 2021, NeurIPS.
[9] Christoph Feichtenhofer,et al. Multiscale Vision Transformers , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[10] Jun Liu,et al. UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Heng Wang,et al. Is Space-Time Attention All You Need for Video Understanding? , 2021, ICML.
[12] Kate Saenko,et al. Semi-Supervised Action Recognition with Temporal Contrastive Learning , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Quanfu Fan,et al. Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Josh H. McDermott,et al. ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation , 2020, NeurIPS Datasets and Benchmarks.
[15] C. Schmid,et al. Synthetic Humans for Action Recognition from Unseen Viewpoints , 2019, International Journal of Computer Vision.
[16] Kenji Fukumizu,et al. A Scaling Law for Synthetic-to-Real Transfer: A Measure of Pre-Training , 2021, ArXiv.
[17] Ig-Jae Kim,et al. ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications , 2020, IEEE Access.
[18] Vibhav Vineet,et al. AutoSimulate: (Quickly) Learning Synthetic Data Generation , 2020, ECCV.
[19] Zi Huang,et al. Adversarial Bipartite Graph Learning for Video Domain Adaptation , 2020, ACM Multimedia.
[20] Wen-mei W. Hwu,et al. Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[21] Philip David,et al. A Curriculum Domain Adaptation Approach to the Semantic Segmentation of Urban Scenes , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[22] Bolei Zhou,et al. Moments in Time Dataset: One Million Videos for Event Understanding , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[23] Chen Gao,et al. Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition , 2019, NeurIPS.
[24] Ruxin Chen,et al. Temporal Attentive Alignment for Large-Scale Video Domain Adaptation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[25] Ivan Laptev,et al. HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[26] Heng Wang,et al. Large-Scale Weakly-Supervised Pre-Training for Video Action Recognition , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[27] Sanja Fidler,et al. Meta-Sim: Learning to Generate Synthetic Datasets , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[28] Heng Tao Shen,et al. A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition , 2019, ArXiv.
[29] Jitendra Malik,et al. Habitat: A Platform for Embodied AI Research , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[30] Stanley T. Birchfield,et al. Structured Domain Randomization: Bridging the Reality Gap by Context-Aware Synthetic Data , 2018, 2019 International Conference on Robotics and Automation (ICRA).
[31] Manmohan Krishna Chandraker,et al. Learning To Simulate , 2018, ICLR.
[32] Pascal Fua,et al. Beyond Sharing Weights for Deep Domain Adaptation , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[33] Yi Li,et al. RESOUND: Towards Action Recognition Without Representation Bias , 2018, ECCV.
[34] Kate Saenko,et al. Syn2Real: A New Benchmark forSynthetic-to-Real Visual Domain Adaptation , 2018, ArXiv.
[35] Jitendra Malik,et al. Gibson Env: Real-World Perception for Embodied Agents , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[36] Trevor Darrell,et al. Women also Snowboard: Overcoming Bias in Captioning Models , 2018, ECCV.
[37] Timnit Gebru,et al. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification , 2018, FAT.
[38] Yann LeCun,et al. A Closer Look at Spatiotemporal Convolutions for Action Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[39] Yong Jae Lee,et al. Cross-Domain Self-Supervised Multi-task Feature Learning Using Synthetic Imagery , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[40] Taesung Park,et al. CyCADA: Cycle-Consistent Adversarial Domain Adaptation , 2017, ICML.
[41] Cordelia Schmid,et al. AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[42] Ali Farhadi,et al. AI2-THOR: An Interactive 3D Environment for Visual AI , 2017, ArXiv.
[43] Anoop Cherian,et al. Human Pose Forecasting via Deep Markov Models , 2017, 2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA).
[44] Susanne Westphal,et al. The “Something Something” Video Database for Learning and Evaluating Visual Common Sense , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[45] Andrew Zisserman,et al. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[46] Fabio Viola,et al. The Kinetics Human Action Video Dataset , 2017, ArXiv.
[47] Trevor Darrell,et al. Adversarial Discriminative Domain Adaptation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[48] Gabriela Csurka,et al. Domain Adaptation for Visual Applications: A Comprehensive Survey , 2017, ArXiv.
[49] Li Fei-Fei,et al. CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[50] Antonio Manuel López Peña,et al. Procedural Generation of Videos to Train Deep Action Recognition Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[51] Li Fei-Fei,et al. Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos , 2015, International Journal of Computer Vision.
[52] Apostol Natsev,et al. YouTube-8M: A Large-Scale Video Classification Benchmark , 2016, ArXiv.
[53] Luc Van Gool,et al. Temporal Segment Networks: Towards Good Practices for Deep Action Recognition , 2016, ECCV.
[54] Antonio M. López,et al. The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[55] Gang Wang,et al. NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[56] Ali Farhadi,et al. Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding , 2016, ECCV.
[57] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[58] François Laviolette,et al. Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..
[59] MarchandMario,et al. Domain-adversarial training of neural networks , 2016 .
[60] Kate Saenko,et al. Learning Deep Object Detectors from 3D Models , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[61] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.
[62] Thomas Serre,et al. HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.
[63] S. Gong,et al. Recognising action as clouds of space-time interest points , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[64] B. Caputo,et al. Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..
[65] A. Verri,et al. Analysis of differential and matching methods for optical flow , 1989, [1989] Proceedings. Workshop on Visual Motion.
[66] Ramakant Nevatia,et al. Description and Recognition of Curved Objects , 1977, Artif. Intell..