Monocular Depth Estimation using Diffusion Models
暂无分享,去创建一个
[1] Han Hu,et al. All in Tokens: Unifying Output Space of Visual Tasks via Soft Token , 2023, 2023 IEEE/CVF International Conference on Computer Vision (ICCV).
[2] Chetan Arora,et al. Attention Attention Everywhere: Monocular Depth Prediction with Skip Attention , 2022, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
[3] David J. Fleet,et al. Image Super-Resolution via Iterative Refinement , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[4] Prafulla Dhariwal,et al. Point-E: A System for Generating 3D Point Clouds from Complex Prompts , 2022, ArXiv.
[5] Alexei A. Efros,et al. InstructPix2Pix: Learning to Follow Image Editing Instructions , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[6] David J. Fleet,et al. A Generalist Framework for Panoptic Segmentation of Images and Videos , 2022, ArXiv.
[7] Mohammad Norouzi,et al. Novel View Synthesis with Diffusion Models , 2022, ICLR.
[8] Ben Poole,et al. DreamFusion: Text-to-3D using 2D Diffusion , 2022, ICLR.
[9] J. Tenenbaum,et al. Prompt-to-Prompt Image Editing with Cross Attention Control , 2022, ICLR.
[10] Han Hu,et al. Revealing the Dark Secrets of Masked Image Modeling , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[11] David J. Fleet,et al. Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding , 2022, NeurIPS.
[12] C. Qi,et al. Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking , 2022, 2022 International Conference on Robotics and Automation (ICRA).
[13] Prafulla Dhariwal,et al. Hierarchical Text-Conditional Image Generation with CLIP Latents , 2022, ArXiv.
[14] David J. Fleet,et al. Video Diffusion Models , 2022, NeurIPS.
[15] Junjun Jiang,et al. BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation , 2022, ArXiv.
[16] Tim Salimans,et al. Progressive Distillation for Fast Sampling of Diffusion Models , 2022, ICLR.
[17] B. Ommer,et al. High-Resolution Image Synthesis with Latent Diffusion Models , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[18] Prafulla Dhariwal,et al. GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models , 2021, ICML.
[19] Aäron van den Oord,et al. Step-unrolled Denoising Autoencoders for Text Generation , 2021, ICLR.
[20] S. Ermon,et al. SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations , 2021, ICLR.
[21] David J. Fleet,et al. Cascaded Diffusion Models for High Fidelity Image Generation , 2021, J. Mach. Learn. Res..
[22] Nicu Sebe,et al. Probabilistic Graph Attention Network With Conditional Kernels for Pixel-Wise Prediction , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[23] Konrad Schindler,et al. Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[24] Xiaowei Guo,et al. Transformer-based Dual Relation Graph for Multi-label Image Recognition , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[25] David F. Fouhey,et al. PixelSynth: Generating a 3D-Consistent Experience from a Single Image , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[26] Prafulla Dhariwal,et al. Diffusion Models Beat GANs on Image Synthesis , 2021, NeurIPS.
[27] Vladlen Koltun,et al. Vision Transformers for Dense Prediction , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[28] Prafulla Dhariwal,et al. Improved Denoising Diffusion Probabilistic Models , 2021, ICML.
[29] Varun Jampani,et al. Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[30] Peter Wonka,et al. AdaBins: Depth Estimation Using Adaptive Bins , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[31] Abhishek Kumar,et al. Score-Based Generative Modeling through Stochastic Differential Equations , 2020, ICLR.
[32] Pieter Abbeel,et al. Denoising Diffusion Probabilistic Models , 2020, NeurIPS.
[33] Yair Movshovitz-Attias,et al. Sky Optimization: Semantically aware image processing of skies in low-light photography , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[34] Jiri Matas,et al. Guiding Monocular Depth Estimation Using Depth-Attention Volume , 2020, ECCV.
[35] Gustavo Carneiro,et al. Self-Supervised Monocular Trained Depth Estimation Using Self-Attention and Discrete Disparity Volume , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[36] Dragomir Anguelov,et al. Scalability in Perception for Autonomous Driving: Waymo Open Dataset , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[37] Chunhua Shen,et al. Enforcing Geometric Constraints of Virtual Normal for Depth Prediction , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[38] Il Hong Suh,et al. From Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation , 2019, ArXiv.
[39] Quoc V. Le,et al. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.
[40] Vladlen Koltun,et al. Does computer vision matter for action? , 2019, Science Robotics.
[41] Liang Lin,et al. Monocular Depth Estimation with Affinity, Vertical Pooling, and Label Enhancement , 2018, ECCV.
[42] Dacheng Tao,et al. Deep Ordinal Regression Network for Monocular Depth Estimation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[43] Bolei Zhou,et al. Places: A 10 Million Image Database for Scene Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[44] Yong Jae Lee,et al. Cross-Domain Self-Supervised Multi-task Feature Learning Using Synthetic Imagery , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[45] Chunhua Shen,et al. Estimating Depth From Monocular Images as Classification Using Deep Fully Convolutional Residual Networks , 2016, IEEE Transactions on Circuits and Systems for Video Technology.
[46] Matthias Nießner,et al. ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[47] Lantao Yu,et al. SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient , 2016, AAAI.
[48] Oisin Mac Aodha,et al. Unsupervised Monocular Depth Estimation with Left-Right Consistency , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[49] Stefan Leutenegger,et al. SceneNet RGB-D: 5M Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth , 2016, ArXiv.
[50] Yoshua Bengio,et al. Professor Forcing: A New Algorithm for Training Recurrent Networks , 2016, NIPS.
[51] Nassir Navab,et al. Deeper Depth Prediction with Fully Convolutional Residual Networks , 2016, 2016 Fourth International Conference on 3D Vision (3DV).
[52] Alexei A. Efros,et al. Colorful Image Colorization , 2016, ECCV.
[53] Gregory Shakhnarovich,et al. Learning Representations for Automatic Colorization , 2016, ECCV.
[54] Gustavo Carneiro,et al. Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue , 2016, ECCV.
[55] Marc'Aurelio Ranzato,et al. Sequence Level Training with Recurrent Neural Networks , 2015, ICLR.
[56] Leonidas J. Guibas,et al. ShapeNet: An Information-Rich 3D Model Repository , 2015, ArXiv.
[57] Roberto Cipolla,et al. SceneNet: Understanding Real World Indoor Scenes With Synthetic Data , 2015, ArXiv.
[58] Samy Bengio,et al. Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.
[59] Surya Ganguli,et al. Deep Unsupervised Learning using Nonequilibrium Thermodynamics , 2015, ICML.
[60] Rob Fergus,et al. Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-scale Convolutional Architecture , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[61] Rob Fergus,et al. Depth Map Prediction from a Single Image using a Multi-Scale Deep Network , 2014, NIPS.
[62] Michael Beetz,et al. Inpainting of Missing Values in the Kinect Sensor's Depth Maps Based on Background Estimates , 2014, IEEE Sensors Journal.
[63] Andreas Geiger,et al. Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..
[64] Derek Hoiem,et al. Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.
[65] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[66] Ashutosh Saxena,et al. Learning Depth from Single Monocular Images , 2005, NIPS.
[67] Ronald J. Williams,et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.
[68] A. Ng,et al. Make3D: Learning 3D Scene Structure from a Single Still Image , 2022 .