Paper information - iOrthoPredictor: model-guided deep prediction of teeth alignment

Fig. 1. Given a face photograph of a patient with malpositioned teeth and a corresponding 3D teeth model (obtained by dental scanning), our method is able to produce a face image with the teeth aligned, mimicking an orthodontic treatment effect. The input teeth model and the automatically aligned teeth for the first patient are overlaid with the mouth area shown aside. All the results are obtained fully automatically.
