论文信息 - Learning a Deep Reinforcement Learning Policy Over the Latent Space of a Pre-trained GAN for Semantic Age Manipulation

Learning a Deep Reinforcement Learning Policy Over the Latent Space of a Pre-trained GAN for Semantic Age Manipulation

Learning a disentangled representation of the latent space has become one of the most fundamental problems studied in computer vision. Recently, many Generative Adversarial Networks (GANs) have shown promising results in generating high fidelity images. However, studies to understand the semantic layout of the latent space of pre-trained models are still limited. Several works train conditional GANs to generate faces with required semantic attributes. Unfortunately, in these attempts, the generated output is often not as photo-realistic as the unconditional state-of-the-art models. Besides, they also require large computational resources and specific datasets to generate high fidelity images. In our work, we have formulated a Markov Decision Process (MDP) over the latent space of a pre-trained GAN model to learn a conditional policy for semantic manipulation along specific attributes under defined identity bounds. Further, we have defined a semantic age manipulation scheme using a locally linear approximation over the latent space. Results show that our learned policy samples high fidelity images with required age alterations, while preserving the identity of the person.

[1] Lars Kai Hansen,et al. Latent Space Oddity: on the Curvature of Deep Generative Models , 2017, ICLR.

[2] Phillip Isola,et al. On the "steerability" of generative adversarial networks , 2019, ICLR.

[3] Xiaogang Wang,et al. Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[4] Eric T. Nalisnick,et al. Detecting Out-of-Distribution Inputs to Deep Generative Models Using Typicality , 2019 .

[5] Bogdan Raducanu,et al. Invertible Conditional GANs for image editing , 2016, ArXiv.

[6] Martin A. Riedmiller,et al. Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images , 2015, NIPS.

[7] Roman Vershynin,et al. High-Dimensional Probability , 2018 .

[8] O. Papaspiliopoulos. High-Dimensional Probability: An Introduction with Applications in Data Science , 2020 .

[9] Bolei Zhou,et al. Disentangled Inference for GANs with Latently Invertible Autoencoder , 2019 .

[10] C. Perone,et al. Deep semi-supervised segmentation with weight-averaged consistency targets , 2018, DLMIA/ML-CDS@MICCAI.

[11] Trevor Darrell,et al. Adversarial Feature Learning , 2016, ICLR.

[12] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Tom Schaul,et al. Universal Value Function Approximators , 2015, ICML.

[14] Buket D. Barkana,et al. Deep Convolutional Neural Network for Age Estimation based on VGG-Face Model , 2017, ArXiv.

[15] Heng Tao Shen,et al. Dual Conditional GANs for Face Aging and Rejuvenation , 2018, IJCAI.

[16] Luc Van Gool,et al. Deep Expectation of Real and Apparent Age from a Single Image Without Facial Landmarks , 2016, International Journal of Computer Vision.

[17] Iftekharul Mobin,et al. A Comparative Study on Variational Autoencoders and Generative Adversarial Networks , 2019, 2019 International Conference of Artificial Intelligence and Information Technology (ICAIIT).

[18] Bolei Zhou,et al. Image Processing Using Multi-Code GAN Prior , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Djemel Ziou,et al. Image Quality Metrics: PSNR vs. SSIM , 2010, 2010 20th International Conference on Pattern Recognition.

[20] Jaakko Lehtinen,et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation , 2017, ICLR.

[21] Pascal Frossard,et al. Tangent-based manifold approximation with locally linear models , 2012, Signal Process..

[22] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[23] Xu Tang,et al. Face Aging with Identity-Preserved Conditional Generative Adversarial Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24] J. Bartko. The Intraclass Correlation Coefficient as a Measure of Reliability , 1966, Psychological reports.

[25] Yiying Tong,et al. Age-Invariant Face Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26] Pieter Abbeel,et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[27] Léon Bottou,et al. Wasserstein Generative Adversarial Networks , 2017, ICML.

[28] P. Thomas Fletcher,et al. The Riemannian Geometry of Deep Generative Models , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[29] Javier García,et al. A comprehensive survey on safe reinforcement learning , 2015, J. Mach. Learn. Res..

[30] Michael Betancourt,et al. A Conceptual Introduction to Hamiltonian Monte Carlo , 2017, 1701.02434.

[31] Sertac Karaman,et al. Invertibility of Convolutional Generative Networks from Partial Measurements , 2018, NeurIPS.

[32] Marcin Andrychowicz,et al. Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research , 2018, ArXiv.

[33] Prafulla Dhariwal,et al. Glow: Generative Flow with Invertible 1x1 Convolutions , 2018, NeurIPS.

[34] Alex Graves,et al. Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.

[35] Daniel Cohen-Or,et al. Face identity disentanglement via latent space mapping , 2020, ACM Trans. Graph..

[36] Bolei Zhou,et al. Interpreting the Latent Space of GANs for Semantic Face Editing , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[37] Timo Aila,et al. A Style-Based Generator Architecture for Generative Adversarial Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Martin A. Riedmiller,et al. Autonomous reinforcement learning on raw visual input data in a real world application , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[39] Shiguang Shan,et al. AttGAN: Facial Attribute Editing by Only Changing What You Want , 2017, IEEE Transactions on Image Processing.

[40] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.

[41] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[42] Thomas Hofmann,et al. Semantic Interpolation in Implicit Models , 2018, ICLR.

[43] Janghoon Yang,et al. Semi-Supervised FaceGAN for Face-Age Progression and Regression with Synthesized Paired Images , 2020 .

[44] Christopher Burgess,et al. DARLA: Improving Zero-Shot Transfer in Reinforcement Learning , 2017, ICML.

[45] Jules-Raymond Tapamo,et al. Age estimation via face images: a survey , 2018, EURASIP Journal on Image and Video Processing.

[46] Stefan Schaal,et al. Learning to Control in Operational Space , 2008, Int. J. Robotics Res..

[47] Tien D. Bui,et al. Automatic Face Aging in Videos via Deep Reinforcement Learning , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[48] Gene Cheung,et al. SiGAN: Siamese Generative Adversarial Network for Identity-Preserving Face Hallucination , 2018, IEEE Transactions on Image Processing.

[49] Harri Valpola,et al. Weight-averaged consistency targets improve semi-supervised deep learning results , 2017, ArXiv.

[50] S. Shankar Sastry,et al. Autonomous Helicopter Flight via Reinforcement Learning , 2003, NIPS.

[51] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[52] Xueyan Jiang,et al. Metrics for Deep Generative Models , 2017, AISTATS.

[53] Xi Chen,et al. Evolution Strategies as a Scalable Alternative to Reinforcement Learning , 2017, ArXiv.

[54] Yun Fu,et al. Age Synthesis and Estimation via Faces: A Survey , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55] Alexei A. Efros,et al. Generative Visual Manipulation on the Natural Image Manifold , 2016, ECCV.

[56] Pushmeet Kohli,et al. Learning to Understand Goal Specifications by Modelling Reward , 2018, ICLR.

[57] Sergey Levine,et al. SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning , 2018, ICML.

[58] Sang Joon Kim,et al. A Mathematical Theory of Communication , 2006 .

[59] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[60] W. Bajwa. GEOMETRIC MANIFOLD APPROXIMATION USING LOCALLY LINEAR APPROXIMATIONS , 2016 .

[61] Jeff Donahue,et al. Large Scale GAN Training for High Fidelity Natural Image Synthesis , 2018, ICLR.

[62] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[63] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[64] Pieter Abbeel,et al. Learning Plannable Representations with Causal InfoGAN , 2018, NeurIPS.