High-Resolution Gaze-Corrected Image Generation based on Combined Conditional GAN and Residual Dense Network

In recent years, the ease of taking selfie photos has improved due to widespread use of smartphones and social networking services (SNSs). In a typical smartphone, the camera layout and display are different, rendering the gaze often not front-facing. Although various gaze-correction methods have been proposed for this problem, many of them cannot generate sufficiently natural images. Furthermore, no improvement can typically be seen in the unnatural images generated with these methods. In this paper, we propose a gaze-correction method using a deep generative model. This model can determine the naturalness of the resulting images and learn to provide natural results. In addition, recent smartphones have the ability to take large-sized photos. However, it is difficult for this model to generate large-sized images. In this paper, we use super-resolution techniques to generate images that have a larger size and higher resolution than those generated by conventional methods. In our experiments, to reduce the size of our neural network, we input a small-sized image and convert it back to its original size after performing gaze-correction.

[1]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Lior Wolf,et al.  An eye for an eye: A single camera gaze-replacement method , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Andrew Blake,et al.  Gaze manipulation for one-to-one teleconferencing , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[4]  Ruigang Yang,et al.  Eye contact in video conference via fusion of time-of-flight depth sensor and stereo , 2011 .

[5]  Jean-Charles Bazin,et al.  Gaze correction with a single webcam , 2014, 2014 IEEE International Conference on Multimedia and Expo (ICME).

[6]  Peter Robinson,et al.  GazeDirector: Fully Articulated Eye Gaze Redirection in Video , 2017, Comput. Graph. Forum.

[7]  Victor S. Lempitsky,et al.  Learning to look up: Realtime monocular gaze correction using machine learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Andrew Jones,et al.  Achieving eye contact in a one-to-many 3D video teleconferencing system , 2009, ACM Trans. Graph..

[9]  Yun Fu,et al.  Residual Dense Network for Image Super-Resolution , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[10]  Markus H. Gross,et al.  Gaze Correction for Home Video Conferencing , 2012 .

[11]  Simon Osindero,et al.  Conditional Generative Adversarial Nets , 2014, ArXiv.

[12]  Ruigang Yang,et al.  Eye gaze correction with stereovision for video-teleconferencing , 2002, IEEE Transactions on Pattern Analysis and Machine Intelligence.