Inter-frame video image generation based on spatial continuity generative adversarial networks

AbstractThis paper proposes a method for generating inter-frame video images based on spatial continuity generative adversarial networks (SC-GANs) to smooth the playing of low-frame rate videos and to clarify blurry image edges caused by the use of traditional methods to improve the video frame rate. Firstly, the auto-encoder is used as a discriminator and Wasserstein distance is applied to represent the difference between the loss distribution of the real sample and the generated sample, instead of the typical method of generative adversarial networks to directly match data distribution. Secondly, the hyperparameter between generator and discriminator is used to stabilize the training process, which effectively prevents the model from collapsing. Finally, taking advantage of the spatial continuity of the image features of continuous video frames, an optimal value between two consecutive frames is found by Adam and then mapped to the image space to generate inter-frame images. In order to illustrate the authenticity of the generated inter-frame images, PSNR and SSIM are adopted to evaluate the inter-frame images, and the results show that the generated inter-frame images have a high degree of authenticity. The feasibility and validity of the proposed method based on SC-GAN are also verified.

[1]  Yücel Altunbasak,et al.  Novel True-Motion Estimation Algorithm and Its Application to Motion-Compensated Temporal Frame Interpolation , 2013, IEEE Transactions on Image Processing.

[2]  Sangkeun Lee,et al.  An Optimal Low Dynamic Range Image Generation Method Using a Neural Network , 2018, IEEE Transactions on Consumer Electronics.

[3]  Alexander Tanchenko,et al.  Visual-PSNR measure of image quality , 2014, J. Vis. Commun. Image Represent..

[4]  Yann LeCun,et al.  Deep multi-scale video prediction beyond mean square error , 2015, ICLR.

[5]  Navin Rajpal,et al.  A matching criterion for motion compensation in the temporal coding of video signal , 2011, Signal Image Video Process..

[6]  Victor H. S. Ha,et al.  Portable receivers for digital multimedia broadcasting , 2004, IEEE Transactions on Consumer Electronics.

[7]  Jaeseok Kim,et al.  Motion compensated frame interpolation by new block-based motion estimation algorithm , 2004, IEEE Trans. Consumer Electron..

[8]  V. S. K. Reddy,et al.  Multilayer reference frame motion estimation for video coding , 2015, Signal Image Video Process..

[9]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[10]  A. Enis Çetin,et al.  Content-based video copy detection based on motion vectors estimated using a lower frame rate , 2014, Signal Image Video Process..

[11]  Fei Zhou,et al.  MvSSIM: A quality assessment index for hyperspectral images , 2018, Neurocomputing.

[12]  Chul Lee,et al.  Motion-Compensated Frame Interpolation Based on Multihypothesis Motion Estimation and Texture Optimization , 2013, IEEE Transactions on Image Processing.

[13]  Cheng Xueqi,et al.  Survey on Big Data System and Analytic Technology , 2014 .