An Analytical Study of CNN-based Video Frame Interpolation Techniques

Videos are made up of a series of continuous image frames. Given these consecutive two frames, video frame interpolation techniques aim for synthesizing video frame or frames that lie temporally in between the given frames, and that is/are spatially adjusted. Video frame interpolation finds various use cases in computer vision activities including but not limited to video restoration i.e., to generate clear video frames in sections where a video is blurred, generating video animations (software editing tools), view synthesis and so on. Relying upon standard optical flow-based video frame interpolation methods is difficult as they often produce blurry results in the cases of occlusion and extensive motion handling between the objects in the frames. Remarkable achievements have been made in the recent past to capture varied large scale motion using deep convolution networks. In this paper, we will discuss how some deep convolution networks based methods have evolved over the years to improve the quality of the synthesized frames both qualitatively and quantitatively for video frame interpolation task.

[1]  Alain Trémeau,et al.  Residual Conv-Deconv Grid Network for Semantic Segmentation , 2017, BMVC.

[2]  Xiaoou Tang,et al.  Video Frame Synthesis Using Deep Voxel Flow , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[3]  Yann LeCun,et al.  Deep multi-scale video prediction beyond mean square error , 2015, ICLR.

[4]  Richard Szeliski,et al.  A Database and Evaluation Methodology for Optical Flow , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[5]  Xiaoyun Zhang,et al.  Depth-Aware Video Frame Interpolation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Jan Kautz,et al.  Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Pavel Zemcík,et al.  Compression Artifacts Removal Using Convolutional Neural Networks , 2016, J. WSCG.

[8]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Zhiyong Gao,et al.  MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Hongdong Li,et al.  Learning Image Matching by Simply Watching Video , 2016, ECCV.

[11]  Cordelia Schmid,et al.  EpicFlow: Edge-preserving interpolation of correspondences for optical flow , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[13]  Jiajun Wu,et al.  Video Enhancement with Task-Oriented Flow , 2018, International Journal of Computer Vision.

[14]  Yanli Li,et al.  A Spatial Prediction-Based Motion-Compensated Frame Rate Up-Conversion , 2019, Future Internet.

[15]  Jan Kautz,et al.  PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16]  Feng Liu,et al.  Video Frame Interpolation via Adaptive Convolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[18]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[19]  Yung-Yu Chuang,et al.  Deep Video Frame Interpolation Using Cyclic Frame Generation , 2019, AAAI.

[20]  Yasuyuki Matsushita,et al.  Motion detail preserving optical flow estimation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Vijayakumar T Dr,et al.  COMPARATIVE STUDY OF CAPSULE NEURAL NETWORK IN VARIOUS APPLICATIONS , 2019, Journal of Artificial Intelligence and Capsule Networks.

[23]  Markus H. Gross,et al.  PhaseNet for Video Frame Interpolation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Thomas Brox,et al.  FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Xiangzhong Fang,et al.  Motion-Compensated Frame Interpolation With Multiframe-Based Occlusion Handling , 2016, Journal of Display Technology.

[26]  Taejeong Kim,et al.  MAP-Based Motion Refinement Algorithm for Block-Based Motion-Compensated Frame Interpolation , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[27]  Qionghai Dai,et al.  A Polynomial Approximation Motion Estimation Model for Motion-Compensated Frame Interpolation , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  Feng Liu,et al.  Context-Aware Synthesis for Video Frame Interpolation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[29]  Mubarak Shah,et al.  UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.

[30]  Alexei A. Efros,et al.  Light field video capture using a learning-based hybrid imaging system , 2017, ACM Trans. Graph..

[31]  Weifeng Chen,et al.  Single-Image Depth Perception in the Wild , 2016, NIPS.

[32]  Thomas Brox,et al.  FlowNet: Learning Optical Flow with Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[33]  Feng Liu,et al.  Video Frame Interpolation via Adaptive Separable Convolution , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[34]  Michael J. Black,et al.  Optical Flow Estimation Using a Spatial Pyramid Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Tomer Peleg,et al.  IM-Net for High Resolution Video Frame Interpolation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Z. Gong,et al.  Video Frame Interpolation and Extrapolation , 2017 .