Paper Information - Cost-Optimized Video Transfer using Real-Time Super Resolution Convolutional Neural Networks

The explosion of video generation and consumption, coupled with an inadequate rise in network bandwidth, has led to network delays and decreased Quality of Experience, limiting the opportunities to tap into the full potential of video data. These deficiencies in network resources, together with a shift to cloud computing models, have created a need to revisit the overall mechanism for transferring and storing videos between edge devices and the cloud. We propose a novel multi-scale real-time super-resolution convolutional neural network that optimizes the entire cost of video transfer with minimal loss of quality and can be used in any application involving the transfer of video data. To achieve this, we develop a cost-optimized video transfer system that optimizes the metrics of video transfer to give the best-quality video output within the user's budget. The model uses convolution blocks for feature extraction and output creation, with multiple sub-pixel convolutions arranged in a novel structure. When upscaling to full High Definition video at 30 fps, the model retained the frame rate while the system achieved savings in transfer time and bandwidth usage. The model was trained on surveillance videos (the VIRAT dataset), but testing on feature films and sports videos gave consistent results, which demonstrates its content invariance. Evaluated over 376 videos, our approach yielded an average quality loss of only 8%, measured by a novel no-reference quality metric that is also proposed in this paper. Additionally, it achieved average network bandwidth savings of 80% and an average reduction in video transfer time of 52%.
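The abstract does not detail how the sub-pixel convolutions are arranged, so the following is only a minimal sketch of the generic rearrangement step that a sub-pixel convolution ends with: r*r low-resolution feature maps are interleaved into one map that is r times larger in each dimension. The function name `pixel_shuffle`, the nested-list representation, and the channel ordering (the one commonly used by deep learning frameworks) are assumptions made for illustration, not the paper's architecture.

```python
def pixel_shuffle(channels, r):
    """Rearrange C*r*r feature maps of size HxW into C maps of size (H*r)x(W*r).

    `channels` is a list of 2-D lists (one per feature map); `r` is the
    upscaling factor. The channel ordering is an assumed convention.
    """
    if len(channels) % (r * r) != 0:
        raise ValueError("channel count must be divisible by r*r")
    c_out = len(channels) // (r * r)
    h, w = len(channels[0]), len(channels[0][0])
    out = [[[0] * (w * r) for _ in range(h * r)] for _ in range(c_out)]
    for c in range(c_out):
        for y in range(h * r):
            for x in range(w * r):
                # The sub-pixel offset (y % r, x % r) selects which input
                # channel supplies this high-resolution pixel.
                src = c * r * r + (y % r) * r + (x % r)
                out[c][y][x] = channels[src][y // r][x // r]
    return out


# Four 1x2 feature maps become a single 2x4 map for r = 2.
low_res = [[[0, 1]], [[10, 11]], [[20, 21]], [[30, 31]]]
high_res = pixel_shuffle(low_res, 2)
print(high_res)  # [[[0, 10, 1, 11], [20, 30, 21, 31]]]
```

Because the convolutions before this step run at the low resolution, most of the computation happens on small feature maps, which is one reason this kind of layer is attractive for real-time upscaling.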
