Paper information - Reconstruction Algorithm for Lost Frame of Multiview Videos in Wireless Multimedia Sensor Network Based on Deep Learning Multilayer Perceptron Regression


Wireless multimedia sensor networks (WMSNs) are important for environmental monitoring. When the sensors are cameras, the network can be regarded as a multiview video system. Packet loss may occur when the multiview videos are transmitted wirelessly. When video frames are lost during transmission, the decoder needs a frame reconstruction method to estimate the missing pixels. This work presents a deep-learning-based reconstruction algorithm for lost frames of multiview videos in a WMSN. A novel pixel estimation algorithm is developed using multilayer perceptron regression (MPR), a deep learning method. Furthermore, a modified inpainting method is proposed that uses information from an optical flow algorithm applied to the neighboring available frames. Compared with the state-of-the-art method, the proposed MPR method combined with the traditional inpainting method increased the average peak signal-to-noise ratio (PSNR) by up to 5.62 dB. The combination of the proposed MPR method with the proposed inpainting method outperformed the previously proposed combination by up to 8.32 dB on average, showing the significance of the proposed inpainting method.
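The results above are reported as gains in peak signal-to-noise ratio. As a reading aid, the following is a minimal sketch of the standard PSNR definition, 10·log10(peak² / MSE), and is not the authors' evaluation code. The function name `psnr`, the list-of-rows frame representation, and the 8-bit peak value of 255 are assumptions made for illustration.

```python
import math


def psnr(reference, reconstructed, peak=255.0):
    """Return the PSNR in dB between two equally sized frames.

    Frames are 2-D lists (rows of pixel intensities). `peak` is the maximum
    possible pixel value; 255 assumes 8-bit samples (an assumption here).
    """
    # Flatten both frames so they can be compared pixel by pixel.
    ref = [p for row in reference for p in row]
    rec = [p for row in reconstructed for p in row]
    if not ref or len(ref) != len(rec):
        raise ValueError("frames must be non-empty and the same size")

    # Mean squared error between the original and the reconstructed frame.
    mse = sum((a - b) ** 2 for a, b in zip(ref, rec)) / len(ref)
    if mse == 0:
        return math.inf  # identical frames: PSNR is unbounded

    return 10.0 * math.log10(peak ** 2 / mse)


if __name__ == "__main__":
    original = [[10, 20], [30, 40]]
    estimate = [[11, 21], [31, 41]]  # every pixel off by one, so MSE = 1
    print(round(psnr(original, estimate), 2))
```

Because PSNR is logarithmic in the mean squared error, a gain of a few dB on a single frame corresponds to a multiplicative reduction in that frame's error. Averages over many frames, as reported here, do not translate so directly.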
