A Feature-based Video Transmission Framework for Visual IoT in Fog Computing Systems

The rapid development of the internet of things (IoT)promotes research in smart city and Fog computing. The vast volume of real-time visual data produced from the tremendous end devices in IoT is a big challenge for the network to transmit and for the data center to store. The typical case is the huge volume of visual data produced by the surveillance cameras in a smart city. In this paper, we consider the problem of how to allocate the calculation ability of the Fog node to handle the surveillance data to obtain low delay meanwhile maintain the video quality. To solve this challenge, we attempt to reduce the tremendous video data using deep learning models in the computational Fog node and optimize the transmission function for high efficiency. To reduce data, we extract video feature and keep salient zones with high resolution meanwhile leave the unavoidable distortion in less important areas. To obtain the least transmission delay under the dynamic bandwidth in Fog computing, we model the transmission delay function and solve it by Lagrangian dual decomposition. We make experiments on public dataset Cityscapes and 4G/LTE Bandwidth Log to evaluate our method. The experiment results show that our feature-based image processing method obtains around 68.7% higher average SSIM (structural similarity index)than the traditional HEVC in the salient zones, and our solution reduces the system delay by 71.02 % comparing with the plain transmission method. It proves our solution reduces the video transmission latency meanwhile keeps the SSIM of salient areas in the video.

[1]  George Papandreou,et al.  Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.

[2]  Jiebo Luo,et al.  VSCC'2017: Visual Analysis for Smart and Connected Communities , 2017, ACM Multimedia.

[3]  Wuzhen Shi,et al.  An End-to-End Compression Framework Based on Convolutional Neural Networks , 2017, 2017 Data Compression Conference (DCC).

[4]  Filip De Turck,et al.  HTTP/2-Based Adaptive Streaming of HEVC Video Over 4G/LTE Networks , 2016, IEEE Communications Letters.

[5]  Qi Tian,et al.  Image Retargeting for Preserving Robust Local Feature: Application to Mobile Visual Search , 2016, IEEE Transactions on Multimedia.

[6]  Alireza Nasiri Avanaki,et al.  Exact global histogram specification optimized for structural similarity , 2008, 0901.0065.

[7]  Yan Ke,et al.  PCA-SIFT: a more distinctive representation for local image descriptors , 2004, CVPR 2004.

[8]  Sebastian Ramos,et al.  The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Erik Blasch,et al.  Enabling Smart Urban Surveillance at The Edge , 2017, 2017 IEEE International Conference on Smart Cloud (SmartCloud).

[10]  Raja Lavanya,et al.  Fog Computing and Its Role in the Internet of Things , 2019, Advances in Computer and Electrical Engineering.

[11]  Erik Blasch,et al.  Degree of nonlinearity (DoN) measure for target tracking in videos , 2016, 2016 19th International Conference on Information Fusion (FUSION).

[12]  Xiaofeng Tao,et al.  Mobile Edge Computing Enhanced Adaptive Bitrate Video Delivery With Joint Cache and Radio Resource Allocation , 2017, IEEE Access.

[13]  Ralph R. Martin,et al.  A Shape‐Preserving Approach to Image Resizing , 2009, Comput. Graph. Forum.

[14]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .