A Constrained Spatial-Temporal Frame Rate Control Model of Conversational Video for Multipath Transport System

Conversational video service is becoming increasingly popular on the Internet. However, higher video resolution and frame rate are often accompanied by enlarging the bit rate of the video coding output, and further lead to a large bandwidth requirement. In this paper, we propose a constrained spatial-temporal frame rate control model for conversational video service, which combines video coding with the feedback of data transmission in multipath transmission to enhance rate control. The goal of the model is to optimize the constraint relationship among frame rate, resolution and bitrate, and achieve a better QoE of conversational video service. The target bitrate and frame rate are determined on the fly by considering the network condition. Moreover, the model introduces a new concept termed as region rate, which allows different regions (region of interests and non-region of interests) of a frame to have different rates in accordance with their importance. Furthermore, the paper adopts the perceptual video compression method to improve the perceptual quality, which allocates more resources to the ROIs and selects paths with appropriate status to transmit ROIs data. The experimental results show that the proposed method could control the bit rate while ensuring the video quality and realize the joint optimization of frame rate, resolution and bit rate in comparison to existing approaches.

[1]  Shao-Yi Chien,et al.  Region-Based perceptual quality regulable bit allocation and rate control for video coding applications , 2012, 2012 Visual Communications and Image Processing.

[2]  Touradj Ebrahimi,et al.  Perceptual Video Compression: A Survey , 2012, IEEE Journal of Selected Topics in Signal Processing.

[3]  Simon Lucey,et al.  Face alignment through subspace constrained mean-shifts , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[4]  Zhengguo Li,et al.  Region-of-Interest Based Resource Allocation for Conversational Video Communication of H.264/AVC , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Touradj Ebrahimi,et al.  Semantic video analysis for adaptive content delivery and automatic description , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Zhigang Jin,et al.  A Adaptive Frame Rate QoS Control in Wireless Video Surveillance System , 2009, 2009 5th International Conference on Wireless Communications, Networking and Mobile Computing.

[7]  Antti Ylä-Jääski,et al.  Multipath Transmission for the Internet: A Survey , 2016, IEEE Communications Surveys & Tutorials.

[8]  Weimin Lei,et al.  A general framework of multipath transport system based on application-level relay , 2014, Comput. Commun..

[9]  Shaul Hochstein,et al.  At first sight: A high-level pop out effect for faces , 2005, Vision Research.

[10]  Danda B. Rawat,et al.  Software Defined Networking Architecture, Security and Energy Efficiency: A Survey , 2017, IEEE Communications Surveys & Tutorials.

[11]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Sungjoo Yoo,et al.  Dual Motion Estimation for Frame Rate Up-Conversion , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Jian Sun,et al.  Face Alignment at 3000 FPS via Regressing Local Binary Features , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Shengxi Li,et al.  Region-of-Interest Based Conversational HEVC Coding with Hierarchical Perception Model of Face , 2014, IEEE Journal of Selected Topics in Signal Processing.

[15]  Weisi Lin,et al.  Rate control for videophone using local perceptual cues , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Bharadwaj S. Amrutur,et al.  Skip Decision and Reference Frame Selection for Low-Complexity H.264/AVC Surveillance Video Coding , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  S. Mohamad R. Soroushmehr,et al.  Frame rate up-conversion using nonparametric estimator , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[18]  Chang-Su Kim,et al.  Efficient stereo video coding based on frame skipping for real-time mobile applications , 2008, IEEE Transactions on Consumer Electronics.

[19]  Shaowei Liu,et al.  A Framework of Multipath Transport System Based on Application-Level Relay (MPTS-AR) , 2018 .

[20]  Haoshan Shi,et al.  Dynamic Frame-Skipping Scheme for Live Video Encoders , 2010, 2010 International Conference on Multimedia Technology.

[21]  Xiuchang Zhu,et al.  SSIM-based content adaptive frame skipping for low bit rate H.264 video coding , 2010, 2010 IEEE 12th International Conference on Communication Technology.