Fast H.264 Encoding Based on Statistical Learning

H.264/AVC, the latest video coding standard of the Joint Video Team, greatly outperforms previous standards in terms of coding bitrate and video quality, because it adopts several new techniques. However, the computational complexity is also considerably increased due to these new components. In this paper, we propose fast algorithms based on statistical learning to reduce the computational cost involved in three main components in H.264 encoder, i.e., intermode decision, multi-reference motion estimation (ME), and intra-mode prediction. First, representative features are extracted to build the learning models. Then, an offline pre-classification approach is used to determine the best results from the extracted features, thus a significant amount of computation is reduced based on the classification strategy. The proposed statistical learning-based approach is applied to the aforementioned three main components in H.264 encoder to speed up the computation. Experimental results show that the ME time of the proposed system is significantly sped up with 12 times faster than the conventional fast ME algorithm of H.264, and the total encoding time of the proposed encoder is greatly reduced with about four times faster than the fast encoder EPZS in the H.264 reference code with negligible video quality degradation.

[1]  Jhing-Fa Wang,et al.  A novel fast algorithm for intra mode decision in H.264/AVC encoders , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[2]  Liang-Gee Chen,et al.  Analysis and reduction of reference frames for motion estimation in MPEG-4 AVC/JVT/H.264 , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[3]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[4]  Truong Q. Nguyen,et al.  Statistical learning based intra prediction in H.264 , 2008, 2008 15th IEEE International Conference on Image Processing.

[5]  Yuzhuo Zhong,et al.  Fast mode decision for P-Slices in H.264/AVC based on probabilistic learning , 2009, 2009 11th International Conference on Advanced Communication Technology.

[6]  Tien-Ying Kuo,et al.  Fast Variable Block Size Motion Estimation for H.264 Using Likelihood and Correlation of Motion Field , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Baochun Hou,et al.  A novel fast inter mode decision algorithm based on statistic and adaptive adjustment for H.264/AVC , 2007, 2007 15th International Conference on Software, Telecommunications and Computer Networks.

[8]  Shuyuan Yang,et al.  Fast mode selection scheme for H.264/AVC inter prediction based on statistical learning method , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[9]  Shang-Hong Lai,et al.  Fast Intermode Decision Via Statistical Learning for H.264 Video Coding , 2008, MMM.

[10]  Tien-Ying Kuo,et al.  Efficient Reference Frame Selector for H.264 , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Yongdong Zhang,et al.  Fast 4*4 intra-prediction mode selection for H.264 , 2004, ICME.

[12]  Gary Sullivan,et al.  Recommended Simulation Common Conditions for H.26L Coding Efficiency Experiments on Low Resolution Progressive Scan Source Material , 2001 .

[13]  Xiaoyang Wu,et al.  A Content-adaptive Fast Multiple Reference Frames Motion Estimation in H.264 , 2007, 2007 IEEE International Symposium on Circuits and Systems.

[14]  Susanto Rahardja,et al.  Fast intermode decision in H.264/AVC video coding , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Zhang Yong-dong,et al.  Fast 4/spl times/4 intra-prediction mode selection for H.264 , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[16]  Shang-Hong Lai,et al.  Fast multi-reference motion estimation via statistical learning for H.264/AVC , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[17]  Jill M. Boyce,et al.  Fast mode decision and motion estimation for JVT/H.264 , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[18]  Shang-Hong Lai,et al.  Efficient Intra Mode Decision Via Statistical Learning , 2007, PCM.

[19]  Tian-Sheuan Chang,et al.  Fast three step intra prediction algorithm for 4/spl times/4 blocks in H.264 , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[20]  Peng Wu,et al.  An adaptive fast multiple reference frames selection algorithm for H.264/AVC , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[21]  Susanto Rahardja,et al.  Fast mode decision algorithm for intraprediction in H.264/AVC video coding , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Ming-Ting Sun,et al.  Fast multiple reference frame motion estimation for H.264/AVC , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Dong-Gyu Sim,et al.  Context-adaptive mode selection for intra-block coding in H.264/MPEG-4 Part 10 , 2005, Real Time Imaging.

[24]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[25]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[26]  Ahmed Tamtaoui,et al.  Fast Mode Decision Algorithm for Intra prediction in H.264/AVC Video Coding , 2007 .

[27]  Alexis M. Tourapis,et al.  Enhanced predictive zonal search for single and multiple frame motion estimation , 2002, IS&T/SPIE Electronic Imaging.

[28]  K. R. Ramakrishnan,et al.  Speedup macroblock mode decision in H.264/SVC encoding using cost-sensitive learning , 2010, 2010 Digest of Technical Papers International Conference on Consumer Electronics (ICCE).