Using machine learning for fast intra MB coding in H.264

H.264 is a highly efficient and complex video codec. The complexity of the codec makes it difficult to use all its features in resource constrained mobile devices. This paper presents a machine learning approach to reducing the complexity of Intra encoding in H.264. Determining the macro block coding mode requires substantial computational resources in H.264 video encoding. The goal of this work to reduce MB mode computation from a search operation, as is done in the encoders today, to a computation. We have developed a methodology based on machine learning that computes the MB coding mode instead of searching for the best match thus reducing the complexity of Intra 16x16 coding by 17 times and Intra 4x4 MB coding by 12.5 times. The proposed approach uses simple mean value metrics at the block level to characterize the coding complexity of a macro block. A generic J4.8 classifier is used to build the decision trees to quickly determine the mode. We present a methodology for Intra MB coding. The results show that intra MB mode can be determined with over 90% accuracy. The proposed can also be used for determining MB prediction modes with an accuracy varying between 70% and 80%.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  Oscar C. Au,et al.  Efficient Intra-Prediction Mode Selection for 4x4 Blocks in H.264 , 2003 .

[3]  Michèle Sebag,et al.  C4.5 competence map: a phase transition-inspired approach , 2004, ICML '04.

[4]  C.-C. Jay Kuo,et al.  Feature-based intra-prediction mode decision for H.264 , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[5]  Fast Intra-Frame Mode Selection for H . 264 , 2004 .

[6]  Susanto Rahardja,et al.  Fast intra mode decision algorithm for H.264-AVC video coding , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[7]  Pedro Cuenca,et al.  Speeding-Up the Macroblock Partition Mode Decision in MPEG-2/H.264 Transcoding , 2006, 2006 International Conference on Image Processing.

[8]  Pedro Cuenca,et al.  Very low complexity MPEG-2 to H.264 transcoding using machine learning , 2006, MM '06.