An H.264/AVC to HEVC video transcoder based on mode mapping

The emerging video coding standard, HEVC, was developed to replace the current standard, H.264/AVC. However, in order to promote inter-operability with existing systems using the H.264/AVC, transcoding from H.264/AVC to the HEVC codec is highly needed. This paper presents a transcoding solution that uses machine learning techniques in order to map H.264/AVC macroblocks into HEVC coding units (CUs). Two alternatives to build the machine learning model are evaluated. The first uses a static training, where the model is built offline and used to transcode any video sequence. The other uses a dynamic training, with two well-defined stages: a training stage and a transcoding stage. In the training stage, full re-encoding is performed while the H.264/AVC and the HEVC information are gathered. This information is then used to build a model, which is used in the transcoding stage to classify the HEVC CU partitioning. Both solutions are tested with well-known video sequences and evaluated in terms of rate-distortion (RD) and complexity. The proposed method is on average 2.26 times faster than the trivial transcoder using fast motion estimation, while yielding a RD loss of only 3.6% in terms of bitrate.

[1]  Yu Sun,et al.  Video transcoding: an overview of various techniques and research issues , 2005, IEEE Transactions on Multimedia.

[2]  Chen-Hsiu Huang Video Transcoding Architectures and Techniques : An Overview , 2003 .

[3]  Hari Kalva,et al.  Improved machine learning techniques for low complexity MPEG-2 to H.264 transcoding using optimized codecs , 2009, 2009 Digest of Technical Papers International Conference on Consumer Electronics.

[4]  Eduardo Peixoto,et al.  A complexity-scalable transcoder from H.264/AVC to the new HEVC codec , 2012, 2012 19th IEEE International Conference on Image Processing.

[5]  Xiaoyan Sun,et al.  Fast H.264/MPEG-4 AVC Transcoding Using Power-Spectrum Based Rate-Distortion Optimization , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Khaled Assaleh,et al.  Feature modeling using polynomial classifiers and stepwise regression , 2010, Neurocomputing.

[7]  Ming-Ting Sun,et al.  Digital Video Transcoding , 2005, Proceedings of the IEEE.

[8]  Eduardo Peixoto Fernandes da Silva Advanced heterogeneous video transcoding , 2012 .

[9]  José Luis Martínez,et al.  An MPEG-2 to H.264 Video Transcoder in the Baseline Profile , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Bin Li,et al.  Fast Transcoding from H.264 AVC to High Efficiency Video Coding , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[11]  K. R. Rao,et al.  High Efficiency Video Coding(HEVC) , 2014 .

[12]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[13]  Pedro Cuenca,et al.  Very low complexity MPEG-2 to H.264 transcoding using machine learning , 2006, MM '06.

[14]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.