A Scalable Massively Parallel Motion and Disparity Estimation Scheme for Multiview Video Coding

Multiview video coding (MVC) has recently received considerable attention. It is proposed as an extension of H.264/Advanced Video Coding (AVC) standard for multiple video source compression. To resolve the extremely high computational complexity of MVC (and in fact other AVC techniques), suitable parallel algorithms need to be developed that are amenable to implementation on low-cost massively parallel architecture, platforms that have found a common place due to recent advances in the parallel computer architecture. The high complexity of MVC is due to its prediction structure, where motion estimation (ME) between the frames and disparity estimation (DE) between the views contribute to more than 99% of overall complexity of the coder. This paper presents the development and implementation of a scalable massively parallel fast search algorithm to significantly reduce the computational cost of ME/DE over the current best available full block matching, and suboptimal fast search algorithms. The proposed massively parallel fast search algorithm (DZfast), when evaluated over eight views, outperforms the existing full search and fast search MVC algorithms by a factor of up to 245.8 and 8.4, respectively. This speedup comes at no or minute loss in rate-distortion performance.

[1]  Hsueh-Ming Hang,et al.  H.264/AVC motion estimation implmentation on Compute Unified Device Architecture (CUDA) , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[2]  Heiko Schwarz,et al.  Analysis of Hierarchical B Pictures and MCTF , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[3]  Kai-Kuang Ma,et al.  A new diamond search algorithm for fast block-matching motion estimation , 2000, IEEE Trans. Image Process..

[4]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[5]  B. Girod,et al.  Multiview Video Compression , 2007, IEEE Signal Processing Magazine.

[6]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[7]  Li Song,et al.  Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform , 2013, 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[8]  José Luis Martínez,et al.  Reducing complexity in H.264/AVC motion estimation by using a GPU , 2011, 2011 IEEE 13th International Workshop on Multimedia Signal Processing.

[9]  Mateo Valero,et al.  Scalability of Macroblock-level Parallelism for H.264 Decoding , 2009, 2009 15th International Conference on Parallel and Distributed Systems.

[10]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Sergio Bampi,et al.  Multi-level pipelined parallel hardware architecture for high throughput motion and disparity estimation in Multiview Video Coding , 2011, 2011 Design, Automation & Test in Europe.

[12]  Gary J. Sullivan,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard , 2011, Proceedings of the IEEE.

[13]  Saeid Nooshabadi,et al.  GPU accelerated motion and disparity estimations for multiview coding , 2013, 2013 IEEE International Conference on Image Processing.

[14]  Aljoscha Smolic,et al.  Efficient Prediction Structures for Multiview Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Alexis M. Tourapis,et al.  Enhanced predictive zonal search for single and multiple frame motion estimation , 2002, IS&T/SPIE Electronic Imaging.

[16]  Moshe Sniedovich Dynamic Programming: Foundations and Principles, Second Edition , 2010 .

[17]  Aljoscha Smolic,et al.  Efficient Compression of Multi-View Video Exploiting Inter-View Dependencies Based on H.264/MPEG4-AVC , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[18]  Marco Mattavelli,et al.  An Introduction to the Special Issue on Algorithm/Architecture Co-Exploration of Visual Computing on Emerging Platforms , 2009 .

[19]  Wei Feng,et al.  A Framework for Heuristic Scheduling for Parallel Processing on Multicore Architecture: A Case Study With Multiview Video Coding , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Hsueh-Ming Hang,et al.  Multiview encoder parallelized fast search realization on NVIDIA CUDA , 2011, 2011 Visual Communications and Image Processing (VCIP).

[21]  Gary J. Sullivan,et al.  Comparison of the Coding Efficiency of Video Coding Standards—Including High Efficiency Video Coding (HEVC) , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Liang-Gee Chen,et al.  A 212 MPixels/s 4096 $\times$ 2160p Multiview Video Encoder Chip for 3D/Quad Full HDTV Applications , 2010, IEEE Journal of Solid-State Circuits.

[23]  Lap-Pui Chau,et al.  Hexagon-based search pattern for fast block motion estimation , 2002, IEEE Trans. Circuits Syst. Video Technol..

[24]  Gwo Giun Lee,et al.  Algorithm/Architecture Co-Exploration of Visual Computing on Emergent Platforms: Overview and Future Prospects , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Ahmed Hemani,et al.  Classification of Massively Parallel Computer Architectures , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[26]  Wesley De Neve,et al.  Parallel Deblocking Filtering in MPEG-4 AVC/H.264 on Massively Parallel Architectures , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[27]  Anantha Chandrakasan,et al.  Multicore Processing and Efficient On-Chip Caching for H.264 and Future Video Decoders , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  M. Sniedovich Dynamic programming : foundations and principles , 2011 .