Rotational motion model for temporal prediction in 360 video coding

The recent boom in the field of virtual and augmented reality has dramatically increased the prevalence of spherical video. Given the enormous amount of data consumed by spherical video, it is critical to achieve efficient compression for storage and transmission. Prevalent approaches simply project (via different geometries) the spherical video onto planes for processing with traditional 2D video coding standards. However, such approaches are significantly sub-optimal as standard video coders only allow for block translations in the critical tool of motion compensated prediction, which is incompatible with the expected motion in projected spherical video. Specifically, the effective sampling density varies over the sphere and the resulting locally varying warping yields complex non-linear motion in the projected domain. Hence, translation in the projected domain does not preserve an object's shape and size on the sphere, and its corresponding motion vector does not have a useful physical interpretation. Instead, we propose to characterize the motion directly on the sphere with a rotational motion model, specifically, in terms of sphere rotations along geodesics. This model preserves object shape and size on the sphere. A motion vector in this model implicitly specifies an axis of rotation and the degree of rotation about that axis, to convey the actual motion of objects on the sphere. Complementary to the novel motion model, we further propose an effective motion search technique that is tailored to the sphere's geometry. Experimental results demonstrate that the proposed framework achieves significant gains over prevalent motion models, across various projection geometries.

[1]  Madhukar Budagavi,et al.  Motion estimation and compensation for fisheye warped video , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[2]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[3]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[4]  Zehdreh Allen-Lafayette,et al.  Flattening the Earth, Two Thousand Years of Map Projections , 1998 .

[5]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Matthias Narroschke,et al.  Extending HEVC by an affine motion model , 2013, 2013 Picture Coding Symposium (PCS).

[7]  Zhu Li,et al.  Projection based advanced motion model for cubic mapping for 360-degree video , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[8]  J. Snyder Flattening the Earth: Two Thousand Years of Map Projections , 1994 .

[9]  Yao Zhao,et al.  Control-Point Representation and Differential Coding Affine-Motion Compensation , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Pascal Frossard,et al.  Multiresolution motion estimation for omnidirectional images , 2005, 2005 13th European Signal Processing Conference.

[11]  Miska M. Hannuksela,et al.  Fisheye video coding using elastic motion compensated reference frames , 2016, 2016 IEEE International Conference on Image Processing (ICIP).