Stereo mosaicking and 3D-video for singleview HDTV aerial sequences using a low bit rate ROI coding framework

Low bit rate coding systems for transmitting high-quality aerial surveillance video captured from UAVs are of high interest. One way to achieve high quality at a low bit rate is to assume that the earth's surface is planar, which is a valid approximation for sequences captured at high flight altitudes. Such systems transmit only the area of the current frame that is not contained in the previous frames (the New Area) and reconstruct the already known areas by means of Global Motion Compensation (GMC) at the decoder. Although the bit rate can be reduced significantly compared to standardized video coders, stereo video cannot be reconstructed at the decoder: each image pixel is transmitted only once, so no motion parallax of objects can be observed in the reconstructed video. In this paper we present a coding system for stereo video reconstruction at very low bit rates. On board the UAV, we use the camera path estimated from the image data to create a second view from a virtual camera. We derive convenient baseline distances and demonstrate the resulting perceptually good stereo impression for different test sequences. Following the coding concept described above, we transmit a second New Area (New Area 2) in addition to the original New Area. By doubling the bit rate to about 2 Mbit/s for a reasonable video quality of more than 38 dB, while still saving more than 85% BD-rate compared to common HEVC coding, we are able to reconstruct a full HDTV (30 fps) stereo video at the decoder.
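
The New Area idea can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes the planar-earth motion between two frames is given as a 3x3 homography that maps current-frame pixel coordinates into the previous frame, and it marks as New Area every current-frame pixel whose mapped position falls outside the previous frame. The function name `new_area_mask`, the small frame size, and the pure-translation homography are hypothetical choices made for illustration. The sketch compares against a single previous frame only, whereas the system described above considers all previous frames.

```python
import numpy as np

def new_area_mask(h_cur_to_prev, width, height):
    """Boolean mask (height x width): True where a current-frame pixel is not
    covered by the previous frame under the given planar homography."""
    # Homogeneous coordinates of every pixel in the current frame (3 x N).
    xs, ys = np.meshgrid(np.arange(width), np.arange(height))
    pts = np.stack([xs.ravel(), ys.ravel(), np.ones(xs.size)])

    # Project into the previous frame and dehomogenize.
    proj = h_cur_to_prev @ pts
    u = proj[0] / proj[2]
    v = proj[1] / proj[2]

    # A pixel is already known if it lands inside the previous frame.
    inside = (u >= 0) & (u <= width - 1) & (v >= 0) & (v <= height - 1)
    return ~inside.reshape(height, width)

# Hypothetical example: the scene content shifts by 3 pixels between frames,
# so a strip of 3 columns enters the current frame.
H = np.array([[1.0, 0.0, -3.0],
              [0.0, 1.0,  0.0],
              [0.0, 0.0,  1.0]])
mask = new_area_mask(H, width=16, height=8)
new_pixels = int(mask.sum())
```

In this toy case only the strip entering the frame would need to be transmitted; the remaining pixels could be reconstructed at the decoder by motion-compensating content that is already known.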
