Moving object tracking for aerial video coding using linear motion prediction and block matching

Region of Interest (ROI) coding is a common method for data reduction in scenarios where bandwidth is crucial like in aerial video surveillance from Unmanned Aerial Vehicles (UAVs). In order to save bits, non-ROI areas are typically reduced in quality or not transmitted at all and thus, an accurate ROI classification is mandatory. Moving objects (MOs) are often considered as ROIs and consequently have to be accurately detected onboard. However, common detection approaches either rely on computationally demanding processing which is not available at small UAVs with only limited energy, are model based or cannot provide a sufficient detection precision. While not detected MOs lead to a degraded representation at the decoder, erroneously detected MOs lead to an unnecessary high bit rate. We tackle all these issues utilizing an efficient object proposal computation. Based on a dual-threshold strategy applied to image differences, we propose a linear prediction-supported block matcher. Compared to a simple thresholding approach, it shows superior performance and is robust to threshold tuning. By integrating superpixels into the framework, we further recover the complete shape of the MOs. Finally, an efficient tracking-by-detection system is employed to produce accurate detections from the proposals, thereby recovering missed MOs and denying wrong proposals, making the coding more efficient. We achieve an improved detection precision of up to 76 % compared to a simple difference image-based approach. By using a general ROI coding framework we reduce the bit rate of our test set by 70 % compared to common HEVC.

[1]  Mehran Yazdi,et al.  A block matching based method for moving object detection in active camera , 2013, The 5th Conference on Information and Knowledge Technology.

[2]  Martial Hebert,et al.  A Flow-Based Approach to Vehicle Detection and Background Mosaicking in Airborne Video , 2005, CVPR.

[3]  Jörn Ostermann,et al.  Codec independent region of interest video coding using a joint pre- and postprocessing framework , 2016, 2016 IEEE International Conference on Multimedia and Expo (ICME).

[4]  Marco Munderloh,et al.  Detection of moving objects for aerial surveillance of arbitrary terrain , 2016 .

[5]  Bodo Rosenhahn,et al.  Efficient Multiple People Tracking Using Minimum Cost Arborescences , 2014, GCPR.

[6]  Mårten Sjöström,et al.  Spatio-temporal filter for ROI video coding , 2006, 2006 14th European Signal Processing Conference.

[7]  Ming-Chieh Chi,et al.  ROI video coding based on H.263+ with robust skin-color detection technique , 2003, IEEE Trans. Consumer Electron..

[8]  Jörn Ostermann,et al.  Low bit rate ROI based video coding for HDTV aerial surveillance video sequences , 2011, CVPR 2011 WORKSHOPS.

[9]  Bodo Rosenhahn,et al.  Everybody needs somebody: Modeling social and grouping behavior on a linear programming multiple people tracker , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[10]  Michael Teutsch,et al.  Detection, Segmentation, and Tracking of Moving Objects in UAV Videos , 2012, 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance.

[11]  Rainer Stiefelhagen,et al.  Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics , 2008, EURASIP J. Image Video Process..

[12]  Gérard G. Medioni,et al.  Detection and tracking of moving objects from a moving platform in presence of strong parallax , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[13]  Bodo Rosenhahn,et al.  Temporally Consistent Superpixels , 2013, 2013 IEEE International Conference on Computer Vision.

[14]  Gary J. Sullivan,et al.  Comparison of the Coding Efficiency of Video Coding Standards—Including High Efficiency Video Coding (HEVC) , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  LiXuelong,et al.  KLT feature based vehicle detection and tracking in airborne videos , 2011 .

[16]  Stefanos D. Kollias,et al.  Low bit-rate coding of image sequences using adaptive regions of interest , 1998, IEEE Trans. Circuits Syst. Video Technol..

[17]  Joern Ostermann,et al.  Mesh-based piecewise planar motion compensation and optical flow clustering for ROI coding , 2015, APSIPA Transactions on Signal and Information Processing.

[18]  Witold Czajewski,et al.  Moving Objects Detection and Tracking Framework for UAV-based Surveillance , 2010, 2010 Fourth Pacific-Rim Symposium on Image and Video Technology.

[19]  Bharadwaj S. Amrutur,et al.  Skip Decision and Reference Frame Selection for Low-Complexity H.264/AVC Surveillance Video Coding , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Jörn Ostermann,et al.  Illumination change robust, codec independent lowbit rate coding of stereo from singleview aerial video , 2016, 2016 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[21]  Branko Ristic,et al.  Moving Target Indication and Tracking from Moving Sensors , 2005, Digital Image Computing: Techniques and Applications (DICTA'05).

[22]  Robert A. Schowengerdt,et al.  Airborne video registration and traffic-flow parameter estimation , 2005, IEEE Transactions on Intelligent Transportation Systems.

[23]  Jörn Ostermann,et al.  Region of Interest Coding for Aerial Video Sequences Using Landscape Models , 2013 .