Data fusion for unsupervised video object detection, tracking and geo-positioning

In this work we describe a system and propose a novel algorithm for moving object detection and tracking based on video feed. Apart of many well-known algorithms, it performs detection in unsupervised style, using velocity criteria for the objects detection. The algorithm utilises data from a single camera and Inertial Measurement Unit (IMU) sensors and performs fusion of video and sensory data captured from the UAV. The algorithm includes object tracking and detection, augmented by object geographical co-ordinates estimation. The algorithm can be generalised for any particular video sensor and is not restricted to any specific applications. For object tracking, Bayesian filter scheme combined with approximate inference is utilised. Object localisation in real-world co-ordinates is based on the tracking results and IMU sensor measurements.

[1]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[2]  Gong Zhang,et al.  Unsupervised Target Detection in SAR Images Using Scattering Center Model and Mean Shift Clustering Algorithm , 2012 .

[3]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[4]  Thierry Bouwmans,et al.  Background Subtraction for Visual Surveillance , 2012, Handbook of Soft Computing for Video Surveillance.

[5]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[6]  J. Wishart THE GENERALISED PRODUCT MOMENT DISTRIBUTION IN SAMPLES FROM A NORMAL MULTIVARIATE POPULATION , 1928 .

[7]  Daniel Cremers,et al.  Box-particle PHD filter for multi-target tracking , 2012, 2012 15th International Conference on Information Fusion.

[8]  Yaakov Bar-Shalom,et al.  Multitarget-multisensor tracking: Advanced applications , 1989 .

[9]  W. Kabsch A solution for the best rotation to relate two sets of vectors , 1976 .

[10]  R. L. Stratonovich CONDITIONAL MARKOV PROCESSES , 1960 .

[11]  R. Mahler A Theoretical Foundation for the Stein-Winter "Probability Hypothesis Density (PHD)" Multitarget Tracking Approach , 2000 .

[12]  Chenhui Yang,et al.  Automatic detection and tracking of multiple interacting targets from a moving platform , 2014 .

[13]  Tong Heng Lee,et al.  Coordinate Systems and Transformations , 2011 .

[14]  Allan D. Jepson,et al.  Fast Rigid Motion Segmentation via Incrementally-Complex Local Models , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  W. Kabsch A discussion of the solution for the best rotation to relate two sets of vectors , 1978 .

[16]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[17]  Zdenek Kalal,et al.  Tracking-Learning-Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Massimo Piccardi,et al.  Background subtraction techniques: a review , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[19]  Jiri Matas,et al.  Forward-Backward Error: Automatic Detection of Tracking Failures , 2010, 2010 20th International Conference on Pattern Recognition.

[20]  Hugh Durrant-Whyte,et al.  Simultaneous localization and mapping (SLAM): part II , 2006 .

[21]  Gil Friedrich,et al.  Seeing People in the Dark: Face Recognition in Infrared Images , 2002, Biologically Motivated Computer Vision.

[22]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[23]  Robert T. Collins,et al.  An Open Source Tracking Testbed and Evaluation Web Site , 2005 .

[24]  T. Vincenty DIRECT AND INVERSE SOLUTIONS OF GEODESICS ON THE ELLIPSOID WITH APPLICATION OF NESTED EQUATIONS , 1975 .

[25]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[26]  L Sirovich,et al.  Low-dimensional procedure for the characterization of human faces. , 1987, Journal of the Optical Society of America. A, Optics and image science.