Viewpoint distortion compensation in practical surveillance systems

Our aim is to estimate the perspective-effected geometric distortion of a scene from a video feed. In contrast to all previous work we wish to achieve this using from low-level, spatio-temporally local motion features used in commercial semi-automatic surveillance systems. We: (i) describe a dense algorithm which uses motion features to estimate the perspective distortion at each image locus and then polls all such local estimates to arrive at the globally best estimate, (ii) present an alternative coarse algorithm which subdivides the image frame into blocks, and uses motion features to derive block-specific motion characteristics and constrain the relationships between these characteristics, with the perspective estimate emerging as a result of a global optimization scheme, and (iii) report the results of an evaluation using nine large sets acquired using existing close-circuit television (CCTV) cameras. Our findings demonstrate that both of the proposed methods are successful, their accuracy matching that of human labelling using complete visual data.

[1]  Xavier Armangué,et al.  A review on egomotion by means of differential epipolar geometry applied to the movement of a mobile robot , 2003, Pattern Recognit..

[2]  Adam Herout,et al.  Five Shades of Grey for Fast and Reliable Camera Pose Estimation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Svetha Venkatesh,et al.  Detection of Dynamic Background Due to Swaying Movements From Motion Features , 2015, IEEE Transactions on Image Processing.

[4]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  Ognjen Arandjelovic,et al.  Contextually Learnt Detection of Unusual Motion-Based Behaviour in Crowded Public Spaces , 2013, ISCIS.

[6]  Svetha Venkatesh,et al.  Two Maximum Entropy-Based Algorithms for Running Quantile Estimation in Nonstationary Data Streams , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Ognjen Arandjelovic,et al.  Crowd Detection from Still Images , 2008, BMVC.

[8]  Pascal Vasseur,et al.  Globally optimal line clustering and vanishing point estimation in Manhattan world , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Liang-Tien Chia,et al.  Estimating camera pose from a single urban ground-view omnidirectional image and a 2D building outline map , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Du Tran,et al.  Human Activity Recognition with Metric Learning , 2008, ECCV.

[11]  Luc Van Gool,et al.  3D Urban Scene Modeling Integrating Recognition and Reconstruction , 2008, International Journal of Computer Vision.

[12]  Yiannis Aloimonos,et al.  A Probabilistic Framework for Correspondence and Egomotion , 2006, WDV.

[13]  Alexei A. Efros,et al.  Geometric context from a single image , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[14]  Svetha Venkatesh,et al.  Stream Quantiles via Maximal Entropy Histograms , 2014, ICONIP.

[15]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Svetha Venkatesh,et al.  CCTV Scene Perspective Distortion Estimation From Low-Level Motion Features , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Joel A. Hesch,et al.  A Direct Least-Squares (DLS) method for PnP , 2011, 2011 International Conference on Computer Vision.

[18]  Didier Henrion,et al.  Globally Optimal Estimates for Geometric Reconstruction Problems , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[19]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[20]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[21]  Bhavani M. Thuraisingham,et al.  A framework for a video analysis tool for suspicious event detection , 2005, MDM@KDD.

[22]  Ognjen Arandjelovic,et al.  Multiple-object Tracking in Cluttered and Crowded Public Spaces , 2010, ISVC.

[23]  Svetha Venkatesh,et al.  The adaptable buffer algorithm for high quantile estimation in non-stationary data streams , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).