Detecting non-transient anomalies in visual information using neural networks

We address the problem of detecting non-transient anomalies in visual information. By non-transient anomalies we mean changes in the way environments look that are persistent across time. Such changes may include leaving unattended bags at airport corridors, putting graffiti in building walls or damaging public property. Detecting non-transient anomalies is critical to security and surveillance in indoor and outdoor environments. We argue that existing off-the-shelf solutions to computer vision problems (e.g., image recognition, gesture recognition, text recognition) are not the most efficient when applied to detecting non-transient anomalies due to their associated computational overhead. In this paper we present a neural network-based architecture that addresses some of the limitations of the state of the art. To speed up computations, our architecture supports the processing of a large number of neurons in parallel. To reduce computational overheads, our architecture omits some of the Gaussian kernel-based feature extraction tasks performed by other systems. To classify visual anomalies as non-transient, our architecture uses a codebook-based algorithm which builds a history profile for every image segment. We describe our architecture and present some performance analysis.

[1]  Lars Bretzner,et al.  Hand gesture recognition using multi-scale colour features, hierarchical models and particle filtering , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[2]  Ioannis Pitas,et al.  Median radial basis function neural network , 1996, IEEE Trans. Neural Networks.

[3]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[4]  Sotirios P. Chatzis,et al.  A framework for robust visual behavior recognition based on holistic representations and multicamera information fusion , 2010 .

[5]  J. Nazuno Haykin, Simon. Neural networks: A comprehensive foundation, Prentice Hall, Inc. Segunda Edición, 1999 , 2000 .

[6]  Tony Lindeberg,et al.  Principles for Automatic Scale Selection , 1999 .

[7]  Jan-Michael Frahm,et al.  Feature tracking and matching in video using programmable graphics hardware , 2007, Machine Vision and Applications.

[8]  Gian Luca Foresti A real-time system for video surveillance of unattended outdoor environments , 1998, IEEE Trans. Circuits Syst. Video Technol..

[9]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[10]  Jake K. Aggarwal,et al.  Object tracking in an outdoor environment using fusion of features and cameras , 2006, Image Vis. Comput..

[11]  Sotirios Chatzis,et al.  Robust Visual Behavior Recognition , 2010, IEEE Signal Processing Magazine.

[12]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[13]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[14]  Keiichi Abe,et al.  Topological structural analysis of digitized binary images by border following , 1985, Comput. Vis. Graph. Image Process..