Learning to Use Scene Context for Object Classification in Surveillance

Object classification in far-field outdoor video surveillance is a challenging problem because of low resolution, presence of shadows and projective image distortion. We present a classification system that learns to use scene context variables (such as position and orientation of objects), in addition to object variables (such as shape and size), to improve its performance in an arbitrary scene. The key feature of our system is that it needs only a small number of labeled examples from a few scenes, but extends well to novel scenes by extracting contextual information from unlabeled data. A three-step bootstrapping algorithm for adapting classifiers to a novel scene is proposed. A baseline classifier is first trained using object-specific features. This classifier is applied to a novel scene to calculate ‘labels’ for unlabeled data. Some of these labels are then incorporated into the training set of a scene-dependent classifier which uses both object and context features. Experimental results demonstrate the effectiveness of our adaptive classifiers for multiple urban scenes.

[1]  Takeo Kanade,et al.  Advances in Cooperative Multi-Sensor Video Surveillance , 1999 .

[2]  Mubarak Shah,et al.  Tracking and Object Classification for Automated Surveillance , 2002, ECCV.

[3]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[5]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[6]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[7]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[8]  Ying Wu,et al.  Self-Supervised Learning for Object Recognition based on Kernel Discriminant-EM Algorithm , 2001, ICCV.

[9]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[10]  Hironobu Fujiyoshi,et al.  Moving target classification and tracking from real-time video , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[11]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.