2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS)

In this paper we propose a new representation of videos with spatiotemporal blocks. After a given video is decomposed into the spatiotemporal blocks, a dimensionality reduction technique is applied to obtain a compact vector representation of each block gray level values. The block vectors provide a joint representation of texture and motion patterns in videos. Our results on PETS repository videos show that detection and tracking of moving objects is substantially improved if based on spatiotemporal blocks instead on pixels. Thus, we go away from the standard input of pixel values that are known to be noisy and the main cause of instability of video analysis algorithms.

[1]  T. List,et al.  Performance evaluating the evaluator , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[2]  Daniela Hall Automatic parameter regulation for a tracking system with an auto-critical function , 2005, Seventh International Workshop on Computer Architecture for Machine Perception (CAMP'05).

[3]  Demetri Terzopoulos,et al.  Cognitive Vision for Autonomous Satellite Rendezvous and Docking , 2005, MVA.

[4]  Mark W. Powell,et al.  Automated performance evaluation of range image segmentation algorithms , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[5]  Robert B. Fisher,et al.  The PETS04 Surveillance Ground-Truth Data Sets , 2004 .

[6]  Jorge S. Marques,et al.  New Performance Evaluation Metrics for Object Detection Algorithms , 2004 .

[7]  James E. Black,et al.  A novel method for video tracking performance evaluation , 2003 .

[8]  Dragoljub Pokrajac,et al.  On hypothesis testing in multidimensional outlier detection , 2003, 6th International Conference on Telecommunications in Modern Satellite, Cable and Broadcasting Service, 2003. TELSIKS 2003..

[9]  Ling Guan,et al.  Centralized peer-to-peer streaming with layered video , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[10]  João M. Lemos,et al.  A Video System for Urban Surveillance: Function Integration and Evaluation , 2003 .

[11]  Sridha Sridharan,et al.  Real-time adaptive background segmentation , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[12]  Mubarak Shah,et al.  A hierarchical approach to robust background subtraction using color and gradient information , 2002, Workshop on Motion and Video Computing, 2002. Proceedings..

[13]  Michael Harville,et al.  A Framework for High-Level Feedback to Adaptive, Per-Pixel, Mixture-of-Gaussian Background Models , 2002, ECCV.

[14]  Mubarak Shah,et al.  Tracking and Object Classification for Automated Surveillance , 2002, ECCV.

[15]  Nikos Paragios,et al.  Video-Based Surveillance Systems , 2002, Springer US.

[16]  Terrance E. Boult,et al.  Into the woods: visual surveillance of noncooperative and camouflaged targets in complex outdoor settings , 2001, Proc. IEEE.

[17]  Richard K. Beatson,et al.  Reconstruction and representation of 3D objects with radial basis functions , 2001, SIGGRAPH.

[18]  Jian-Huang Lai,et al.  Face recognition using holistic Fourier invariant features , 2001, Pattern Recognit..

[19]  J. Crowley,et al.  Multi-Modal Tracking of Interacting Targets Using Gaussian Approximations , 2001 .

[20]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Larry S. Davis,et al.  W4: Real-Time Surveillance of People and Their Activities , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Takeo Kanade,et al.  Introduction to the Special Section on Video Surveillance , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Fabio Lavagetto,et al.  The facial animation engine: toward a high-level interface for the design of MPEG-4 compliant animated faces , 1999, IEEE Trans. Circuits Syst. Video Technol..

[25]  Steven M. Lalonde,et al.  A First Course in Multivariate Statistics , 1997, Technometrics.

[26]  Aleix M. Martinez,et al.  The AR face database , 1998 .

[27]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Richard K. Beatson,et al.  Surface interpolation with radial basis functions for medical imaging , 1997, IEEE Transactions on Medical Imaging.

[29]  Mubarak Shah,et al.  Shape from shading using linear approximation , 1994, Image Vis. Comput..

[30]  Longin Jan Latecki,et al.  Digital Topology , 1994 .

[31]  Azriel Rosenfeld,et al.  'Continuous' functions on digital pictures , 1986, Pattern Recognit. Lett..

[32]  Ramesh C. Jain,et al.  Separating Non-Stationary from Stationary Scene Components in a Sequence of Real World TV Images , 1977, IJCAI.

[33]  E. Land,et al.  Lightness and retinex theory. , 1971, Journal of the Optical Society of America.