Uncertainty control for reliable video understanding on complex environments

The most popular applications for video understanding are those related to video-surveillance (e.g. alarms, abnormal behaviours, expected events, access control). Video understanding has several other applications of high impact to the society as medical supervision, traffic control, violent acts detection, crowd behaviour analysis, among many others. We propose a new generic video understanding approach able to extract and learn valuable information from noisy video scenes for real-time applications. This approach comprises motion segmentation, object classification, tracking and event learning phases. This work is focused on building the first fundamental blocks allowing a proper management of uncertainty of data in every phase of the video understanding process. The main contributions of the proposed approach are: (i) a new algorithm for tracking multiple objects in noisy environments, (ii) the utilisation of reliability measures for modelling uncertainty in data and for proper selection of valuable information extracted from noisy data, (iii) the improved capability of tracking to manage multiple visual evidence-target associations, (iv) the combination of 2D image data with 3D information in a dynamics model governed by reliability measures for proper control of uncertainty in data, and (v) a new approach for event recognition through incremental event learning, driven by reliability measures for selecting the most stable and relevant data.

[1]  Shaogang Gong,et al.  Video Behavior Profiling for Anomaly Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Pat Langley,et al.  Models of Incremental Concept Formation , 1990, Artif. Intell..

[3]  Gian Luca Foresti,et al.  Trajectory clustering and its applications for video surveillance , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[4]  François Brémond,et al.  Tracking multiple nonrigid objects in video sequences , 1998, IEEE Trans. Circuits Syst. Video Technol..

[5]  C. Coelho,et al.  A novel pedestrian classification algorithm for a high definition dual camera 360 degrees surveillance system , 2005, IEEE International Conference on Image Processing 2005.

[6]  S.S. Blackman,et al.  Multiple hypothesis tracking for multiple target tracking , 2004, IEEE Aerospace and Electronic Systems Magazine.

[7]  Ehud Rivlin,et al.  Understanding Video Events: A Survey of Methods for Automatic Interpretation of Semantic Occurrences in Video , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[8]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[9]  Robert J. Dempster,et al.  Demonstration of multiple-hypothesis tracking (MHT) practical real-time implementation feasibility , 2001 .

[10]  Marcos Zúñiga Incremental learning of events in video using reliable information , 2008 .

[11]  Alan M. McIvor,et al.  Background Subtraction Techniques , 2000 .

[12]  Ramakant Nevatia,et al.  Tracking multiple humans in crowded environment , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[13]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[14]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  James Bret Michael,et al.  Boost Phase Ballistic Missile Defense Using Multiple Hypothesis Tracking , 2007, 2007 IEEE International Conference on System of Systems Engineering.

[16]  John J. Leonard,et al.  Curved shape reconstruction using multiple hypothesis tracking , 1997 .

[17]  Jan-Olof Eklundh,et al.  Real-Time Maintenance of Figure-Ground Segmentation , 1999, ICVS.

[18]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[19]  Rita Cucchiara,et al.  Posture classification in a multi-camera indoor environment , 2005, IEEE International Conference on Image Processing 2005.

[20]  Chia-Hung Yeh,et al.  Robust Vehicle and Traffic Information Extraction for Highway Surveillance , 2005, EURASIP J. Adv. Signal Process..

[21]  Bernd Heisele Motion-based Object Detection and Tracking in Color Image Sequence , 2000 .

[22]  François Brémond,et al.  Applying 3D human model in a posture recognition system , 2006, Pattern Recognit. Lett..

[23]  Patrick Pérez,et al.  Sequential Monte Carlo methods for multiple target tracking and data fusion , 2002, IEEE Trans. Signal Process..

[24]  Zoubin Ghahramani,et al.  Learning Dynamic Bayesian Networks , 1997, Summer School on Neural Networks.

[25]  Massimo Piccardi,et al.  Background subtraction techniques: a review , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[26]  Kevin Thompson,et al.  Cobweb/3: A portable implementation , 1990 .

[27]  François Brémond,et al.  ETISEO, performance evaluation for video surveillance systems , 2007, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance.

[28]  Luc Van Gool,et al.  Efficient Mining of Frequent and Distinctive Feature Configurations , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[29]  Jörg Heuer,et al.  Model based segmentation of motion fields in compressed video sequences using partition projection and relaxation , 2005, Visual Communications and Image Processing.

[30]  Yaakov Bar-Shalom,et al.  Dimensionless score function for multiple hypothesis tracking , 2007 .