FAST AND RELIABLE OBJECT CLASSIFICATION IN VIDEO BASED ON A 3D GENERIC MODEL

We propose a new object classification approach for monocular video sequences, which allows to classify objects modelled independently from the camera position and object orientation. To achieve this independence, a simple 3D object model that represents an object as a parallelepiped is proposed. The approach is able to give good estimates of object dimensions and proposes visual reliability measures for the object dimensions. These measures give a representation of the visibility of the estimated dimension and are principally proposed to aid posterior phases of the video understanding process, as object tracking and event detection. The method obtains the 3D parallelepiped model estimation using a set of 2D moving regions (obtained in a segmentation phase), the perspective matrix transform (obtained from camera calibration using the pin-hole camera model) and predefined 3D models of expected objects in the scene. After classification, a merging step is performed to improve the classification performance by assembling 2D moving regions with better 3D model probability when together. This approach shows promising results on object classification, obtaining very high detection rates for complex situations and performing at video frame rate.

[1]  Monique Thonnat,et al.  Tracking Groups of People for Video Surveillance , 2002 .

[2]  Michael J. Black,et al.  Recognizing Human Motion Using Parameterized Models of Optical Flow , 1997 .

[3]  François Brémond,et al.  Automatic Video Interpretation: A Novel Algorithm for Temporal Scenario Recognition , 2003, IJCAI.

[4]  François Brémond,et al.  Applying 3D human model in a posture recognition system , 2006, Pattern Recognit. Lett..

[5]  Rita Cucchiara,et al.  Posture classification in a multi-camera indoor environment , 2005, IEEE International Conference on Image Processing 2005.

[6]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[7]  François Brémond,et al.  A Real-Time Scene Understanding System for Airport Apron Monitoring , 2006, Fourth IEEE International Conference on Computer Vision Systems (ICVS'06).

[8]  Rita Cucchiara,et al.  Predictive and Probabilistic Tracking to Detect Stopped Vehicles , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[9]  Aaron F. Bobick,et al.  Recognition of Visual Activities and Interactions by Stochastic Parsing , 2000, IEEE Trans. Pattern Anal. Mach. Intell..