Robust Online Change-point Detection in Video Sequences

We present the Cumulative Sum (CUSUM) stopping rule, applied to Computer Vision problems, to automatically detect changes in either parametric or nonparametric distributions, online or off-line. Our approach is based on using the previously received data of the sequence to detect a change in data that are to be received. We assume that no significant change has occurred up to an unknown time instance. Then a change in the distribution of the observations occurs and the objective is to estimate this instance. We test the hypotheses of no change occurs vs. a change occurs at the current frame, which is done by the CUSUM stopping rule. We apply our framework to the case of continuous 3D hand tracking, where the high dofs, the fast finger articulations, the large rotations and the frequent occlusions often cause error accumulation. Also we illustrate the performance of our approach in video segmentation, and specifically in segmentation of fingerspelling in American Sign Language (ASL) videos.

[1]  Shan Lu,et al.  Using multiple cues for hand tracking and model refinement , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[2]  Thomas S. Huang,et al.  Tracking articulated hand motion with eigen dynamics analysis , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[3]  Alʹbert Nikolaevich Shiri︠a︡ev,et al.  Optimal stopping rules , 1977 .

[4]  Michael Isard,et al.  Contour Tracking by Stochastic Propagation of Conditional Density , 1996, ECCV.

[5]  Michael I. Mandel,et al.  Visual Hand Tracking Using Nonparametric Belief Propagation , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[6]  Sherman Wilcox,et al.  The phonetics of fingerspelling , 1992 .

[7]  Björn Stenger,et al.  Hand Pose Estimation Using Hierarchical Detection , 2004, ECCV Workshop on HCI.

[8]  Dimitris N. Metaxas,et al.  A Framework for Recognizing the Simultaneous Aspects of American Sign Language , 2001, Comput. Vis. Image Underst..

[9]  H. W. Sorenson,et al.  Kalman filtering : theory and application , 1985 .

[10]  G. Moustakides Optimal stopping times for detecting changes in distributions , 1986 .

[11]  E. S. Page A test for a change in a parameter occurring at an unknown point , 1955 .

[12]  Stan Sclaroff,et al.  Database Indexing Methods for 3D Hand Pose Estimation , 2003, Gesture Workshop.

[13]  Peter Willett,et al.  Detection of hidden Markov model transient signals , 2000, IEEE Trans. Aerosp. Electron. Syst..

[14]  Takeo Kanade,et al.  Model-based tracking of self-occluding articulated objects , 1995, Proceedings of IEEE International Conference on Computer Vision.

[15]  G. Lorden PROCEDURES FOR REACTING TO A CHANGE IN DISTRIBUTION , 1971 .

[16]  Kirsti Grobel,et al.  Video-based Recognition of Fingerspelling in Real-Time , 1996, Bildverarbeitung für die Medizin.

[17]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[18]  Carlo Tomasi,et al.  3D tracking = classification + interpolation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[19]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[20]  Ying Wu,et al.  Modeling the constraints of human hand motion , 2000, Proceedings Workshop on Human Motion.

[21]  Jay L. Devore,et al.  Probability and statistics for engineering and the sciences , 1982 .