Gesture Segmentation from a Video Sequence Using Greedy Similarity Measure

We propose a novel method of greedy similarity measure to segment long spatial-temporal video sequences. Firstly, a principal curve of motion region along frames of a video sequence is constructed to represent trajectory. Then from the constructed principal curves of trajectories of predefined gestures, HMMs are applied to modeling them. For a long input video sequence, greedy similarity measure is established to automatically segment it into gestures along with gesture recognition, where true breakpoints of its principal curve are found by maximizing the joint probability of two successive candidate segments conditioned on the gesture models obtained from HMMs. The method is flexible, of high accuracy, and robust to noise due to the exploitation of principal curves, the combination of two successive candidate segments, and the simultaneous recognition. Experiments including comparison with two established methods demonstrate the effectiveness of the proposed method

[1]  T. Hastie,et al.  Principal Curves , 2007 .

[2]  Longin Jan Latecki,et al.  Convexity Rule for Shape Decomposition Based on Discrete Contour Evolution , 1999, Comput. Vis. Image Underst..

[3]  Ben J. A. Kröse,et al.  A k-segments algorithm for finding principal curves , 2002, Pattern Recognit. Lett..

[4]  Shaogang Gong,et al.  Activity Based Video Content Trajectory Representation and Segmentation , 2004, BMVC.

[5]  Yong Rui,et al.  Segmenting visual actions based on spatio-temporal motion patterns , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[6]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, J. Electronic Imaging.

[7]  Rangachar Kasturi,et al.  Extraction and Temporal Segmentation of Multiple Motion Trajectories in Human Motion , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[8]  Majid Mirmehdi,et al.  Temporal video segmentation and classification of edit effects , 2003, Image Vis. Comput..

[9]  Chong-Wah Ngo,et al.  Gesture tracking and recognition for lecture video editing , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[10]  Patrick Bouthemy,et al.  Content-Based Video Segmentation using Statistical Motion Models , 2002, BMVC.

[11]  John R. Kender,et al.  Video scene segmentation via continuous video coherence , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).