Action recognition using tri-view constraints

Two-view methods have been well developed to identify human actions. However, in a case where the corresponding imaged points cannot induce distinguished measures, the performance of the methods deteriorates. For this reason, we propose a new view-invariant measure for human action recognition by enforcing tri-view constraints in this paper. We apply our approach to video synchronization by imposing both the similarity ratio and the consistency in the trifocal tensor over entire video sequences. By testing on both synthetic and real data, our method has achieved higher tolerance to noise levels, as well as higher identification accuracy than the traditional two-view method. Experimental results demonstrate that our approach can identify human pose transitions, despite of dynamic time-lines, different viewpoints, and unknown camera parameters.

[1]  Xiaochun Cao,et al.  Video synchronization and its application to object transfer , 2010, Image Vis. Comput..

[2]  Hassan Foroosh,et al.  View-invariant action recognition using fundamental ratios , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Rama Chellappa,et al.  View Invariance for Human Action Recognition , 2005, International Journal of Computer Vision.

[4]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[5]  Mubarak Shah,et al.  Actions sketch: a novel action representation , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Tieniu Tan,et al.  Recent developments in human motion analysis , 2003, Pattern Recognit..

[7]  Hassan Foroosh,et al.  View-Invariant Action Recognition from Point Triplets , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Cheng Lei,et al.  Tri-focal tensor-based multiple video synchronization with subframe optimization , 2006, IEEE Transactions on Image Processing.

[9]  Mubarak Shah,et al.  Recognizing human actions using multiple features , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Wen Gao,et al.  Action Recognition in Broadcast Tennis Video Using Optical Flow and Support Vector Machine , 2006, ECCV Workshop on HCI.

[11]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words , 2006, BMVC.

[12]  Yanchun Zhang,et al.  Frontiers of WWW Research and Development - APWeb 2006, 8th Asia-Pacific Web Conference, Harbin, China, January 16-18, 2006, Proceedings , 2006, APWeb.

[13]  Hamid Krim,et al.  Object Recognition Through Topo-Geometric Shape Models Using Error-Tolerant Subgraph Isomorphisms , 2010, IEEE Transactions on Image Processing.

[14]  Ivan Laptev,et al.  On Space-Time Interest Points , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[15]  Yaser Sheikh,et al.  Matching Trajectories of Anatomical Landmarks Under Viewpoint, Anthropometric and Temporal Transforms , 2009, International Journal of Computer Vision.