Multiresolution Local Autocorrelation of Optical Flows over Time for Action Recognition

We propose method for fast action recognition and comparable performance using local autocorrelation of optical flows over time. To capture action movement, dense optical flows is generated along sequence of video. Optical flows sometimes yield noise of motions that distract object of interest from another object motions and background. We suppress this by using edge based optical flow. The HOF vector is extracted from each window resolution and correlate its consecutive flow fields within cycle using local autocorrelation over time. It will gather richer information from movement while also gaining discriminative features than standard histogram methods. Comparison shows that the comparable performance is achieved over state of the arts.

[1]  Greg Mori,et al.  Action recognition by learning mid-level motion features , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Takio Kurita,et al.  A New Scheme for Practical Flexible and Intelligent Vision Systems , 1988, MVA.

[3]  Luc Van Gool,et al.  Action snippets: How many frames does human action recognition require? , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  David J. Fleet,et al.  Computation of component image velocity from local phase information , 1990, International Journal of Computer Vision.

[5]  Jitendra Malik,et al.  Occlusion boundary detection and figure/ground assignment from optical flow , 2011, CVPR 2011.

[6]  Antonio Torralba,et al.  SIFT Flow: Dense Correspondence across Scenes and Its Applications , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[8]  Takio Kurita,et al.  Action Recognition Using Three-Way Cross-Correlations Feature of Local Moton Attributes , 2010, 2010 20th International Conference on Pattern Recognition.

[9]  Patrick Bouthemy,et al.  Better Exploiting Motion for Better Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Martial Hebert,et al.  Spatio-temporal Shape and Flow Correlation for Action Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Krystian Mikolajczyk,et al.  Action recognition with appearance-motion features and fast search trees , 2011, Comput. Vis. Image Underst..

[12]  Feng Shi,et al.  Sampling Strategies for Real-Time Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Vassilios Morellas,et al.  Action recognition using global spatio-temporal features derived from sparse representations , 2014, Comput. Vis. Image Underst..

[14]  Jason J. Corso,et al.  Action bank: A high-level representation of activity in video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[16]  Martial Hebert,et al.  Efficient visual event detection using volumetric features , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[17]  Cordelia Schmid,et al.  Action recognition by dense trajectories , 2011, CVPR 2011.

[18]  Christian Wolf,et al.  Recognizing and Localizing Individual Activities through Graph Matching , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[19]  Kuldip K. Paliwal,et al.  Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition , 2006, Speech Commun..

[20]  Jae-Young Jung,et al.  Automated measurement of crowd density based on edge detection and optical flow , 2010, 2010 The 2nd International Conference on Industrial Mechatronics and Automation.

[21]  Thomas B. Moeslund,et al.  Selective spatio-temporal interest points , 2012, Comput. Vis. Image Underst..

[22]  Gunnar Farnebäck,et al.  Two-Frame Motion Estimation Based on Polynomial Expansion , 2003, SCIA.

[23]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[24]  Takumi Kobayashi,et al.  Image Feature Extraction Using Gradient Local Auto-Correlations , 2008, ECCV.