Activity detection using Sequential Statistical Boundary Detection (SSBD)

We propose a novel activity detection scheme tailored for home environment scenes.We introduce three new action datasets for action detection evaluation.Fast spatio-temporal action localization with the use of statistical tools. The spiralling increase of video data has rendered the automated localization and recognition of activities an essential step for video content understanding. In this work, we introduce novel algorithms for detecting human activities in the spatial domain via a binary activity detection mask, the Motion Boundary Activity Area (MBAA), and in the time domain by a new approach, Statistical Sequential Boundary Detection (SSBD). MBAAs are estimated by analyzing the motion vectors using the Kurtosis metric, while dense trajectories are extracted and described using a low level HOGHOF descriptor and high level Fisher representation scheme, modeling a Support Vector Data Description (SVDD) hypersphere. SSBD is then realized by applying Sequential Change Detection with the Cumulative Sum (CUSUM) algorithm on the distances between Fisher data descriptors and the corresponding reference SVDD hyperspheres for rapid detection of changes in the activity pattern. Activities in the resulting video subsequences are then classified using an multi-class SVM model, leading to state of the art results. Our experiments with benchmark and real world data demonstrate that our technique is successful in reducing the computational cost and also in improving activity detection rates.

[1]  Jason J. Corso,et al.  Action bank: A high-level representation of activity in video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Jason Wittenberg,et al.  Clarify: Software for Interpreting and Presenting Statistical Results , 2003 .

[3]  Junsong Yuan,et al.  Max-Margin Structured Output Regression for Spatio-Temporal Action Localization , 2012, NIPS.

[4]  Kristen Grauman,et al.  Efficient activity detection with max-subgraph search , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Cordelia Schmid,et al.  Human Focused Action Localization in Video , 2010, ECCV Workshops.

[6]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, ICPR 2004.

[7]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[8]  Nazli Ikizler-Cinbis,et al.  Action Recognition and Localization by Hierarchical Space-Time Segments , 2013, 2013 IEEE International Conference on Computer Vision.

[9]  G. Marsaglia,et al.  Evaluating Kolmogorov's distribution , 2003 .

[10]  Jean Ponce,et al.  Automatic annotation of human actions in video , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[11]  Yiannis Kompatsiaris,et al.  Recognition of Activities of Daily Living for Smart Home Environments , 2013, 2013 9th International Conference on Intelligent Environments.

[12]  Luc Van Gool,et al.  Exemplar-based Action Recognition in Video , 2009, BMVC.

[13]  Cordelia Schmid,et al.  Action recognition by dense trajectories , 2011, CVPR 2011.

[14]  Ying Wu,et al.  Discriminative subvolume search for efficient action detection , 2009, CVPR.

[15]  Richard P. Wildes,et al.  Efficient action spotting based on a spacetime oriented structure representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Andrew Zisserman,et al.  The devil is in the details: an evaluation of recent feature encoding methods , 2011, BMVC.

[17]  Hassan Foroosh,et al.  Feature-Independent Action Spotting without Human Localization, Segmentation, or Frame-wise Tracking , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Yang Wang,et al.  Discriminative figure-centric models for joint action localization and recognition , 2011, 2011 International Conference on Computer Vision.

[19]  Cordelia Schmid,et al.  Efficient Action Localization with Approximately Normalized Fisher Vectors , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Patrick Pérez,et al.  Retrieving actions in movies , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[21]  Eric Moreau,et al.  Unbiased Efficient Estimator of the Fourth-Order Cumulant for Random Zero-Mean Non-i.i.d. Signals: Particular Case of MA Stochastic Process , 2010, IEEE Transactions on Information Theory.

[22]  Patrick Bouthemy,et al.  Action Localization with Tubelets from Motion , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Robert P. W. Duin,et al.  Support Vector Data Description , 2004, Machine Learning.

[24]  Alexander Novikov,et al.  Statistics and control of random processes , 1995 .

[25]  Christopher Joseph Pal,et al.  Activity recognition using the velocity histories of tracked keypoints , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[26]  L. T. DeCarlo On the meaning and use of kurtosis. , 1997 .

[27]  Thomas Mensink,et al.  Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[28]  Mubarak Shah,et al.  Action MACH a spatio-temporal Maximum Average Correlation Height filter for action recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Yiannis Kompatsiaris,et al.  Recognition of Activities of Daily Living , 2012, 2012 IEEE 24th International Conference on Tools with Artificial Intelligence.

[30]  Cordelia Schmid,et al.  Aggregating Local Image Descriptors into Compact Codes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.