Temporal Subspace Clustering for Unsupervised Action Segmentation

Action segmentation (segmenting a continuous sequence of motion data into a set of actions) has a wide range of applications and plays a role in many problems in computer vision. We look at subspace clustering as an unsupervised approach for this task. Classical subspace clustering methods uncover relationships within the data by learning codes for the samples (i.e. frames), but in this process these methods do not consider the temporal dependency of nearby samples in motion data. In this paper, we propose two subspace clustering methods with temporal regularization terms: Temporal Sparse Subspace Clustering Laplacian Regularization (TSSC-LR) and Temporal Sparse Subspace Clustering Linear Combination (TSSC-LC). TSSC-LR encourages similar codings of samples within a window and TSSC-LC enforces codings within a window to be linear combinations of each other. Efficient ADMM algorithms are proposed for each method. Experiments against state-of-the-art methods on three action datasets demonstrate the effectiveness of the two proposed methods.

[1]  Yong Yu,et al.  Robust Subspace Segmentation by Low-Rank Representation , 2010, ICML.

[2]  Junbin Gao,et al.  Subspace Clustering for Sequential Data , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Vladimir Pavlovic,et al.  Probabilistic Temporal Subspace Clustering , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Jian Cheng,et al.  Sequential Subspace Clustering via Temporal Smoothness , 2017, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[5]  Junbin Gao,et al.  Ordered Subspace Clustering With Block-Diagonal Priors , 2016, IEEE Transactions on Cybernetics.

[6]  Gregory D. Hager,et al.  Learning convolutional action primitives for fine-grained action recognition , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Zhihua Zhang,et al.  Exact Subspace Clustering in Linear Time , 2014, AAAI.

[8]  Yi Wang,et al.  Sequential Max-Margin Event Detectors , 2014, ECCV.

[9]  S. Shankar Sastry,et al.  Generalized principal component analysis (GPCA) , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Henry C. Lin,et al.  JHU-ISI Gesture and Skill Assessment Working Set ( JIGSAWS ) : A Surgical Activity Dataset for Human Motion Modeling , 2014 .

[11]  René Vidal,et al.  Sparse subspace clustering , 2009, CVPR.

[12]  Maja Pantic,et al.  Temporal Archetypal Analysis for Action Segmentation , 2017, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[13]  Yun Fu,et al.  Temporal Subspace Clustering for Human Motion Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).