Similarity-Based Processing of Motion Capture Data

Motion capture technologies digitize human movements by tracking 3D positions of specific skeleton joints in time. Such spatio-temporal data have an enormous application potential in many fields, ranging from computer animation, through security and sports to medicine, but their computerized processing is a difficult problem. The recorded data can be imprecise, voluminous, and the same movement action can be performed by various subjects in a number of alternatives that can vary in speed, timing or a position in space. This requires employing completely different data-processing paradigms compared to the traditional domains such as attributes, text or images. The objective of this tutorial is to explain fundamental principles and technologies designed for similarity comparison, searching, subsequence matching, classification and action detection in the motion capture data. Specifically, we emphasize the importance of similarity needed to express the degree of accordance between pairs of motion sequences and also discuss the machine-learning approaches able to automatically acquire content-descriptive movement features. We explain how the concept of similarity together with the learned features can be employed for searching similar occurrences of interested actions within a long motion sequence. Assuming a user-provided categorization of example motions, we discuss techniques able to recognize types of specific movement actions and detect such kinds of actions within continuous motion sequences. Selected operations will be demonstrated by on-line web applications.

[1]  Hans-Peter Seidel,et al.  Efficient and Robust Annotation of Motion Capture Data , 2009 .

[2]  Wenjun Zeng,et al.  Spatio-Temporal Attention-Based LSTM Networks for 3D Action Recognition and Detection , 2018, IEEE Transactions on Image Processing.

[3]  Xiaohui Xie,et al.  Co-Occurrence Feature Learning for Skeleton Based Action Recognition Using Regularized Deep LSTM Networks , 2016, AAAI.

[4]  Thierry Dutoit,et al.  3D skeleton‐based action recognition by representing motion capture sequences as 2D‐RGB images , 2017, Comput. Animat. Virtual Worlds.

[5]  Olegas Vasilecas,et al.  Advances in Databases and Information Systems (ADBIS) , 2002, SIGMOD Rec..

[6]  Pavel Zezula,et al.  Enhancing Effectiveness of Descriptors for Searching and Recognition in Motion Capture Data , 2017, 2017 IEEE International Symposium on Multimedia (ISM).

[7]  Mathieu Barnachon,et al.  Ongoing human action recognition with motion capture , 2014, Pattern Recognit..

[8]  Pavel Zezula,et al.  Similarity Search: The Metric Space Approach (Advances in Database Systems) , 2005 .

[9]  Weibin Liu,et al.  Behavioral segmentation for human motion capture data based on graph cut method , 2017, J. Vis. Lang. Comput..

[10]  Georgios Evangelidis,et al.  Skeletal Quads: Human Action Recognition Using Joint Quadruples , 2014, 2014 22nd International Conference on Pattern Recognition.

[11]  Pavel Zezula,et al.  Similarity Searching for Database Applications , 2016, ADBIS.

[12]  Zhigang Deng,et al.  Perceptually consistent example-based human motion retrieval , 2009, I3D '09.

[13]  Stan Sclaroff,et al.  Learning Activity Progression in LSTMs for Activity Detection and Early Detection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Juan José Pantrigo,et al.  Convolutional Neural Networks and Long Short-Term Memory for skeleton-based human activity and hand gesture recognition , 2018, Pattern Recognit..

[15]  Pavel Zezula,et al.  Fast Subsequence Matching in Motion Capture Data , 2017, ADBIS.

[16]  Michael Neff,et al.  Deep signatures for indexing and retrieval in large motion databases , 2015, MIG.

[17]  Pavel Zezula,et al.  A Key-Pose Similarity Algorithm for Motion Data Retrieval , 2013, ACIVS.

[18]  Xin Zhang,et al.  Learning multi-level features for sensor-based human action recognition , 2016, Pervasive Mob. Comput..

[19]  Norman I. Badler,et al.  Efficient motion retrieval in large motion databases , 2013, I3D '13.

[20]  Pavel Zezula,et al.  Effective and efficient similarity searching in motion capture data , 2017, Multimedia Tools and Applications.

[21]  Jian Yang,et al.  Spatio-Temporal Graph Convolution for Skeleton Based Action Recognition , 2018, AAAI.

[22]  Dehui Kong,et al.  Effective human action recognition using global and local offsets of skeleton joints , 2018, Multimedia Tools and Applications.

[23]  Pavel Zezula,et al.  Searching for variable-speed motions in long sequences of motion capture data , 2019, Inf. Syst..

[24]  Yong Du,et al.  Hierarchical recurrent neural network for skeleton based action recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Rajeev Srivastava,et al.  Depth based enlarged temporal dimension of 3D deep convolutional network for activity recognition , 2018, Multimedia Tools and Applications.

[26]  Gang Wang,et al.  Skeleton-Based Human Action Recognition With Global Context-Aware Attention LSTM Networks , 2017, IEEE Transactions on Image Processing.

[27]  Hung-Hsuan Huang,et al.  Searching human actions based on a multi-dimensional time series similarity calculation method , 2015, 2015 IEEE/ACIS 14th International Conference on Computer and Information Science (ICIS).

[28]  Pavel Zezula,et al.  Similarity Search - The Metric Space Approach , 2005, Advances in Database Systems.

[29]  Silvio Savarese,et al.  Structural-RNN: Deep Learning on Spatio-Temporal Graphs , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Xin Zhao,et al.  Structured Streaming Skeleton -- A New Feature for Online Human Gesture Recognition , 2014, TOMM.

[31]  Paul J. Taylor,et al.  AMAB: Automated measurement and analysis of body motion , 2013, Behavior research methods.

[32]  Christian Wolf,et al.  Human Action Recognition: Pose-Based Attention Draws Focus to Hands , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[33]  Yun Fu,et al.  Early Recognition of 3D Human Actions , 2018, ACM Trans. Multim. Comput. Commun. Appl..

[34]  Pavel Zezula,et al.  Similarity Searching for the Big Data , 2015, Mob. Networks Appl..

[35]  Pavel Zezula,et al.  A Real-Time Annotation of Motion Data Streams , 2017, 2017 IEEE International Symposium on Multimedia (ISM).

[36]  C.-C. Jay Kuo,et al.  Automatic Human Mocap Data Classification , 2014, IEEE Transactions on Multimedia.

[37]  Pavel Zezula,et al.  Probabilistic Classification of Skeleton Sequences , 2018, DEXA.

[38]  Wenjun Zeng,et al.  An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data , 2016, AAAI.

[39]  Pichao Wang,et al.  Online human action recognition based on incremental learning of weighted covariance descriptors , 2018, Inf. Sci..

[40]  Reinhard Klein,et al.  Efficient Unsupervised Temporal Segmentation of Motion Data , 2015, IEEE Transactions on Multimedia.

[41]  Franck Multon,et al.  CuDi3D: Curvilinear displacement based approach for online 3D action detection , 2018, Comput. Vis. Image Underst..

[42]  Gang Wang,et al.  Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition , 2016, ECCV.

[43]  Danica Kragic,et al.  Deep Representation Learning for Human Motion Prediction and Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Ling Shao,et al.  Leveraging Hierarchical Parametric Networks for Skeletal Joints Based Action Segmentation and Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.