A Novel Skeleton-Based Human Activity Discovery Technique Using Particle Swarm Optimization with Gaussian Mutation

Human activity discovery aims to distinguish the activities performed by humans, without any prior information of what defines each activity. Most methods presented in human activity recognition are supervised, where there are labeled inputs to train the system. In reality, it is difficult to label data because of its huge volume and the variety of activities performed by humans. In this paper, a novel unsupervised approach is proposed to perform human activity discovery in 3D skeleton sequences. First, important frames are selected based on kinetic energy. Next, the displacement of joints, set of statistical, angles, and orientation features are extracted to represent the activities information. Since not all extracted features have useful information, the dimension of features is reduced using PCA. Most human activity discovery proposed are not fully unsupervised. They use pre-segmented videos before categorizing activities. To deal with this, we used the fragmented sliding time window method to segment the time series of activities with some overlapping. Then, activities are discovered by a novel hybrid particle swarm optimization with a Gaussian mutation algorithm to avoid getting stuck in the local optimum. Finally, k-means is applied to the outcome centroids to overcome the slow rate of PSO. Experiments on three datasets have been presented and the results show the proposed method has superior perfor∗Corresponding author. Email addresses: 20h8561@ubd.edu.bn (Parham Hadikhani), daphne.lai@ubd.edu.bn (Daphne Teck Ching Lai), weehong.ong@ubd.edu.bn (Wee-Hong Ong) ar X iv :2 20 1. 05 31 4v 1 [ cs .C V ] 1 4 Ja n 20 22 mance in discovering activities in all evaluation parameters compared to the other state-of-the-art methods and has increased accuracy of at least 4 % on average. The code is available here: Human-Activity-Discovery-HPGMK

[1]  Josef Kittler,et al.  Spatial Residual Layer and Dense Connection Block Enhanced Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[2]  Jake K. Aggarwal,et al.  View invariant human action recognition using histograms of 3D joints , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[3]  Richard McClatchey,et al.  Tracking changes in user activity from unlabelled smart home sensor data using unsupervised learning methods , 2020, Neural Computing and Applications.

[4]  Mubarak Shah,et al.  Unsupervised Action Discovery and Localization in Videos , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[5]  Xing Liu,et al.  Rotation-based spatial–temporal feature learning from skeleton sequences for action recognition , 2020, Signal Image Video Process..

[6]  Parham Hadikhani,et al.  An adaptive hybrid algorithm for social networks to choose groups with independent members , 2019, Evol. Intell..

[7]  Ying Wu,et al.  Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Takafumi Koseki,et al.  Investigation of Feature Extraction for Unsupervised Learning in Human Activity Detection , 2013 .

[9]  Changhe Li,et al.  An adaptive mutation operator for particle swarm optimization , 2008 .

[10]  Bart Selman,et al.  Unstructured human activity detection from RGBD images , 2011, 2012 IEEE International Conference on Robotics and Automation.

[11]  Hui Yu,et al.  Synergetic information bottleneck for joint multi-view and ensemble clustering , 2020, Inf. Fusion.

[12]  Jianghui Cai,et al.  A Novel Clustering Algorithm Based on DPC and PSO , 2020, IEEE Access.

[13]  Marco Morana,et al.  Human Activity Recognition Process Using 3-D Posture Data , 2015, IEEE Transactions on Human-Machine Systems.

[14]  Alberto Del Bimbo,et al.  Recognizing Actions from Depth Cameras as Weakly Aligned Multi-part Bag-of-Poses , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[15]  Parul Agarwal,et al.  Analyzing Subspace Clustering Approaches for High Dimensional Data , 2021, Artificial Intelligence for a Sustainable Industry 4.0.

[16]  Ahmad Lotfi,et al.  Human activity learning for assistive robotics using a classifier ensemble , 2018, Soft Comput..

[17]  Takafumi Koseki,et al.  Autonomous Learning and Recognition of Human Action based on An Incremental Approach of Clustering , 2015 .

[18]  Gang Wang,et al.  NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Alessio Del Bue,et al.  Subspace Clustering for Action Recognition with Covariance Representations and Temporal Pruning , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[20]  Srinivas Akella,et al.  3D human action segmentation and recognition using pose kinetic energy , 2014, 2014 IEEE International Workshop on Advanced Robotics and its Social Impacts.

[21]  이화영 X , 1960, Chinese Plants Names Index 2000-2009.

[22]  Massimo Mecella,et al.  Visual process maps: a visualization tool for discovering habits in smart homes , 2019, Journal of Ambient Intelligence and Humanized Computing.

[23]  Ling Shao,et al.  A Recursive Constrained Framework for Unsupervised Video Action Clustering , 2020, IEEE Transactions on Industrial Informatics.

[24]  Mahmood Fathy,et al.  Switching Structured Prediction for Simple and Complex Human Activity Recognition , 2020, IEEE Transactions on Cybernetics.

[25]  Sriyankar Acharyya,et al.  Repository and Mutation based Particle Swarm Optimization (RMPSO): A new PSO variant applied to reconstruction of Gene Regulatory Network , 2019, Appl. Soft Comput..

[26]  Kui Jia,et al.  JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[27]  Takafumi Koseki,et al.  An Incremental Approach of Clustering for Human Activity Discovery , 2014 .

[28]  Ajith Abraham,et al.  A meta-heuristic density-based subspace clustering algorithm for high-dimensional data , 2021, Soft Computing.

[29]  Marco La Cascia,et al.  3D skeleton-based human action classification: A survey , 2016, Pattern Recognit..

[30]  Nanning Zheng,et al.  Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Pichao Wang,et al.  Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition with Convolutional Neural Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Shengrui Wang,et al.  Particle swarm optimizer for variable weighting in clustering high-dimensional data , 2009, 2009 IEEE Swarm Intelligence Symposium.

[33]  Gang Wang,et al.  Global Context-Aware Attention LSTM Networks for 3D Action Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Yangdong Ye,et al.  Unsupervised Human Action Categorization with Consensus Information Bottleneck Method , 2016, IJCAI.

[35]  G. Fitzgerald,et al.  'I. , 2019, Australian journal of primary health.

[36]  Hedau Varsha Chandrashekhar,et al.  HUMAN ACTIVITY REPRESENTATION, ANALYSIS, AND RECOGNITION , 2006 .

[37]  Xiulong Liu,et al.  PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Chen Chen,et al.  Memory Attention Networks for Skeleton-Based Action Recognition , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[39]  Liang Wang,et al.  Constructing Stronger and Faster Baselines for Skeleton-based Action Recognition , 2021, ArXiv.

[40]  Lei Shi,et al.  Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Bingbing Ni,et al.  Learning Multi-View Interactional Skeleton Graph for Action Recognition , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words , 2006, BMVC.

[43]  Gang Wang,et al.  Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition , 2016, ECCV.

[44]  Nanning Zheng,et al.  View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition from Skeleton Data , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[45]  Ling Shao,et al.  Enhanced Computer Vision With Microsoft Kinect Sensor: A Review , 2013, IEEE Transactions on Cybernetics.