Space-Time Domain Tensor Neural Networks: An Application on Human Pose Classification.

Recent advances in sensing technologies call for pattern recognition models that can process spatiotemporal data efficiently. In this study, we propose a spatially and temporally aware tensor-based neural network for human pose classification using three-dimensional skeleton data. Our model employs three novel components: first, an input layer that constructs highly discriminative spatiotemporal features; second, a tensor fusion operation that produces compact yet rich representations of the data; and third, a tensor-based neural network that processes these representations in their original tensor form. The model is end-to-end trainable and has a small number of trainable parameters, which makes it suitable for problems where annotated data is limited. Experimental evaluation indicates that the proposed model can achieve state-of-the-art performance.
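
To illustrate why processing data in tensor form can keep the parameter count small, the following is a minimal sketch and not the architecture proposed in the paper. It assumes a single skeleton sequence stored as a matrix of shape (joint coordinates x frames) and a generic bilinear layer Y = relu(A X B^T) with one factor matrix per mode. The names `bilinear_layer` and `param_counts`, and the sizes J, T, P, Q, are hypothetical and chosen only for illustration.

```python
import random


def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(r) for r in zip(*m)]


def bilinear_layer(x, a, b):
    """Compute relu(A X B^T).

    x: (J x T) input, a: (P x J) spatial factor, b: (Q x T) temporal factor.
    The output is a (P x Q) matrix, so the input is never flattened.
    """
    y = matmul(matmul(a, x), transpose(b))
    return [[max(0.0, v) for v in row] for row in y]


def param_counts(j, t, p, q):
    """Weights of the mode-wise layer vs. a dense layer on the flattened input."""
    tensor_params = p * j + q * t      # one factor matrix per mode
    dense_params = (j * t) * (p * q)   # full weight matrix, same output size
    return tensor_params, dense_params


# Hypothetical sizes: 25 joints x 3 coordinates, 30 frames, 8 x 4 output.
J, T, P, Q = 75, 30, 8, 4
rng = random.Random(0)
x = [[rng.gauss(0, 1) for _ in range(T)] for _ in range(J)]
a = [[rng.gauss(0, 0.1) for _ in range(J)] for _ in range(P)]
b = [[rng.gauss(0, 0.1) for _ in range(T)] for _ in range(Q)]

y = bilinear_layer(x, a, b)
print(len(y), len(y[0]))          # output shape: 8 4
print(param_counts(J, T, P, Q))   # (720, 72000)
```

Under these assumed sizes, the mode-wise layer uses 720 weights where a dense layer with the same input and output sizes would need 72,000, which is the kind of reduction that matters when annotated data is scarce.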
