论文信息 - Space-Time Domain Tensor Neural Networks: An Application on Human Pose Classification.

Space-Time Domain Tensor Neural Networks: An Application on Human Pose Classification.

Recent advances in sensing technologies require the design and development of pattern recognition models capable of processing spatiotemporal data efficiently. In this study, we propose a spatially and temporally aware tensor-based neural network for human pose classification using three-dimensional skeleton data. Our model employs three novel components. First, an input layer capable of constructing highly discriminative spatiotemporal features. Second, a tensor fusion operation that produces compact yet rich representations of the data, and third, a tensor-based neural network that processes data representations in their original tensor form. Our model is end-to-end trainable and characterized by a small number of trainable parameters making it suitable for problems where the annotated data is limited. Experimental evaluation of the proposed model indicates that it can achieve state-of-the-art performance.

Nikolaos Doulamis | Anastasios Doulamis | Konstantinos Makantasis | Nikolaos Bakalos | Athanasios Voulodimos

[1] Yongxin Yang,et al. Attribute-Enhanced Face Recognition with Neural Tensor Fusion Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[2] Nikolaos Grammalidis,et al. Dance analysis using multiple Kinect sensors , 2016, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[3] Jonathan Tompson,et al. Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation , 2014, NIPS.

[4] Antonis Nikitakis,et al. Tensor-Based Classification Models for Hyperspectral Data Analysis , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[5] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[6] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[7] Stefano Ramat,et al. Automatic Pose Recognition for Monitoring Dangerous Situations in Ambient-Assisted Living , 2020, Frontiers in Bioengineering and Biotechnology.

[8] Walid Gomaa,et al. Novel Approaches to Activity Recognition Based on Vector Autoregression and Wavelet Transforms , 2018, 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA).

[9] Xiaoshan Li,et al. Tucker Tensor Regression and Neuroimaging Analysis , 2018, Statistics in Biosciences.

[10] Alan L. Yuille,et al. Articulated Pose Estimation by a Graphical Model with Image Dependent Pairwise Relations , 2014, NIPS.

[11] Darko Kirovski,et al. Real-time classification of dance gestures from skeleton animation , 2011, SCA '11.

[12] Nikolaos Doulamis,et al. Common Mode Patterns for Supervised Tensor Subspace Learning , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13] Masashi Sugiyama,et al. Tensor Networks for Dimensionality Reduction and Large-scale Optimization: Part 2 Applications and Future Perspectives , 2017, Found. Trends Mach. Learn..

[14] Nikolaos Doulamis,et al. Deep learning based human behavior recognition in industrial workflows , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[15] Cristian Sminchisescu,et al. The Moving Pose: An Efficient 3D Kinematics Descriptor for Low-Latency Action Recognition and Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[16] Nikolaos Tampouratzis,et al. A Unified Novel Neural Network Approach and a Prototype Hardware Implementation for Ultra-Low Power EEG Classification , 2019, IEEE Transactions on Biomedical Circuits and Systems.

[17] Antonios Liapis,et al. Fusing Level and Ruleset Features for Multimodal Learning of Gameplay Outcomes , 2019, 2019 IEEE Conference on Games (CoG).

[18] Anima Anandkumar,et al. Tensor Regression Networks , 2017, J. Mach. Learn. Res..

[19] Özgür B. Akan,et al. Spatio-temporal correlation: theory and applications for wireless sensor networks , 2004, Comput. Networks.

[20] Alexander C. Berg,et al. Combining multiple sources of knowledge in deep CNNs for action recognition , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[21] Lian-Wen Jin,et al. Activity recognition from acceleration data using AR model representation and SVM , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[22] Nikolaos Doulamis,et al. Extraction of key postures from 3D human motion data for choreography summarization , 2017, 2017 9th International Conference on Virtual Worlds and Games for Serious Applications (VS-Games).

[23] Michael L. Littman,et al. Activity Recognition from Accelerometer Data , 2005, AAAI.

[24] Bernt Schiele,et al. Analyzing features for activity recognition , 2005, sOc-EUSAI '05.

[25] Anima Anandkumar,et al. Tensor Contraction & Regression Networks , 2018 .

[26] Marinos Ioannides,et al. Modelling of Static and Moving Objects: Digitizing Tangible and Intangible Cultural Heritage , 2017, Mixed Reality and Gamification for Cultural Heritage.

[27] Hassan Ghasemzadeh,et al. Multi-sensor fusion in body sensor networks: State-of-the-art and research challenges , 2017, Inf. Fusion.

[28] Nikolaos Doulamis,et al. Learning Choreographic Primitives Through A Bayesian Optimized Bi-Directional LSTM Model , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[29] Moritz Grosse-Wentrup,et al. Multiclass Common Spatial Patterns and Information Theoretic Feature Extraction , 2008, IEEE Transactions on Biomedical Engineering.

[30] Fabio Tozeto Ramos,et al. Unsupervised clustering of people from ‘skeleton’ data , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[31] Andrew Zisserman,et al. Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.

[32] Lei Wang,et al. A Comparative Review of Recent Kinect-Based Action Recognition Algorithms , 2019, IEEE Transactions on Image Processing.

[33] Tamara G. Kolda,et al. Tensor Decompositions and Applications , 2009, SIAM Rev..

[34] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.

[35] Ming Yang,et al. 3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36] Zhengyou Zhang,et al. Microsoft Kinect Sensor and Its Effect , 2012, IEEE Multim..

[37] Ioannis Papaefstathiou,et al. Data-Driven Background Subtraction Algorithm for In-Camera Acceleration in Thermal Imagery , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[38] A. M. Khan,et al. Accelerometer signal-based human activity recognition using augmented autoregressive model coefficients and artificial neural nets , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[39] Shuangquan Wang,et al. Human activity recognition with user-free accelerometers in the sensor networks , 2005, 2005 International Conference on Neural Networks and Brain.