Bidirectional long short-term memory networks and sparse hierarchical modeling for scalable educational learning of dance choreographies

Recently, several educational game platforms have been proposed in the literature for choreographic training. However, their main limitation is that they fail to provide a quantitative assessment framework of a performing choreography against a groundtruth one. In this paper, we address this issue by proposing a machine learning framework exploiting deep learning paradigms. In particular, we introduce a long short-term memory network with the main capability of analyzing 3D captured skeleton feature joints of a dancer into predefined choreographic postures. This pose identification procedure is capable of providing a detailed (fine) evaluation score of a performing dance. In addition, the paper proposes a choreographic summarization architecture based on sparse modeling representative selection (SMRS) in order to abstractly represent the performing choreography through a set of key choreographic primitives. We have modified the SMRS algorithm in a way to extract hierarchies of key representatives. Choreographic summarization provides an efficient tool for a coarse quantitative evaluation of a dance. Moreover, hierarchical representation scheme allows for a scalable assessment of a choreography. The serious game platform supports advanced visualization toolkits using Labanotation in order to deliver the performing sequence in a formal documentation.

[1]  Hao Wu,et al.  Using automatic generation of Labanotation to protect folk dance , 2017, J. Electronic Imaging.

[2]  Kingkarn Sookhanaphibarn,et al.  Game-based system for learning labanotation using Microsoft Kinect , 2017, 2017 IEEE 6th Global Conference on Consumer Electronics (GCCE).

[3]  Anders Grunnet-Jepsen,et al.  Intel RealSense Stereoscopic Depth Cameras , 2017, CVPR 2017.

[4]  Yuichi Iwadate,et al.  3D Archive System for Traditional Performing Arts , 2011, International Journal of Computer Vision.

[5]  Stefanos D. Kollias,et al.  An adaptable neural-network model for recursive nonlinear traffic prediction and modeling of MPEG video sources , 2003, IEEE Trans. Neural Networks.

[6]  Rahil Baber,et al.  Rigid body simulation , 2006 .

[7]  Nikolaos Doulamis,et al.  Kinematics-based Extraction of Salient 3D Human Motion Data for Summarization of Choreographic Sequences , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[8]  Athanasios Voulodimos,et al.  Transforming Intangible Folkloric Performing Arts into Tangible Choreographic Digital Objects: The Terpsichore Approach , 2017, VISIGRAPP.

[9]  Andreas Aristidou,et al.  Digitization of Cypriot Folk Dances , 2012, EuroMed.

[10]  Marinos Ioannides,et al.  Modelling of Static and Moving Objects: Digitizing Tangible and Intangible Cultural Heritage , 2017, Mixed Reality and Gamification for Cultural Heritage.

[11]  Kozaburo Hachimura,et al.  LabanEditor: Graphical editor for dance notation , 2002, Proceedings. 11th IEEE International Workshop on Robot and Human Interactive Communication.

[12]  Assaf Schuster,et al.  Laban Movement Analysis Using Kinect , 2015 .

[13]  Nikolaos Doulamis,et al.  Extraction of key postures from 3D human motion data for choreography summarization , 2017, 2017 9th International Conference on Virtual Worlds and Games for Serious Applications (VS-Games).

[14]  George Papagiannakis,et al.  Style-based motion analysis for dance composition , 2017, The Visual Computer.

[15]  Daniel Cohen-Or,et al.  Emotion control of unstructured dance movements , 2017, Symposium on Computer Animation.

[16]  Kozaburo Hachimura,et al.  Method of generating coded description of human body motion from motion-captured data , 2001, Proceedings 10th IEEE International Workshop on Robot and Human Interactive Communication. ROMAN 2001 (Cat. No.01TH8591).

[17]  Nikolaos Doulamis,et al.  Physics-based keyframe selection for human motion summarization , 2018, Multimedia Tools and Applications.

[18]  Nikolaos Doulamis,et al.  An Embodied Learning Game Using Kinect and Labanotation for Analysis and Visualization of Dance Kinesiology , 2018, 2018 10th International Conference on Virtual Worlds and Games for Serious Applications (VS-Games).

[19]  Nikolaos Doulamis,et al.  A LOW-COST MARKERLESS TRACKING SYSTEM FOR TRAJECTORYINTERPRETATION , 2017 .

[20]  Nikolaos Doulamis,et al.  Deep Learning for Computer Vision: A Brief Review , 2018, Comput. Intell. Neurosci..

[21]  Darko Kirovski,et al.  Real-time classification of dance gestures from skeleton animation , 2011, SCA '11.

[22]  Gobinda G. Chowdhury,et al.  Introduction to Modern Information Retrieval , 1999 .

[23]  Guillermo Sapiro,et al.  See all by looking at a few: Sparse modeling for finding representative objects , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Gaël Richard,et al.  Multimodal classification of dance movements using body joint trajectories and step sounds , 2013, 2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS).

[25]  Nikolaos Grammalidis,et al.  Capturing the intangible an introduction to the i-Treasures project , 2015, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[26]  Nikolaos Grammalidis,et al.  A Game-like Application for Dance Learning Using a Natural Human Computer Interface , 2015, HCI.

[27]  Jessica K. Hodgins,et al.  Hierarchical Aligned Cluster Analysis for Temporal Clustering of Human Motion , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Hui-mei Justina Hsu The Potential of Kinect in Education , 2011 .

[29]  Klara Nahrstedt,et al.  Advancing interactive collaborative mediums through tele-immersive dance (TED): a symbiotic creativity and design environment for art and computer science , 2008, ACM Multimedia.

[30]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[31]  Nikolaos Doulamis,et al.  Spatio-temporal summarization of dance choreographies , 2018, Comput. Graph..

[32]  R. Kurin Safeguarding Intangible Cultural Heritage in the 2003 UNESCO Convention: a critical appraisal , 2004 .

[33]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[34]  Nikolaos Doulamis,et al.  FAST-MDL: Fast Adaptive Supervised Training of multi-layered deep learning models for consistent object tracking and classification , 2016, 2016 IEEE International Conference on Imaging Systems and Techniques (IST).

[35]  Reinhard Klein,et al.  Efficient unsupervised temporal segmentation of human motion , 2014, SCA '14.

[36]  Andreas Aristidou,et al.  Emotion Analysis and Classification: Understanding the Performers' Emotions Using the LMA Entities , 2015, Comput. Graph. Forum.

[37]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[38]  Andreas Aristidou,et al.  Folk Dance Evaluation Using Laban Movement Analysis , 2015, JOCCH.

[39]  E. Protopapadakis,et al.  FOLK DANCE PATTERN RECOGNITION OVER DEPTH IMAGES ACQUIRED VIA KINECT SENSOR , 2017 .

[40]  Zhengyou Zhang,et al.  Microsoft Kinect Sensor and Its Effect , 2012, IEEE Multim..

[41]  Dohyung Kim,et al.  Classification of K-Pop Dance Movements Based on Skeleton Information Obtained by a Kinect Sensor , 2017, Sensors.