Spatio-temporal summarization of dance choreographies

Abstract An important issue in performing dance analysis is the automatic extraction of its choreographic patterns, since these elements provide an abstract representation of the semantics of the dance and encode the overall dance storytelling. However, application of conventional video summarization algorithms on dance sequences cannot appropriately retrieve their choreographic patterns, since a dance is composed of an ordered set of sequential elements which are often repeated in time. Additionally, 3D geometry is lost using color information. For this reason, in this paper we propose a new dance summarization scheme of 3D motion captured data (in the form of skeleton joints coordinates) recorded using the Vicon motion capture system. The proposed key frame extraction method implements a hierarchical scheme that exploits spatio-temporal variations of dance features. Initially, global holistic descriptors are extracted to localize the key choreographic steps of a dance (coarse representation). Then, each segment is further decomposed into finer sub-segments to improve dance representativity (fine representation). The abstraction scheme exploits the concepts of a Sparse Modeling Representative Selection (SMRS) appropriately modified to enable spatio-temporal modelling of the dance sequences through a hierarchical decomposition algorithm. Our approach is evaluated on thirty folkloric dance sequences recorded at the Aristotle University of Thessaloniki under the framework of Terpsichore project representing five different choreographies and on publicly available datasets from Carnegie–Mellon University, which depict performances on theatrical kinesiology. Comparisons with other traditional video summarization methods indicate a clear superiority of the proposed hierarchical spatio-temporal decomposition scheme.

[1]  Darko Kirovski,et al.  Real-time classification of dance gestures from skeleton animation , 2011, SCA '11.

[2]  Taku Komura,et al.  A Virtual Reality Dance Training System Using Motion Capture Technology , 2011, IEEE Transactions on Learning Technologies.

[3]  Yannis Avrithis,et al.  Efficient content representation in MPEG video databases , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[4]  Zhi-Hua Zhou,et al.  Multi-View Video Summarization , 2010, IEEE Transactions on Multimedia.

[5]  Stéphane Dupont,et al.  An interactive installation for browsing a dance video database , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[6]  Fadi Dornaika,et al.  Decremental Sparse Modeling Representative Selection for prototype selection , 2015, Pattern Recognit..

[7]  Guillermo Sapiro,et al.  See all by looking at a few: Sparse modeling for finding representative objects , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[9]  E. Protopapadakis,et al.  FOLK DANCE PATTERN RECOGNITION OVER DEPTH IMAGES ACQUIRED VIA KINECT SENSOR , 2017 .

[10]  Chong-Wah Ngo,et al.  Video summarization and scene detection by graph modeling , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Fotis Liarokapis,et al.  Developing serious games for cultural heritage: a state-of-the-art review , 2010, Virtual Reality.

[12]  Theodora A. Varvarigou,et al.  Adaptive Algorithms for Interactive Multimedia , 2003, IEEE Multim..

[13]  Klara Nahrstedt,et al.  Advancing interactive collaborative mediums through tele-immersive dance (TED): a symbiotic creativity and design environment for art and computer science , 2008, ACM Multimedia.

[14]  Marinos Ioannides,et al.  Modelling of Static and Moving Objects: Digitizing Tangible and Intangible Cultural Heritage , 2017, Mixed Reality and Gamification for Cultural Heritage.

[15]  Stefanos D. Kollias,et al.  Efficient summarization of stereoscopic video sequences , 2000, IEEE Trans. Circuits Syst. Video Technol..

[16]  Kristen Grauman,et al.  Story-Driven Summarization for Egocentric Video , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Georgios Tziritas,et al.  Equivalent Key Frames Selection Based on Iso-Content Principles , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Andreas Aristidou,et al.  Folk Dance Evaluation Using Laban Movement Analysis , 2015, JOCCH.

[19]  Stefanos D. Kollias,et al.  A fuzzy video content representation for video summarization and content-based retrieval , 2000, Signal Process..

[20]  Nikolaos Doulamis,et al.  A LOW-COST MARKERLESS TRACKING SYSTEM FOR TRAJECTORYINTERPRETATION , 2017 .

[21]  Dohyung Kim,et al.  Classification of K-Pop Dance Movements Based on Skeleton Information Obtained by a Kinect Sensor , 2017, Sensors.

[22]  Tong Wu,et al.  Hierarchical Union-of-Subspaces Model for Human Activity Summarization , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[23]  Andreas Aristidou,et al.  Digitization of Cypriot Folk Dances , 2012, EuroMed.

[24]  Daniel Cohen-Or,et al.  Emotion control of unstructured dance movements , 2017, Symposium on Computer Animation.

[25]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[26]  Travis T Simpson,et al.  Dance recognition system using lower body movement. , 2014, Journal of applied biomechanics.

[27]  Stefanos D. Kollias,et al.  Non-sequential video content representation using temporal variation of feature vectors , 2000, 2000 Digest of Technical Papers. International Conference on Consumer Electronics. Nineteenth in the Series (Cat. No.00CH37102).

[28]  Alexandra Poulovassilis,et al.  Learning as immersive experiences: Using the four-dimensional framework for designing and evaluating immersive learning experiences in a virtual world , 2010, Br. J. Educ. Technol..

[29]  Nikolaos Grammalidis,et al.  Capturing the intangible an introduction to the i-Treasures project , 2015, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[30]  George Papagiannakis,et al.  Style-based motion analysis for dance composition , 2017, The Visual Computer.

[31]  Nikos Grammalidis,et al.  Classification of Multidimensional Time-Evolving Data Using Histograms of Grassmannian Points , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[32]  Nikolaos Doulamis,et al.  Extraction of key postures from 3D human motion data for choreography summarization , 2017, 2017 9th International Conference on Virtual Worlds and Games for Serious Applications (VS-Games).

[33]  Zhengyou Zhang,et al.  Microsoft Kinect Sensor and Its Effect , 2012, IEEE Multim..

[34]  Yuichi Iwadate,et al.  3D Archive System for Traditional Performing Arts , 2011, International Journal of Computer Vision.

[35]  Gaël Richard,et al.  Multimodal classification of dance movements using body joint trajectories and step sounds , 2013, 2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS).