Real-Time Musical Conducting Gesture Recognition Based on a Dynamic Time Warping Classifier Using a Single-Depth Camera

Gesture recognition is a human–computer interaction method, which is widely used for educational, medical, and entertainment purposes. Humans also use gestures to communicate with each other, and musical conducting uses gestures in this way. In musical conducting, conductors wave their hands to control the speed and strength of the music played. However, beginners may have a limited comprehension of the gestures and might not be able to properly follow the ensembles. Therefore, this paper proposes a real-time musical conducting gesture recognition system to help music players improve their performance. We used a single-depth camera to capture image inputs and establish a real-time dynamic gesture recognition system. The Kinect software development kit created a skeleton model by capturing the palm position. Different palm gestures were collected to develop training templates for musical conducting. The dynamic time warping algorithm was applied to recognize the different conducting gestures at various conducting speeds, thereby achieving real-time dynamic musical conducting gesture recognition. In the experiment, we used 5600 examples of three basic types of musical conducting gestures, including seven capturing angles and five performing speeds for evaluation. The experimental result showed that the average accuracy was 89.17% in 30 frames per second.

[1]  Sunwoong Choi,et al.  Smartwatch User Interface Implementation Using CNN-Based Gesture Pattern Recognition , 2018, Sensors.

[2]  Hugo Jair Escalante,et al.  Simultaneous segmentation and recognition of hand gestures for human-robot interaction , 2013, 2013 16th International Conference on Advanced Robotics (ICAR).

[3]  Anton Nijholt,et al.  The Virtual Conductor: Learning and Teaching about Music, Performing, and Conducting , 2008, 2008 Eighth IEEE International Conference on Advanced Learning Technologies.

[4]  Marek R. Ogiela,et al.  Human actions recognition from motion capture recordings using signal resampling and pattern recognition methods , 2018, Ann. Oper. Res..

[5]  Maja J. Mataric,et al.  Comparing models for gesture recognition of children's bullying behaviors , 2017, 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII).

[6]  Silvia Ceccacci,et al.  A Methodology to Introduce Gesture-Based Interaction into Existing Consumer Product , 2016, HCI.

[7]  Cláudio Rosito Jung,et al.  Dynamic Time Warping for Music Conducting Gestures Evaluation , 2015, IEEE Transactions on Multimedia.

[8]  Michele Risi,et al.  A Multi-layer Parsing Strategy for On-line Recognition of Hand-drawn Diagrams , 2006, Visual Languages and Human-Centric Computing (VL/HCC'06).

[9]  Yoichiro Maeda,et al.  Melody oriented interactive chaotic sound generation system using music conductor gesture , 2014, 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[11]  Junsong Yuan,et al.  Robust Part-Based Hand Gesture Recognition Using Kinect Sensor , 2013, IEEE Transactions on Multimedia.

[12]  Marcel J. T. Reinders,et al.  Sign Language Recognition by Combining Statistical DTW and Independent Classification , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Adam Glowacz,et al.  Recognition of images of finger skin with application of histogram, image filtration and K-NN classifier , 2016 .

[14]  György Fazekas,et al.  Mood Conductor: Emotion-Driven Interactive Music Performance , 2013, 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction.

[15]  Devendrakumar H. Pal,et al.  Dynamic hand gesture recognition using kinect sensor , 2016, 2016 International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC).

[16]  Salvatore Sessa,et al.  Music conductor gesture recognition by using inertial measurement system for human-robot musical interaction , 2012, 2012 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[17]  Dipak Kumar Ghosh,et al.  Hand gesture recognition using DWT and F-ratio based feature descriptor , 2018, IET Image Process..

[18]  Anders Friberg,et al.  Systems for Interactive Control of Computer Generated Music Performance , 2013, Guide to Computing for Expressive Music Performance.

[19]  Daijin Kim,et al.  Vision-Based Hand Gesture Recognition for Understanding Musical Time Pattern and Tempo , 2007, IECON 2007 - 33rd Annual Conference of the IEEE Industrial Electronics Society.

[20]  Sansanee Auephanwiriyakul,et al.  A Novel String Grammar Unsupervised Possibilistic C-Medians Algorithm for Sign Language Translation Systems , 2017, Symmetry.

[21]  P. Kolesnik Conducting Gesture Recognition, Analysis and Performance System , 2004 .

[22]  S. Forrester Music Teacher Knowledge: An Examination of the Intersections Between Instrumental Music Teaching and Conducting , 2018 .

[23]  Ankit Chaudhary,et al.  Robust gesture recognition using Kinect: A comparison between DTW and HMM , 2015 .

[24]  Noriyuki Kawarazaki,et al.  A supporting system of chorus singing for visually impaired persons using depth image sensor , 2013, 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference.

[25]  Ana-Maria Cretu,et al.  Static and Dynamic Hand Gesture Recognition in Depth Data Using Dynamic Time Warping , 2016, IEEE Transactions on Instrumentation and Measurement.

[26]  Chang-Biau Yang,et al.  Flexible Dynamic Time Warping for Time Series Classification , 2015, ICCS.

[27]  Yi-Shin Chen,et al.  An interactive conducting system using Kinect , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[28]  Swati Nigam,et al.  A Review of Computational Approaches for Human Behavior Detection , 2018 .

[29]  Philippe Robert,et al.  PRAXIS: Towards automatic cognitive assessment using gesture recognition , 2018, Expert Syst. Appl..

[30]  Soma Mitra,et al.  Vision based Hand Gesture Recognition using Dynamic Time Warping for Indian Sign Language , 2016, 2016 International Conference on Information Science (ICIS).

[31]  Jakub Galka,et al.  Inertial Motion Sensing Glove for Sign Language Gesture Acquisition and Recognition , 2016, IEEE Sensors Journal.

[32]  Jianxin Chen,et al.  Real-time Hand Tracking Using Kinect , 2018, ICDSP.

[33]  Ayşegül Uçar,et al.  Gesture imitation and recognition using Kinect sensor and extreme learning machines , 2016 .

[34]  Ana M. Barbancho,et al.  Fast-gesture recognition and classification using Kinect: an application for a virtual reality drumkit , 2015, Multimedia Tools and Applications.

[35]  Ming Zeng,et al.  Novel Algorithm for Hand Gesture Recognition Utilizing a Wrist-Worn Inertial Sensor , 2018, IEEE Sensors Journal.

[36]  Qiu-yu Zhang,et al.  A Method of Hand Gesture Segmentation and Tracking with Appearance Based on Probability Model , 2008, 2008 Second International Symposium on Intelligent Information Technology Application.

[37]  Kia Ng,et al.  Tracking Conductors Hand Movements Using Multiple Wiimotes , 2008, 2008 International Conference on Automated Solutions for Cross Media Content and Multi-Channel Distribution.