Intelligent invariance techniques for music gesture recognition based on skin modelling

A computer vision methodology for the recognition of finger gestures performed on a music instrument has been recently developed and implemented in the PianOrasis system. PianOrasis recognises the gestures of all the five fingers simultaneously, but not in real-time. In this paper, an optimisation of the above methodology is presented, implying the recognition of finger musical gestures performed in space. Scale and rotation invariance techniques are integrated into the system increasing the recognition quality and reducing the processing time. Scale invariance rests on the deterministic modelling of the number of skin pixels in the image. The proposed modelling enable three different sets of filtering parameters for the hand segmentation process, overcoming a long and manual preliminary analysis. More flexible finger gestures, performed in space without music instrument, can be recognised because of the integration of the rotation invariance.

[1]  Lars Bretzner,et al.  Hand gesture recognition using multi-scale colour features, hierarchical models and particle filtering , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[2]  Roland T. Chin,et al.  On image analysis by the methods of moments , 1988, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Alireza Khotanzad,et al.  Invariant Image Recognition by Zernike Moments , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Sotiris Manitsaris Computer vision for the gesture recognition: gesture analysis and stochastic modelling in music interaction (Vision par ordinateur pour la reconnaissance des gestes: analyse et modélisation stochastique du geste dans l'interaction musicale) , 2010 .

[5]  Matti Pietikäinen,et al.  An Experimental Comparison of Autoregressive and Fourier-Based Descriptors in 2D Shape Classification , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Jun Xu,et al.  Sign language image processing for intelligent communication by a communications satellite , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  William T. Freeman,et al.  Orientation Histograms for Hand Gesture Recognition , 1995 .

[8]  Rómer Rosales,et al.  3D Hand Pose Reconstruction Using Specialized Mappings , 2001, ICCV.

[9]  Jae C. Oh,et al.  Improving Learning of Genetic Rule-Based Classifier Systems , 1994, IEEE Trans. Syst. Man Cybern. Syst..

[10]  Lilly Spirkovska,et al.  Coarse-coded higher-order neural networks for PSRI object recognition , 1993, IEEE Trans. Neural Networks.

[11]  Chin-Chen Chang,et al.  Hand pose recognition using curvature scale space , 2002, Object recognition supported by user interaction for service robots.

[12]  Tsutomu Miyasato,et al.  Multi-camera hand pose recognition system using skeleton image , 1995, Proceedings 4th IEEE International Workshop on Robot and Human Communication.

[13]  Paulo J. G. Lisboa,et al.  Translation, rotation, and scale invariant pattern recognition by high-order neural networks and moment classifiers , 1992, IEEE Trans. Neural Networks.

[14]  D Mendlovic,et al.  Composite harmonic filters for scale-, projection-, and shift-invariant pattern recognition. , 1995, Applied optics.