Dancing with Turks

Dance is a dynamic art form that reflects a wide range of cultural diversity and individuality. With the advancement of motion-capture technology combined with crowd-sourcing and machine learning algorithms, we explore the complex relationship between perceived dance quality/dancer's gender and dance movements/music respectively. As a feasibility study, we construct a computational framework for an analysis-synthesis-feedback loop using a novel multimedia dance-music texture representation. Furthermore, we integrate crowd-sourcing, music and motion-capture data, and machine learning-based methods for dance segmentation, analysis and synthesis of new dancers. A quantitative validation of this framework on a motion-capture dataset of 172 dancers evaluated by more than 400 independent on-line raters demonstrates significant correlation between human perception and the algorithmically intended dance quality or gender of synthesized dancers. The technology illustrated in this work has a high potential to advance the multimedia entertainment industry via dancing with Turks.

[1]  Keith Grochow,et al.  Dance reveals symmetry especially in young men , 2005, Nature.

[2]  G. Johansson Visual perception of biological motion and a model for its analysis , 1973 .

[3]  Tao Qin,et al.  An active feedback framework for image retrieval , 2008, Pattern Recognit. Lett..

[4]  Christoph Bregler,et al.  Motion capture assisted animation: texturing and synthesis , 2002, ACM Trans. Graph..

[5]  Lorenzo Torresani,et al.  Learning Motion Style Synthesis from Perceptual Observations , 2006, NIPS.

[6]  Richard Szeliski,et al.  Video textures , 2000, SIGGRAPH.

[7]  Mubarak Shah,et al.  A survey of motion analysis from moving light displays , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Jovan Popovic,et al.  Style translation for human motion , 2005, ACM Trans. Graph..

[9]  Pascal Fua,et al.  Style‐Based Motion Synthesis † , 2004, Comput. Graph. Forum.

[10]  S. Sumi Upside-down Presentation of the Johansson Moving Light-Spot Pattern , 1984, Perception.

[11]  Aaron Hertzmann,et al.  Style-based inverse kinematics , 2004, ACM Trans. Graph..

[12]  J. L. Le Saint-Milon,et al.  A real-time French text-to-speech system generating high-quality synthetic speech , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[13]  Richard M. Murray,et al.  A Mathematical Introduction to Robotic Manipulation , 1994 .

[14]  J. Cutting,et al.  Recognizing the sex of a walker from a dynamic point-light display , 1977 .

[15]  Harry Shum,et al.  Motion texture: a two-level statistical model for character motion synthesis , 2002, ACM Trans. Graph..

[16]  Lucas Kovar,et al.  Motion Graphs , 2002, ACM Trans. Graph..

[17]  Lance Williams,et al.  Motion signal processing , 1995, SIGGRAPH.

[18]  Jessica K. Hodgins,et al.  Interactive control of avatars animated with human motion data , 2002, SIGGRAPH.

[19]  Adam Finkelstein,et al.  How well do line drawings depict shape? , 2009, SIGGRAPH '09.

[20]  Yanxi Liu,et al.  Discriminative MR Image Feature Analysis for Automatic Schizophrenia and Alzheimer's Disease Classification , 2004, MICCAI.

[21]  Ken-ichi Anjyo,et al.  Fourier principles for emotion-based human figure animation , 1995, SIGGRAPH.

[22]  Wei Pan,et al.  Unsupervised hierarchical modeling of locomotion styles , 2009, ICML '09.

[23]  Aaron Hertzmann,et al.  Style machines , 2000, SIGGRAPH 2000.

[24]  Yanxi Liu,et al.  Near-regular texture analysis and manipulation , 2004, SIGGRAPH 2004.

[25]  Michael F. Cohen,et al.  Verbs and Adverbs: Multidimensional Motion Interpolation , 1998, IEEE Computer Graphics and Applications.

[26]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[27]  Christoph Bregler,et al.  Video Rewrite: Driving Visual Speech with Audio , 1997, SIGGRAPH.

[28]  Jitendra Malik,et al.  Tracking people with twists and exponential maps , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[29]  David J. Fleet,et al.  Multifactor Gaussian process models for style-content separation , 2007, ICML '07.

[30]  Thomas S. Huang,et al.  Relevance feedback in image retrieval: A comprehensive review , 2003, Multimedia Systems.

[31]  Alexei A. Efros,et al.  Image quilting for texture synthesis and transfer , 2001, SIGGRAPH.

[32]  Okan Arikan,et al.  Interactive motion generation from examples , 2002, ACM Trans. Graph..

[33]  Yanxi Liu,et al.  The Promise and Perils of Near-Regular Texture , 2004, International Journal of Computer Vision.

[34]  Geoffrey E. Hinton,et al.  Factored conditional restricted Boltzmann Machines for modeling motion style , 2009, ICML '09.

[35]  Zoran Popovic,et al.  Motion warping , 1995, SIGGRAPH.

[36]  Yanxi Liu,et al.  Facial asymmetry quantification for expression invariant human identification , 2003, Comput. Vis. Image Underst..

[37]  Yanxi Liu,et al.  Facial asymmetry quantification for expression invariant human identification , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[38]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.