Hierarchical learning recurrent neural networks for 3D motion synthesis

Three-dimensional human motion synthesis is one of the key technologies in the field of computer animation and multimedia applications. It is well known that the human body's own motion is full of strong personality, emotion, and high-dimensional characteristics, leading to the automatic synthesis of diverse and lifelike 3D human motion data continues to be a challenging task. Facing the challenge, this paper proposes a human motion synthesis framework based on hierarchical learning recurrent neural networks (HL-RNN). The framework includes a low-level network and a high-level network, which are used to extract the path information of the movement and the spatio-temporal relationship of the human bone structure, respectively. Then, after fusion, motions that satisfy the path constraints could be generated. This method can not only synthesize high-quality human movements that follow a specified trajectory, but also synthesize smooth transitions between various movements, and can also be used to synthesize data of different motion styles. Compared with some latest methods, experiments showed that the proposed method can significantly improve the quality and generalization performance of motion synthesis.

[1]  Bo Xu,et al.  Encoder–decoder recurrent network model for interactive character animation generation , 2017, The Visual Computer.

[2]  Stephen P. Boyd,et al.  End-to-end optimization of optics and image processing for achromatic extended depth of field and super-resolution imaging , 2018, ACM Trans. Graph..

[3]  Weili Ding,et al.  Human posture recognition based on multiple features and rule learning , 2020, International Journal of Machine Learning and Cybernetics.

[4]  Taku Komura,et al.  Phase-functioned neural networks for character control , 2017, ACM Trans. Graph..

[5]  Multi-Contact Locomotion Using a Contact Graph with Feasibility Predictors , 2017, ACM Trans. Graph..

[6]  Sergey Levine,et al.  Continuous character control with low-dimensional embeddings , 2012, ACM Trans. Graph..

[7]  Matthew Johnson-Roberson,et al.  PedX: Benchmark Dataset for Metric 3-D Pose Estimation of Pedestrians in Complex Urban Intersections , 2018, IEEE Robotics and Automation Letters.

[8]  José M. F. Moura,et al.  Teaching Robots to Predict Human Motion , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[9]  Taku Komura,et al.  A Deep Learning Framework for Character Motion Synthesis and Editing , 2016, ACM Trans. Graph..

[10]  Taku Komura,et al.  Fast Neural Style Transfer for Motion Data , 2017, IEEE Computer Graphics and Applications.

[11]  Michael J. Black,et al.  On Human Motion Prediction Using Recurrent Neural Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Marcelo Kallmann,et al.  Multi-modal data-driven motion planning and synthesis , 2015, MIG.

[13]  Derek Nowrouzezahrai,et al.  Learning hatching for pen-and-ink illustration of surfaces , 2012, TOGS.

[14]  C. Lee Giles,et al.  A Neural Temporal Model for Human Motion Prediction , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Yi Zhou,et al.  Auto-Conditioned Recurrent Networks for Extended Complex Human Motion Synthesis , 2017, ICLR.

[16]  Sung-Hee Lee,et al.  Multi-contact locomotion using a contact graph with feasibility predictors , 2017, TOGS.

[17]  Dimitri Plemenos,et al.  Artificial Intelligence Techniques for Computer Graphics , 2008, Artificial Intelligence Techniques for Computer Graphics.

[18]  Christopher Joseph Pal,et al.  Recurrent transition networks for character locomotion , 2018, SIGGRAPH Asia Technical Briefs.

[19]  Yuval Tassa,et al.  Learning human behaviors from motion capture by adversarial imitation , 2017, ArXiv.

[20]  Paul Bremner,et al.  Audio-Driven Robot Upper-Body Motion Synthesis , 2020, IEEE Transactions on Cybernetics.

[21]  Dario Pavllo,et al.  Modeling Human Motion with Quaternion-Based Neural Networks , 2019, International Journal of Computer Vision.

[22]  Steve Marschner,et al.  Matching Real Fabrics with Micro-Appearance Models , 2015, ACM Trans. Graph..

[23]  Danica Kragic,et al.  Classify, predict, detect, anticipate and synthesize: Hierarchical recurrent latent variable models for human activity modeling , 2018, ArXiv.

[24]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[25]  Sukwon Lee,et al.  Hand Contact between Remote Users through Virtual Avatars , 2016, CASA.

[26]  Zhiyong Wang,et al.  Combining Recurrent Neural Networks and Adversarial Training for Human Motion Synthesis and Control , 2018, IEEE Transactions on Visualization and Computer Graphics.

[27]  Hans-Peter Seidel,et al.  Design and volume optimization of space structures , 2017, ACM Trans. Graph..

[28]  Qiang Ji,et al.  Bayesian Adversarial Human Motion Synthesis , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Silvio Savarese,et al.  Structural-RNN: Deep Learning on Spatio-Temporal Graphs , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Jehee Lee,et al.  Interactive character animation by learning multi-objective control , 2018, ACM Trans. Graph..

[31]  Glen Berseth,et al.  DeepLoco , 2017, ACM Trans. Graph..

[32]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[33]  Taku Komura,et al.  A Recurrent Variational Autoencoder for Human Motion Synthesis , 2017, BMVC.

[34]  Sergey Levine,et al.  DeepMimic , 2018, ACM Trans. Graph..

[35]  Avinash Sharma,et al.  GlocalNet: Class-aware Long-term Human Motion Synthesis , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[36]  Juan Carlos Niebles,et al.  Socially and Contextually Aware Human Motion and Pose Forecasting , 2020, IEEE Robotics and Automation Letters.

[37]  Minh Vo,et al.  Long-term Human Motion Prediction with Scene Context , 2020, ECCV.

[38]  Matthew Johnson-Roberson,et al.  Bio-LSTM: A Biomechanically Inspired Recurrent Neural Network for 3-D Pedestrian Pose and Gait Prediction , 2018, IEEE Robotics and Automation Letters.

[39]  Dario Pavllo,et al.  QuaterNet: A Quaternion-based Recurrent Model for Human Motion , 2018, BMVC.

[40]  Taesoo Kwon,et al.  Real‐time Locomotion Controller using an Inverted‐Pendulum‐based Abstract Model , 2018, Comput. Graph. Forum.

[41]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[42]  J. Stoer,et al.  Introduction to Numerical Analysis , 2002 .