Transferring Human Tutor's Style to Pedagogical Agent: A Possible Way by Leveraging Variety of Artificial Intelligence Achievements

Pedagogical agents (P A) are lifelike characters presented on a computer screen that guide users through multimedia learning environments. Evidences show that P A has effect on promoting learning. However, P A field still faces common problems to be solved, such as how to promote the trust relationship among P A and learners. Such questions relate to how to make P A behaves more like human. This paper argues that solving these problems requires at least P A to simulate humans better in appearance, speech and motion, a possible solution is to transfer a human tutor's style to P A. Traditional P A production methods rely on the authoring tool, such as Microsoft agent, this type of methods are highly depending on manual production, so it is difficult to transfer the style of human tutor to P A well. Meanwhile, the production cycle is long, the cost is high, and the adaptability is not ideal. Based on the analysis of the achievements of interdisciplinary literature, this paper points out that based on various machine learning technologies, the above problems can be solved to a great extent. The related technologies are more likely to enhance the human-like of PA, such as the establishment of teacher strategy and behavior prediction model.

[1]  Yelong Shen,et al.  Socialized Gaussian Process Model for Human Behavior Prediction in a Health Social Network , 2012, 2012 IEEE 12th International Conference on Data Mining.

[2]  James C. Lester,et al.  Face-to-Face Interaction with Pedagogical Agents, Twenty Years Later , 2016, International Journal of Artificial Intelligence in Education.

[3]  Hao Wang,et al.  Ontology-based deep learning for human behavior prediction in health social networks , 2015, BCB.

[4]  Ari Shapiro,et al.  Building a Character Animation System , 2011, MIG.

[5]  A. L. Baylor,et al.  A Social-Cognitive Framework for Pedagogical Agents as Learning Companions , 2006 .

[6]  Danqi Chen,et al.  Reasoning With Neural Tensor Networks for Knowledge Base Completion , 2013, NIPS.

[7]  Justine Cassell,et al.  BEAT: the Behavior Expression Animation Toolkit , 2001, Life-like characters.

[8]  R. Mayer,et al.  An embodiment effect in computer-based learning with animated pedagogical agents. , 2012, Journal of experimental psychology. Applied.

[9]  Yuxuan Wang,et al.  Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron , 2018, ICML.

[10]  M. Oshita,et al.  Motion Style Transformation by Extracting and Applying Motion Features , 2006 .

[11]  Hao Wang,et al.  Ontology-based deep learning for human behavior prediction with explanations in health social networks , 2017, Inf. Sci..

[12]  Ira Kemelmacher-Shlizerman,et al.  Synthesizing Obama , 2017, ACM Trans. Graph..

[13]  Yisong Yue,et al.  A deep learning approach for generalized speech animation , 2017, ACM Trans. Graph..

[14]  Heiga Zen,et al.  WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[15]  Sercan Ömer Arik,et al.  Deep Voice 3: 2000-Speaker Neural Text-to-Speech , 2017, ICLR 2018.

[16]  Yanghee Kim,et al.  Pedagogical Agents as Social Models to Influence Learner Attitudes , 2007 .

[17]  R. Mayer The Cambridge Handbook of Multimedia Learning: Principles Based on Social Cues in Multimedia Learning: Personalization, Voice, Image, and Embodiment Principles , 2014 .

[18]  Jens Lehmann,et al.  Neural Network-based Question Answering over Knowledge Graphs on Word and Character Level , 2017, WWW.

[19]  Maurizio Mancini,et al.  Implementing Expressive Gesture Synthesis for Embodied Conversational Agents , 2005, Gesture Workshop.

[20]  Alex Pentland,et al.  Modeling and Prediction of Human Behavior , 1999, Neural Computation.

[21]  George Veletsianos,et al.  Pedagogical Agents , 2013 .

[22]  Yoshua Bengio,et al.  Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus , 2016, ACL.

[23]  Yoshua Bengio,et al.  ObamaNet: Photo-realistic lip-sync from text , 2017, ArXiv.

[24]  Taku Komura,et al.  Phase-functioned neural networks for character control , 2017, ACM Trans. Graph..

[25]  Zhiyuan Liu,et al.  Learning Entity and Relation Embeddings for Knowledge Graph Completion , 2015, AAAI.

[26]  Mitesh M. Khapra,et al.  Generating Natural Language Question-Answer Pairs from a Knowledge Graph Using a RNN Based Question Generation Model , 2017, EACL.

[27]  Carolyn Penstein Rosé,et al.  Tools for Authoring a Dialogue Agent that Participates in Learning Studies , 2007, AIED.

[28]  Voicu Popescu,et al.  Hand Gesture and Mathematics Learning: Lessons From an Avatar , 2017, Cogn. Sci..