Integrated learning for interactive synthetic characters

The ability to learn is a potentially compelling and important quality for interactive synthetic characters. To that end, we describe a practical approach to real-time learning for synthetic characters. Our implementation is grounded in the techniques of reinforcement learning and informed by insights from animal training. It simplifies the learning task for characters by (a) enabling them to take advantage of predictable regularities in their world, (b) allowing them to make maximal use of any supervisory signals, and (c) making them easy to train by humans.We built an autonomous animated dog that can be trained with a technique used to train real dogs called "clicker training". Capabilities demonstrated include being trained to recognize and use acoustic patterns as cues for actions, as well as to synthesize new actions from novel paths through its motion space.A key contribution of this paper is to demonstrate that by addressing the three problems of state, action, and state-action space discovery at the same time, the solution for each becomes easier. Finally, we articulate heuristics and design principles that make learning practical for synthetic characters.

[1]  C. Warden The Animal Mind , 1933 .

[2]  K. Lorenz,et al.  Motivation of human and animal behavior : an ethological view , 1973 .

[3]  C. W. Therrien,et al.  Decision, Estimation and Classification: An Introduction to Pattern Recognition and Related Topics , 1989 .

[4]  Richard S. Sutton,et al.  Reinforcement learning architectures for animats , 1991 .

[5]  Gary L. Drescher,et al.  Made-up minds - a constructivist approach to artificial intelligence , 1991 .

[6]  Leslie Pack Kaelbling,et al.  Learning in embedded systems , 1993 .

[7]  Michiel van de Panne,et al.  Sensor-actuator networks , 1993, SIGGRAPH.

[8]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[9]  M. V. D. Panne,et al.  Synthesizing Parameterized Motions , 1994 .

[10]  Demetri Terzopoulos,et al.  Artificial fishes: physics, locomotion, perception, behavior , 1994, SIGGRAPH.

[11]  Bruce Blumberg,et al.  Multi-level direction of autonomous creatures for real-time virtual environments , 1995, SIGGRAPH.

[12]  Demetri Terzopoulos,et al.  Automated learning of muscle-actuated locomotion through control abstraction , 1995, SIGGRAPH.

[13]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[14]  Ken Perlin,et al.  Improv: a system for scripting interactive actors in virtual worlds , 1996, SIGGRAPH.

[15]  Dave Cliff,et al.  Creatures: artificial life autonomous software agents for home entertainment , 1997, AGENTS '97.

[16]  Jessica K. Hodgins,et al.  Adapting simulated behaviors for new characters , 1997, SIGGRAPH.

[17]  Craig W. Reynolds Flocks, herds, and schools: a distributed behavioral model , 1998 .

[18]  Michael Gleicher,et al.  Retargetting motion to new characters , 1998, SIGGRAPH.

[19]  S. Shettleworth Cognition, evolution, and behavior , 1998 .

[20]  Geoffrey E. Hinton,et al.  NeuroAnimator: fast neural network emulation and control of physics-based models , 1998, SIGGRAPH.

[21]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[22]  C. Collin,et al.  An Introduction to Natural Computation , 1998, Trends in Cognitive Sciences.

[23]  Michael F. Cohen,et al.  Verbs and Adverbs: Multidimensional Motion Interpolation , 1998, IEEE Computer Graphics and Applications.

[24]  John Funge,et al.  Cognitive modeling: knowledge, reasoning and planning for intelligent characters , 1999, SIGGRAPH.

[25]  K. Pryor Clicker training for dogs , 1999 .

[26]  R. Brill Animal training: successful animal management through positive reinforcement , 2000 .

[27]  Bruce Blumberg,et al.  Motivation driven learning for interactive synthetic characters , 2000, AGENTS '00.

[28]  C. Gallistel,et al.  Time, rate, and conditioning. , 2000, Psychological review.

[29]  Bruce Blumberg,et al.  Interactive Training for Synthetic Characters , 2000, AAAI/IAAI.

[30]  Bruce Blumberg,et al.  A Layered Brain Architecture for Synthetic Creatures , 2001, IJCAI.

[31]  Marc Downie,et al.  Animation and Music: The Music and Movement of Synthetic Characters , 2001 .

[32]  Damian A. Isla,et al.  Creature Smarts: The Art and Architecture of a Virtual Brain , 2001 .

[33]  Damian A. Isla,et al.  The virtual hippocampus : spatial common sense for synthetic creatures , 2001 .

[34]  Alex Pentland,et al.  Expectation Maximization for Weakly Labeled Data , 2001, ICML.

[35]  Petros Faloutsos,et al.  Composable controllers for physics-based character animation , 2001, SIGGRAPH.

[36]  L. Coppinger,et al.  Dogs: A New Understanding of Canine Origin, Behavior and Evolution , 2001 .

[37]  Frédéric Kaplan Taming robots with clicker training A solution for teaching complex behaviors , 2001 .

[38]  Steve Rabin,et al.  AI Game Programming Wisdom , 2002 .

[39]  Bruce Blumberg,et al.  AlphaWolf: Social Learning, Emotion and Development in Autonomous Virtual Agents , 2002, WRAC.

[40]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.