Principles and experimentations of self-organizing embedded agents allowing learning from demonstration in ambient robotics

Ambient systems are populated by many heterogeneous devices to provide adequate services to their users. The adaptation of an ambient system to the specific needs of its users is a challenging task. Because human-system interaction has to be as natural as possible, we propose an approach based on Learning from Demonstration (LfD). LfD is an interesting approach to generalize what has been observed during the demonstration to similar situations. However, using LfD in ambient systems needs adaptivity of the learning technique. We present ALEX, a multi-agent system able to dynamically learn and reuse contexts from demonstrations performed by a tutor. The results of the experiments performed on both a real and a virtual robot show interesting properties of our technology for ambient applications. Extreme Sensitive Robotic as a bottom-up approach to deal with complexity in ambient robotic.Self-adaptive multi-agent system for learning from demonstration in ambient robotic.Experiments show that tutorship learning with Context agent is promising.

[1]  Gal A. Kaminka,et al.  Autonomous Agents Research in Robotics: A Report from the Trenches , 2012, AAAI Spring Symposium: Designing Intelligent Robots.

[2]  Manuela M. Veloso,et al.  Confidence-based policy learning from demonstration using Gaussian mixture models , 2007, AAMAS '07.

[3]  Chrystopher L. Nehaniv,et al.  Teaching robots by moulding behavior and scaffolding the environment , 2006, HRI '06.

[4]  Pierre Glize,et al.  Principles and Properties of a MAS Learning Algorithm: A Comparison with Standard Learning Algorithms Applied to Implicit Feedback Assessment , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[5]  David Silver,et al.  Learning from Demonstration for Autonomous Navigation in Complex Unstructured Terrain , 2010, Int. J. Robotics Res..

[6]  Marie-Pierre Gleizes,et al.  Extreme Sensitive Robotic - A Context-Aware Ubiquitous Learning , 2015, ICAART.

[7]  Hedyeh Rafii-Tari,et al.  A cooperative control framework for haptic guidance of bimanual surgical tasks based on Learning From Demonstration , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[8]  Andrew W. Moore,et al.  Efficient memory-based learning for robot control , 1990 .

[9]  Joo-Hyun Chung Ambient Robotics for Meaningful Life of the Elderly , 2015 .

[10]  Thomas G. Dietterich Adaptive computation and machine learning , 1998 .

[11]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[12]  Manuel Lopes,et al.  Robot programming from demonstration, feedback and transfer , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13]  Stefan Schaal,et al.  Robot Programming by Demonstration , 2009, Springer Handbook of Robotics.

[14]  Francesco Borrelli,et al.  Autonomous car following: A learning-based approach , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[15]  K. Dautenhahn,et al.  The agent-based perspective on imitation , 2002 .

[16]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[17]  K. Dautenhahn,et al.  The correspondence problem , 2002 .

[18]  Hani Hagras,et al.  Creating an ambient-intelligence environment using embedded agents , 2004, IEEE Intelligent Systems.

[19]  Thomas Hellström,et al.  A formalism for learning from demonstration , 2010, Paladyn J. Behav. Robotics.

[20]  Zoran Miljkovic,et al.  Neural network learning from demonstration and epipolar geometry for visual control of a nonholonomic mobile robot , 2014, Soft Comput..

[21]  Bruno Siciliano,et al.  EuRoC - The Challenge Initiative for European Robotics , 2014, ISR 2014.

[22]  Marie-Pierre Gleizes,et al.  Self-adaptive Complex Systems , 2011, EUMAS.

[23]  R. Schapire,et al.  Toward Efficient Agnostic Learning , 1994 .

[24]  Peter Stone,et al.  Reinforcement Learning with Human Feedback in Mountain Car , 2011, AAAI Spring Symposium: Help Me Help You: Bridging the Gaps in Human-Agent Collaboration.

[25]  Marie-Pierre Gleizes,et al.  Adelfe 2.0 , 2014, Handbook on Agent-Oriented Design Processes.

[26]  Pierre Glize,et al.  Self-Adaptation of a Learnt Behaviour by Detecting and by Managing User's Implicit Contradictions , 2014, 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).

[27]  Linda Sellie,et al.  Toward efficient agnostic learning , 1992, COLT '92.

[28]  Christopher J. Gatti The Mountain Car Problem , 2015 .

[29]  Alex M. Andrew,et al.  ROBOT LEARNING, edited by Jonathan H. Connell and Sridhar Mahadevan, Kluwer, Boston, 1993/1997, xii+240 pp., ISBN 0-7923-9365-1 (Hardback, 218.00 Guilders, $120.00, £89.95). , 1999, Robotica (Cambridge. Print).

[30]  Nicolas Verstaevel,et al.  Extreme Sensitive Robotic , 2015, ICAART 2015.

[31]  D. Capera,et al.  The AMAS theory for complex problem solving based on self-organizing cooperative agents , 2003, WET ICE 2003. Proceedings. Twelfth IEEE International Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises, 2003..

[32]  Najdan Vukovic,et al.  Trajectory learning and reproduction for differential drive mobile robots based on GMM/HMM and dynamic time warping using learning from demonstration framework , 2015, Eng. Appl. Artif. Intell..

[33]  Sonia Chernova,et al.  Reinforcement Learning from Demonstration through Shaping , 2015, IJCAI.

[34]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[35]  Pierre Glize,et al.  Model-free Optimization of an Engine Control Unit thanks to Self-Adaptive Multi-Agent Systems , 2014 .

[36]  Martin A. Riedmiller,et al.  Learning by Experience from Others — Social Learning and Imitation in Animals and Robots , 2003 .

[37]  Cynthia Breazeal,et al.  Training a Robot via Human Feedback: A Case Study , 2013, ICSR.

[38]  Marie-Pierre Gleizes,et al.  The Self-Adaptive Context Learning Pattern: Overview and Proposal , 2015, CONTEXT.