Perceiving user's intention-for-interaction: A probabilistic multimodal data fusion scheme

Understanding people's intentions, whether expressed through action or thought, is fundamental to establishing coherent communication between people. It is especially important in non-proactive robotics, where the robot must explicitly determine when to start an interaction in a natural way. This work presents a novel approach to detecting people's intention-for-interaction. The proposed detector fuses multimodal cues, including estimated head pose, shoulder orientation, and vocal activity detection, using a probabilistic discrete-state Hidden Markov Model. The multimodal detector achieves correct detection rates of up to 80%, improving on purely audio-based and RGB-D-based variants.
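
To make the fusion idea concrete, the following is a minimal sketch of how a discrete-state Hidden Markov Model could combine binarized cues into a running estimate of intention-for-interaction. It is not the authors' implementation: the two hidden states, the binarization of each cue, the conditional-independence assumption in the emission model, and every probability value below are hypothetical and chosen only for illustration.

```python
# Hypothetical hidden states; the abstract does not specify the state set.
STATES = ("no_intention", "intention")

# Hypothetical parameters (not taken from the paper).
PRIOR = [0.8, 0.2]                      # P(state at t = 0)
TRANSITION = [[0.9, 0.1],               # P(next state | current state)
              [0.2, 0.8]]
# P(cue is active | state), indexed by state; cues are assumed binary.
CUE_PROB = {
    "head_toward_robot": [0.2, 0.8],
    "shoulders_toward_robot": [0.3, 0.7],
    "voice_active": [0.1, 0.6],
}
CUES = tuple(CUE_PROB)


def emission(obs, state):
    """P(obs | state), assuming cues are conditionally independent given the state."""
    p = 1.0
    for cue, active in zip(CUES, obs):
        q = CUE_PROB[cue][state]
        p *= q if active else 1.0 - q
    return p


def forward_filter(observations):
    """Normalized forward algorithm: returns P(state_t | obs_1..t) for each t."""
    n = len(STATES)
    belief = PRIOR[:]
    history = []
    for t, obs in enumerate(observations):
        if t > 0:
            # Predict: propagate the previous belief through the transition model.
            belief = [sum(belief[i] * TRANSITION[i][j] for i in range(n))
                      for j in range(n)]
        # Update: weight by the likelihood of the fused observation, then normalize.
        belief = [belief[s] * emission(obs, s) for s in range(n)]
        total = sum(belief)
        belief = [b / total for b in belief]
        history.append(belief)
    return history


if __name__ == "__main__":
    # Each frame is (head_toward_robot, shoulders_toward_robot, voice_active).
    frames = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (1, 1, 1)]
    for frame, belief in zip(frames, forward_filter(frames)):
        print(frame, round(belief[1], 3))
```

In this sketch, a detector would report intention-for-interaction once the filtered probability of the `intention` state crosses a chosen threshold; the threshold, like the parameters above, would need to be tuned or learned from data.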
