Intention Estimation and Recommendation System Based on Attention Sharing

In human-agent interactions, attention sharing plays a key role in understanding other’s intention without explicit verbal explanation. Deep learning algorithms are recently used to model these interactions in a complex real world environment. In this paper we propose a deep learning based intention estimation and recommendation system by understanding humans attention based on their gestures. Action-object affordances are modeled using stacked auto-encoder, which represents the relationships between actions and objects. Intention estimation and object recommendation system according to human intention is implemented based on an affordance model. Experimental result demonstrates meaningful intention estimation and recommendation performance in the real-world scenarios.

[1]  J. Gibson The Ecological Approach to Visual Perception , 1979 .

[2]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[3]  Uwe D. Hanebeck,et al.  A generic model for estimating user intentions in human-robot cooperation , 2005, ICINCO.

[4]  Danica Kragic,et al.  Visual object-action recognition: Inferring object affordances from human demonstration , 2011, Comput. Vis. Image Underst..

[5]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[6]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[7]  Monica N. Nicolescu,et al.  Deep networks for predicting human intent with respect to objects , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[8]  C. Moore,et al.  Joint attention : its origins and role in development , 1995 .

[9]  Qi Cheng,et al.  Human intention recognition in Smart Assisted Living Systems using a Hierarchical Hidden Markov Model , 2008, 2008 IEEE International Conference on Automation Science and Engineering.

[10]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[11]  Minho Lee,et al.  Probabilistic human intention modeling for cognitive augmentation , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[12]  A. Treisman Strategies and models of selective attention. , 1969, Psychological review.

[13]  J. Duncan Selective attention and the organization of visual information. , 1984, Journal of experimental psychology. General.

[14]  Minho Lee,et al.  Affective saliency map considering psychological distance , 2011, Neurocomputing.

[15]  Manuel Lopes,et al.  Learning Object Affordances: From Sensory--Motor Coordination to Imitation , 2008, IEEE Transactions on Robotics.

[16]  Irina Rish,et al.  An empirical study of the naive Bayes classifier , 2001 .