Self-Reflective Risk-Aware Artificial Cognitive Modeling for Robot Response to Human Behaviors

In order for cooperative robots ("co-robots") to respond to human behaviors accurately and efficiently in human-robot collaboration, interpretation of human actions, awareness of new situations, and appropriate decision making are all crucial abilities for co-robots. For this purpose, the human behaviors should be interpreted by co-robots in the same manner as human peers. To address this issue, a novel interpretability indicator is introduced so that robot actions are appropriate to the current human behaviors. In addition, the complete consideration of all potential situations of a robot's environment is nearly impossible in real-world applications, making it difficult for the co-robot to act appropriately and safely in new scenarios. This is true even when the pretrained model is highly accurate in a known situation. For effective and safe teaming with humans, we introduce a new generalizability indicator that allows a co-robot to self-reflect and reason about when an observation falls outside the co-robot's learned model. Based on topic modeling and two novel indicators, we propose a new Self-reflective Risk-aware Artificial Cognitive (SRAC) model. The co-robots are able to consider action risks and identify new situations so that better decisions can be made. Experiments both using real-world datasets and on physical robots suggest that our SRAC model significantly outperforms the traditional methodology and enables better decision making in response to human activities.

[1]  Richard M. Young,et al.  Introduction to This Special issue on Cognitive Architectures and Human-Computer , 1997, Hum. Comput. Interact..

[2]  Fei Xu,et al.  Probabilistic models of cognitive development: Towards a rational constructivist approach to the study of learning and development , 2011, Cognition.

[3]  Hans-Hellmut Nagel,et al.  Steps toward a Cognitive Vision System , 2004, AI Mag..

[4]  James L. Crowley,et al.  Things That See: Context-Aware Multi-modal Interaction , 2006, Cognitive Vision Systems.

[5]  Max Welling,et al.  Fast collapsed gibbs sampling for latent dirichlet allocation , 2008, KDD.

[6]  Giulio Sandini,et al.  A Survey of Artificial Cognitive Systems: Implications for the Autonomous Development of Mental Capabilities in Computational Agents , 2007, IEEE Transactions on Evolutionary Computation.

[7]  J.K. Aggarwal,et al.  Human activity analysis , 2011, ACM Comput. Surv..

[8]  Lynne E. Parker,et al.  4-dimensional local spatio-temporal features for human activity recognition , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  W. Bruce Croft,et al.  LDA-based document models for ad-hoc retrieval , 2006, SIGIR.

[10]  Timothy Baldwin,et al.  Automatic Evaluation of Topic Coherence , 2010, NAACL.

[11]  Christian D. Schunn,et al.  Integrating perceptual and cognitive modeling for adaptive and intelligent human-computer interaction , 2002, Proc. IEEE.

[12]  Pierre Vandergheynst,et al.  FREAK: Fast Retina Keypoint , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Murat Dundar,et al.  Bayesian Nonexhaustive Learning for Online Discovery and Modeling of Emerging Classes , 2012, ICML.

[14]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[15]  Lynne E. Parker,et al.  Bio-inspired predictive orientation decomposition of skeleton trajectories for real-time human activity prediction , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[16]  Chong Wang,et al.  Reading Tea Leaves: How Humans Interpret Topic Models , 2009, NIPS.

[17]  Marc Halbrügge,et al.  ACT-CV: Bridging the Gap between Cognitive Models and the Outer World , 2013 .

[18]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[19]  Daniel Gatica-Perez,et al.  Discovering routines from large-scale human locations using probabilistic topic models , 2011, TIST.

[20]  Xinghua Sun,et al.  Action recognition via local descriptors and holistic features , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[21]  Tieniu Tan,et al.  Relevance Topic Model for Unstructured Social Group Activity Recognition , 2013, NIPS.

[22]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories , 2006 .

[23]  Martin Buss,et al.  Interactive scene prediction for automotive applications , 2013, 2013 IEEE Intelligent Vehicles Symposium (IV).

[24]  Bruce Blumberg,et al.  A Layered Brain Architecture for Synthetic Creatures , 2001, IJCAI.

[25]  Xin Chen,et al.  Extraction Method of Gait Feature Based on Human Centroid Trajectory , 2014 .

[26]  Bernt Schiele,et al.  Discovery of activity patterns using topic models , 2008 .

[27]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Serge J. Belongie,et al.  Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[29]  John R. Anderson ACT: A simple theory of complex cognition. , 1996 .

[30]  Robert T. Collins,et al.  Vision-Based Analysis of Small Groups in Pedestrian Crowds , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Hardy Smieszek,et al.  Grundlagen und Anwendungen der Mensch-Maschine-Interaktion , 2013 .

[32]  Valérie Issarny,et al.  Dynamic decision networks for decision-making in self-adaptive systems: A case study , 2013, 2013 8th International Symposium on Software Engineering for Adaptive and Self-Managing Systems (SEAMS).

[33]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[34]  Anthony G. Cohn,et al.  Real-time Activity Recognition by Discerning Qualitative Relationships Between Randomly Chosen Visual Features , 2014, BMVC.

[35]  Ute Schmid,et al.  The challenge of complexity for cognitive systems , 2011, Cognitive Systems Research.

[36]  Gregory Dudek,et al.  Autonomous adaptive exploration using realtime online spatiotemporal topic modeling , 2014, Int. J. Robotics Res..

[37]  J. Tenenbaum,et al.  Probabilistic models of cognition: where next? , 2006, Trends in Cognitive Sciences.

[38]  John E. Laird,et al.  The Soar Cognitive Architecture , 2012 .

[39]  Shlomo Zilberstein,et al.  Temporal and Object Relations in Plan and Activity Recognition for Robots Using Topic Models , 2014, AAAI Fall Symposia.

[40]  Stefan Trausan-Matu,et al.  Improving Topic Evaluation Using Conceptual Knowledge , 2011, IJCAI.

[41]  David B. Dunson,et al.  Probabilistic topic models , 2011, KDD '11 Tutorials.

[42]  Imran N. Junejo,et al.  Silhouette-based human action recognition using SAX-Shapes , 2014, The Visual Computer.

[43]  John D. Lafferty,et al.  Correlated Topic Models , 2005, NIPS.

[44]  Christopher L. Dancy ACT-RΦ: A cognitive architecture with physiology and affect , 2013, BICA 2013.

[45]  Paul F. M. J. Verschure,et al.  EFAA - A Companion Emerges From Integrating a Layered Cognitive Architecture , 2014, 2014 9th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[46]  Bir Bhanu,et al.  Dynamic Bayesian Networks for Vehicle Classification in Video , 2012, IEEE Transactions on Industrial Informatics.

[47]  Andrea Cavallaro,et al.  Video-Based Human Behavior Understanding: A Survey , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[48]  Paul Baxter,et al.  Cognitive architecture for human–robot interaction: Towards behavioural alignment , 2013, BICA 2013.

[49]  Ronen Basri,et al.  Actions as Space-Time Shapes , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  Yang Wang,et al.  Human Action Recognition by Semilatent Topic Models , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  Quoc V. Le,et al.  Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis , 2011, CVPR 2011.

[52]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[53]  Oliver Brdiczka,et al.  Learning Situation Models in a Smart Home , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[54]  Tamim Asfour,et al.  A cognitive architecture for a humanoid robot: a first approach , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[55]  Ruslan Salakhutdinov,et al.  Evaluation methods for topic models , 2009, ICML '09.

[56]  Zhidong Deng,et al.  C-HMAX: Artificial cognitive model inspired by the color vision mechanism of the human brain , 2013 .

[57]  Anup Basu,et al.  Human Activity Recognition Based on Silhouette Directionality , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[58]  J. Gregory Trafton,et al.  ACT-R/E , 2013, HRI 2013.

[59]  Ivan Laptev,et al.  On Space-Time Interest Points , 2005, International Journal of Computer Vision.