Action Recognition From Skeleton Data via Analogical Generalization Over Qualitative Representations

Human action recognition remains a difficult problem for AI. Traditional machine learning techniques can have high recognition accuracy, but they are typically black boxes whose internal models are not inspectable and whose results are not explainable. This paper describes a new pipeline for recognizing human actions from skeleton data via analogical generalization. Specifically, starting with Kinect data, we segment each human action by temporal regions where the motion is qualitatively uniform, creating a sketch graph that provides a form of qualitative representation of the behavior that is easy to visualize. Models are learned from sketch graphs via analogical generalization, which are then used for classification via analogical retrieval. The retrieval process also produces links between the new example and components of the model that provide explanations. To improve recognition accuracy, we implement dynamic feature selection to pick reasonable relational features. We show the explanation advantage of our approach by example, and results on three public datasets illustrate its utility.

[1]  Anthony G. Cohn,et al.  Unsupervised Learning of Qualitative Motion Behaviours by a Mobile Robot , 2016, AAMAS.

[2]  Kenneth D. Forbus,et al.  Extending SME to Handle Large-Scale Cognitive Modeling , 2017, Cogn. Sci..

[3]  Dedre Gentner,et al.  Structure-Mapping: A Theoretical Framework for Analogy , 1983, Cogn. Sci..

[4]  John Folkesson,et al.  Combining Top-down Spatial Reasoning and Bottom-up Object Class Recognition for Scene Understanding , 2014, IROS 2014.

[5]  Nasser Kehtarnavaz,et al.  UTD-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and a wearable inertial sensor , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[6]  Pichao Wang,et al.  Action Recognition Based on Joint Trajectory Maps with Convolutional Neural Networks , 2018, Knowl. Based Syst..

[7]  Jake K. Aggarwal,et al.  View invariant human action recognition using histograms of 3D joints , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[8]  Dimitrios Makris,et al.  Hierarchical transfer learning for online recognition of compound actions , 2016, Comput. Vis. Image Underst..

[9]  Dimitrios Makris,et al.  Clustered Spatio-temporal Manifolds for Online Action Recognition , 2014, 2014 22nd International Conference on Pattern Recognition.

[10]  Chen Liang,et al.  Representation and Computation in Cognitive Models , 2017, Top. Cogn. Sci..

[11]  Anthony G. Cohn,et al.  QSRlib: a software library for online acquisition of qualitative spatial relations from video , 2016 .

[12]  A. Cohn,et al.  A qualitative trajectory calculus as a basis for representing moving objects in Geographical Information Systems , 2006 .

[13]  Kenneth D. Forbus,et al.  MAC/FAC: A Model of Similarity-Based Retrieval , 1995, Cogn. Sci..

[14]  Jun Ye Spatial and Temporal Modeling for Human Activity Recognition from Multimodal Sequential Data , 2016 .

[15]  Kenneth D. Forbus,et al.  CogSketch: Sketch Understanding for Cognitive Science Research and for Education , 2011, Top. Cogn. Sci..

[16]  Jessika Weiss,et al.  Vision Science Photons To Phenomenology , 2016 .

[17]  Christian Wolf,et al.  Sequential Deep Learning for Human Action Recognition , 2011, HBU.

[18]  Anthony G. Cohn,et al.  Qualitative Spatial Representation and Reasoning with the Region Connection Calculus , 1997, GeoInformatica.

[19]  Alberto Del Bimbo,et al.  Submitted to Ieee Transactions on Cybernetics 1 3d Human Action Recognition by Shape Analysis of Motion Trajectories on Riemannian Manifold , 2022 .

[20]  Christopher Meek,et al.  Adversarial learning , 2005, KDD '05.

[21]  Antonio Marin-Hernandez,et al.  Improving action recognition by selection of features , 2016, 2016 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC).

[22]  John Folkesson,et al.  A Comparison of Qualitative and Metric Spatial Relation Models for Scene Understanding , 2015, AAAI.

[23]  Dimitrios Makris,et al.  G3D: A gaming action dataset and real time action recognition evaluation framework , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[24]  Kenneth D. Forbus,et al.  Modeling Visual Problem Solving as Analogical Reasoning , 2017, Psychological review.

[25]  A. Lovett,et al.  Cultural commonalities and differences in spatial problem-solving: A computational analysis , 2011, Cognition.

[26]  Kenneth D. Forbus,et al.  Extending Analogical Generalization with Near-Misses , 2015, AAAI.

[27]  Rama Chellappa,et al.  Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Jianxin Chen,et al.  Joint Motion Similarity (JMS)-Based Human Action Recognition Using Kinect , 2016, 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[29]  Alberto Del Bimbo,et al.  Recognizing Actions from Depth Cameras as Weakly Aligned Multi-part Bag-of-Poses , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[30]  Dimitris Kastaniotis,et al.  Pose-based human action recognition via sparse representation in dissimilarity space , 2014, J. Vis. Commun. Image Represent..