Cross-Scene Trajectory Level Intention Inference using Gaussian Process Regression and Naive Registration

Human intention inference is the ability of an artificial system to predict the intention of a person. It is important in the context of human-robot interaction and homeland security, where proactive decision making is necessary. Human intention inference systems at test time is given a partial sequence of observations rather than a complete one. At a trajectory level, the observations are 2D/3D spatial human trajectories and intents are 2D/3D spatial locations where these human trajectories might end up. We study a learning approach where we train a model from complete spatial trajectories, and use partial spatial trajectories to test intention predictions early and accurately. We use non-parametric Gaussian Process Regression (GPR) as the learning model since GPR has been shown to model subtle aspects of human trajectory very well. We also develop a simple geometric transfer technique called Naive Registration (NR) that allows us to learn the model using training data in a source scene and then reuse that model for testing data in a target scene. Our results on synthetic and real data suggests that our transfer technique achieves comparable results as the technique of training from scratch in the target scene.

[1]  Julie A. Shah,et al.  Fast target prediction of human reaching motion for cooperative human-robot manipulation tasks using time series classification , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Silvio Savarese,et al.  Social LSTM: Human Trajectory Prediction in Crowded Spaces , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Yizong Cheng,et al.  Mean Shift, Mode Seeking, and Clustering , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Michael S. Ryoo,et al.  Human activity prediction: Early recognition of ongoing activities from streaming videos , 2011, 2011 International Conference on Computer Vision.

[5]  Dani Lischinski,et al.  Crowds by Example , 2007, Comput. Graph. Forum.

[6]  Dmitry Berenson,et al.  A framework for unsupervised online human reaching motion recognition and early prediction , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7]  J. Faraway,et al.  Modelling three‐dimensional trajectories by using Bézier curves with application to hand motion , 2007 .

[8]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[9]  D. Wolpert,et al.  Mental state inference using visual control parameters. , 2005, Brain research. Cognitive brain research.

[10]  Hema Swetha Koppula,et al.  Anticipating Human Activities Using Object Affordances for Reactive Robotic Response , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Debasmit Das,et al.  Unsupervised Domain Adaptation Using Regularized Hyper-Graph Matching , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[12]  Luc Van Gool,et al.  You'll never walk alone: Modeling social behavior for multi-target tracking , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Stefano Soatto,et al.  Intent-aware long-term prediction of pedestrian motion , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[14]  Xiaogang Wang,et al.  Understanding pedestrian behaviors from stationary crowd groups , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Song-Chun Zhu,et al.  Modeling and Inferring Human Intents and Latent Functional Objects for Trajectory Prediction , 2016, ArXiv.

[16]  Ashwin P. Dani,et al.  Human intention inference and motion modeling using approximate E-M with online learning , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[17]  Debasmit Das,et al.  Sample-to-Sample Correspondence for Unsupervised Domain Adaptation , 2018, Eng. Appl. Artif. Intell..

[18]  Siddhartha S. Srinivasa,et al.  Planning-based prediction for pedestrians , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  Roger Woodard,et al.  Interpolation of Spatial Data: Some Theory for Kriging , 1999, Technometrics.

[20]  Bernhard Schölkopf,et al.  Probabilistic movement modeling for intention inference in human–robot interaction , 2013, Int. J. Robotics Res..