A poisson process model for activity forecasting

Activity forecasting has recently become an active research area for its importance in critical applications like automated navigation and human-computer interaction. However, for a video observed upto a certain time, all of the existing forecasting works focus on predicting the activity label, i.e., predicting what the next unobserved activity is. To the best of our knowledge, no work has answered the crucial question yet: when the next unobserved activity will occur. In this paper, we propose an approach for predicting the starting time of the next unobserved activity without assuming that we know its label. We model activities occurring at a variable rate using a Log-Gaussian Cox Process (LGCP) and learn the rate function from the training data. Then the starting time is predicted using importance sampling algorithm. In our experiments on the challenging MPII-Cooking dataset, we find that both the label of the last observed activity and the label of the activity being predicted affect the time prediction accuracy.

[1]  J. Møller,et al.  Log Gaussian Cox Processes , 1998 .

[2]  Kalina Bontcheva,et al.  Modeling Tweet Arrival Times using Log-Gaussian Cox Processes , 2015, EMNLP.

[3]  Anirban Chakraborty,et al.  Context-Aware Activity Forecasting , 2014, ACCV.

[4]  Lihi Zelnik-Manor,et al.  Learning Video Saliency from Human Gaze Using Candidate Selection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Martial Hebert Active and passive range sensing for robotics , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[6]  Aki Vehtari,et al.  GPstuff: Bayesian modeling with Gaussian processes , 2013, J. Mach. Learn. Res..

[7]  Kris M. Kitani,et al.  Action-Reaction: Forecasting the Dynamics of Human Interaction , 2014, ECCV.

[8]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[9]  Silvio Savarese,et al.  A Hierarchical Representation for Future Action Prediction , 2014, ECCV.

[10]  Fang Meng,et al.  Saliency detection on videos with scene change , 2014, 2014 International Conference on Audio, Language and Image Processing.

[11]  Patrick Pérez,et al.  Video Inpainting of Complex Scenes , 2014, SIAM J. Imaging Sci..

[12]  Volkan Sezer,et al.  Towards autonomous navigation of unsignalized intersections under uncertainty of human driver intent , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13]  Melba M. Crawford,et al.  Modeling and simulation of a nonhomogeneous poisson process having cyclic behavior , 1991 .

[14]  Martial Hebert,et al.  Activity Forecasting , 2012, ECCV.

[15]  Visakan Kadirkamanathan,et al.  Point process modelling of the Afghan War Diary , 2012, Proceedings of the National Academy of Sciences.

[16]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[17]  Amit K. Roy-Chowdhury,et al.  Context-Aware Activity Recognition and Anomaly Detection in Video , 2013, IEEE Journal of Selected Topics in Signal Processing.

[18]  Sridha Sridharan,et al.  Forecasting Events Using an Augmented Hidden Conditional Random Field , 2014, ACCV.

[19]  Bernt Schiele,et al.  A database for fine grained activity detection of cooking activities , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Wen-Hsien Fang,et al.  Video anomaly detection and localization using hierarchical feature representation and Gaussian process regression , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Yun Fu,et al.  Prediction of Human Activity by Discovering Temporal Sequence Patterns , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Nizar Bouguila,et al.  Video Completion Using Bandlet Transform , 2012, IEEE Transactions on Multimedia.