Learning Granger Causality for Hawkes Processes

Learning Granger causality for general point processes is a very challenging task. In this paper, we propose an effective method, learning Granger causality, for a special but significant type of point processes --- Hawkes process. We reveal the relationship between Hawkes process's impact function and its Granger causality graph. Specifically, our model represents impact functions using a series of basis functions and recovers the Granger causality graph via group sparsity of the impact functions' coefficients. We propose an effective learning algorithm combining a maximum likelihood estimator (MLE) with a sparse-group-lasso (SGL) regularizer. Additionally, the flexibility of our model allows to incorporate the clustering structure event types into learning framework. We analyze our learning algorithm and propose an adaptive procedure to select basis functions. Experiments on both synthetic and real-world data show that our method can learn the Granger causality graph and the triggering patterns of the Hawkes processes simultaneously.

[1]  Stephen J. Roberts,et al.  Scalable Nonparametric Bayesian Inference on Point Processes with Gaussian Processes , 2014, ICML.

[2]  Nicolas Vayatis,et al.  Nonparametric Markovian Learning of Triggering Kernels for Mutually Exciting and Mutually Inhibiting Multivariate Hawkes Processes , 2014, ECML/PKDD.

[3]  Christopher Meek Toward Learning Graphical and Causal Process Models , 2014, CI@UAI.

[4]  Puyang Xu,et al.  A Model for Temporal Dependencies in Event Streams , 2011, NIPS.

[5]  Le Song,et al.  Shaping Social Activity by Incentivizing Users , 2014, NIPS.

[6]  Ali Shojaie,et al.  Network granger causality with inherent grouping structure , 2012, J. Mach. Learn. Res..

[7]  Shuang Li,et al.  COEVOLVE: A Joint Point Process Model for Information Diffusion and Network Co-evolution , 2015, NIPS.

[8]  E. Bacry,et al.  Some limit theorems for Hawkes processes and application to financial statistics , 2013 .

[9]  Esko Valkeila,et al.  An Introduction to the Theory of Point Processes, Volume II: General Theory and Structure, 2nd Edition by Daryl J. Daley, David Vere‐Jones , 2008 .

[10]  Jure Leskovec,et al.  SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity , 2015, KDD.

[11]  Hongyuan Zha,et al.  Learning Mixtures of Markov Chains from Aggregate Data with Structural Constraints , 2016, IEEE Trans. Knowl. Data Eng..

[12]  Ole Winther,et al.  Multivariate Hawkes process models of the occurrence of regulatory elements , 2010, BMC Bioinformatics.

[13]  Amr Ahmed,et al.  Recovering time-varying networks of dependencies in social and biological studies , 2009, Proceedings of the National Academy of Sciences.

[14]  Hongyuan Zha,et al.  On Machine Learning towards Predictive Sales Pipeline Analytics , 2015, AAAI.

[15]  Ulrike Goldschmidt,et al.  An Introduction To The Theory Of Point Processes , 2016 .

[16]  Le Song,et al.  Estimating Diffusion Network Structures: Recovery Conditions, Sample Complexity & Soft-thresholding Algorithm , 2014, ICML.

[17]  R. Dahlhaus,et al.  Graphical Modeling for Multivariate Hawkes Processes with Nonparametric Link Functions , 2016, 1605.06759.

[18]  Fang Han,et al.  Transition Matrix Estimation in High Dimensional Time Series , 2013, ICML.

[19]  A. Châtelain,et al.  The European Physical Journal D , 1999 .

[20]  E. Bacry,et al.  Non-parametric kernel estimation for symmetric Hawkes processes. Application to high frequency financial data , 2011, 1112.1838.

[21]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[22]  Le Song,et al.  Learning Social Infectivity in Sparse Low-rank Networks Using Multi-dimensional Hawkes Processes , 2013, AISTATS.

[23]  Yan Liu,et al.  Temporal causal modeling with graphical granger methods , 2007, KDD '07.

[24]  Noah Simon,et al.  A Sparse-Group Lasso , 2013 .

[25]  Vinayak A. Rao,et al.  A Multitask Point Process Predictive Model , 2015, ICML.

[26]  Shuang-Hong Yang,et al.  Mixture of Mutually Exciting Processes for Viral Diffusion , 2013, ICML.

[27]  Bernard W. Silverman,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[28]  Vanessa Didelez,et al.  Graphical models for marked point processes based on local independence , 2007, 0710.5874.

[29]  Chloe Chen Chen Graphical modelling of multivariate time series , 2011 .

[30]  M. Eichler Graphical modelling of multivariate time series , 2006, math/0610654.

[31]  Hongyuan Zha,et al.  Trailer Generation via a Point Process-Based Visual Attractiveness Model , 2015, IJCAI.

[32]  Robert E. Hampson,et al.  Identification of sparse neural functional connectivity using penalized likelihood estimation and basis functions , 2013, Journal of Computational Neuroscience.

[33]  Rong Xie,et al.  You Are What You Watch and When You Watch: Inferring Household Structures From IPTV Viewing Data , 2014, IEEE Transactions on Broadcasting.

[34]  A. Hawkes Spectra of some self-exciting and mutually exciting point processes , 1971 .

[35]  Stephen J. Roberts,et al.  Variational Inference for Gaussian Process Modulated Poisson Processes , 2014, ICML.

[36]  A. W. M. van den Enden,et al.  Discrete Time Signal Processing , 1989 .

[37]  Zenglin Xu,et al.  Online Learning for Group Lasso , 2010, ICML.

[38]  Ryan P. Adams,et al.  Tractable nonparametric Bayesian inference in Poisson processes with Gaussian process intensities , 2009, ICML '09.

[39]  J. Rasmussen Bayesian Inference for Hawkes Processes , 2013 .

[40]  Rebecca Willett,et al.  Tracking Dynamic Point Processes on Networks , 2014, IEEE Transactions on Information Theory.

[41]  Le Song,et al.  Learning Triggering Kernels for Multi-dimensional Hawkes Processes , 2013, ICML.

[42]  Arthur Gretton,et al.  A Kernel Independence Test for Random Processes , 2014, ICML.

[43]  P. Reynaud-Bouret,et al.  Adaptive estimation for Hawkes processes; application to genome analysis , 2009, 0903.2919.

[44]  Wenjun Zhang,et al.  Multi-Task Multi-Dimensional Hawkes Processes for Modeling Event Sequences , 2015, IJCAI.

[45]  Erik A. Lewis,et al.  RESEARCH ARTICLE A Nonparametric EM algorithm for Multiscale Hawkes Processes , 2011 .

[46]  Lasso and probabilistic inequalities for multivariate point processes , 2015, 1208.0570.

[47]  Alan V. Oppenheim,et al.  Discrete-Time Signal Pro-cessing , 1989 .

[48]  Le Song,et al.  Learning Networks of Heterogeneous Influence , 2012, NIPS.