Sparse and low-rank multivariate Hawkes processes

We consider the problem of unveiling the implicit network structure of node interactions (such as user interactions in a social network), based only on high-frequency timestamps. Our inference is based on the minimization of the least-squares loss associated with a multivariate Hawkes model, penalized by $\ell_1$ and trace norm of the interaction tensor. We provide a first theoretical analysis for this problem, that includes sparsity and low-rank inducing penalizations. This result involves a new data-driven concentration inequality for matrix martingales in continuous time with observable variance, which is a result of independent interest. A consequence of our analysis is the construction of sharply tuned $\ell_1$ and trace-norm penalizations, that leads to a data-driven scaling of the variability of information available for each users. Numerical experiments illustrate the significant improvements achieved by the use of such data-driven penalizations.

[1]  P. Massart,et al.  Concentration inequalities and model selection , 2007 .

[2]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[3]  Bernhard Schölkopf,et al.  Modeling Information Propagation with Survival Theory , 2013, ICML.

[4]  Emmanuel Bacry,et al.  Modelling microstructure noise with mutually exciting point processes , 2011, 1101.3422.

[5]  Lasso and probabilistic inequalities for multivariate point processes , 2015, 1208.0570.

[6]  Francesco Orabona,et al.  PRISMA: PRoximal Iterative SMoothing Algorithm , 2012, ArXiv.

[7]  A. Lewis The Convex Analysis of Unitarily Invariant Matrix Functions , 1995 .

[8]  Stéphane Gaïffas,et al.  Link prediction in graphs with autoregressive features , 2012, J. Mach. Learn. Res..

[9]  George E. Tita,et al.  Self-Exciting Point Process Modeling of Crime , 2011 .

[10]  Agathe Guilloux,et al.  High-dimensional additive hazards models and the Lasso , 2011, 1106.4662.

[11]  Shuang-Hong Yang,et al.  Mixture of Mutually Exciting Processes for Viral Diffusion , 2013, ICML.

[12]  V. Koltchinskii,et al.  Nuclear norm penalization and optimal rates for noisy low rank matrix completion , 2010, 1011.6256.

[13]  P. Bartlett,et al.  Empirical minimization , 2006 .

[14]  Bernhard Schölkopf,et al.  Uncovering the Temporal Dynamics of Diffusion Networks , 2011, ICML.

[15]  M. A. de Menezes,et al.  Fluctuations in network dynamics. , 2004, Physical review letters.

[16]  Yingdong Lu Theory of Martingales , 2013 .

[17]  Emmanuel Bacry,et al.  Concentration for matrix martingales in continuous time and microscopic activity of social networks , 2014, 1412.7705.

[18]  Katherine A. Heller,et al.  Modelling Reciprocating Relationships with Hawkes Processes , 2012, NIPS.

[19]  Emmanuel Bacry,et al.  tick: a Python Library for Statistical Learning, with an emphasis on Hawkes Processes and Time-Dependent Models , 2017, J. Mach. Learn. Res..

[20]  Joel A. Tropp,et al.  User-Friendly Tail Bounds for Sums of Random Matrices , 2010, Found. Comput. Math..

[21]  Emmanuel Bacry,et al.  Concentration inequalities for matrix martingales in continuous time , 2018 .

[22]  Didier Sornette,et al.  Robust dynamic classes revealed by measuring the response function of a social system , 2008, Proceedings of the National Academy of Sciences.

[23]  S. R. Jammalamadaka,et al.  Empirical Processes in M-Estimation , 2001 .

[24]  Emmanuel Bacry,et al.  Dual optimization for convex constrained objectives without the gradient-Lipschitz assumption , 2018, ArXiv.

[25]  E. Bacry,et al.  Hawkes Processes in Finance , 2015, 1502.04592.

[26]  P. Bickel,et al.  SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR , 2008, 0801.1095.

[27]  Scott W. Linderman,et al.  Discovering Latent Network Structure in Point Process Data , 2014, ICML.

[28]  Christos Faloutsos,et al.  Dynamics of large networks , 2008 .

[29]  Padhraic Smyth,et al.  Stochastic blockmodeling of relational event dynamics , 2013, AISTATS.

[30]  Le Song,et al.  Estimating Diffusion Network Structures: Recovery Conditions, Sample Complexity & Soft-thresholding Algorithm , 2014, ICML.

[31]  Y. Ogata The asymptotic behaviour of maximum likelihood estimators for stationary point processes , 1978 .

[32]  Yosihiko Ogata,et al.  On Lewis' simulation method for point processes , 1981, IEEE Trans. Inf. Theory.

[33]  Emmanuel Bacry,et al.  Mean-field inference of Hawkes point processes , 2015, ArXiv.

[34]  Tomoharu Iwata,et al.  Discovering latent influence in online social activities via shared cascade poisson processes , 2013, KDD.

[35]  V. Koltchinskii,et al.  Oracle inequalities in empirical risk minimization and sparse recovery problems , 2011 .

[36]  Y. Ogata Space-Time Point-Process Models for Earthquake Occurrences , 1998 .

[37]  Le Song,et al.  Learning Social Infectivity in Sparse Low-rank Networks Using Multi-dimensional Hawkes Processes , 2013, AISTATS.

[38]  Emmanuel J. Candès,et al.  Decoding by linear programming , 2005, IEEE Transactions on Information Theory.

[39]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[40]  L. Landesa,et al.  The generalized forward-backward method for analyzing the scattering from targets on ocean-like rough surfaces , 1999, IEEE Antennas and Propagation Society International Symposium. 1999 Digest. Held in conjunction with: USNC/URSI National Radio Science Meeting (Cat. No.99CH37010).

[41]  Emmanuel J. Candès,et al.  The Power of Convex Relaxation: Near-Optimal Matrix Completion , 2009, IEEE Transactions on Information Theory.

[42]  Lior Rokach,et al.  Introduction to Recommender Systems Handbook , 2011, Recommender Systems Handbook.

[43]  A. Hawkes Spectra of some self-exciting and mutually exciting point processes , 1971 .