Clustering large networks of parametric dynamic generative models

Analysis, prediction and control of parametric generative models for stochastic processes arise in numerous applications, such as in biology, telecommunications, geography, seismology and finance. In many of these applications, it is desirable to obtain an aggregated behavior from an underlying network of stochastic interactions. This paper focuses on the simplification of parametric models describing multiple stochastic processes, by aggregating the processes that have similar input-output behaviors in an ensemble. We propose a clustering-based method, which is general in the sense that the similarity metric upon which the aggregation relies can accommodate processes characterized by a variety of generative models. To illustrate our aggregation framework, we investigate an example system comprised of a set of point process models for earthquakes. Simulations are presented.

[1]  M. Wilson,et al.  Analyzing Functional Connectivity Using a Network Likelihood Model of Ensemble Neural Spiking Activity , 2005, Neural Computation.

[2]  L. Adamopoulos Cluster models for earthquakes: Regional comparisons , 1976 .

[3]  Emery N. Brown,et al.  A Granger Causality Measure for Point Process Models of Ensemble Neural Spiking Activity , 2011, PLoS Comput. Biol..

[4]  Puneet Sharma,et al.  A Scalable Approach to Combinatorial Library Design for Drug Discovery , 2008, J. Chem. Inf. Model..

[5]  Todd P. Coleman,et al.  Equivalence between minimal generative model graphs and directed information graphs , 2011, 2011 IEEE International Symposium on Information Theory Proceedings.

[6]  S. Asmussen,et al.  Applied Probability and Queues , 1989 .

[7]  Trades and Quotes: A Bivariate Point Process , 1998 .

[8]  F. Schoenberg,et al.  Point process modeling of wildfire hazard in Los Angeles County, California , 2011, 1108.0754.

[9]  Yosihiko Ogata,et al.  Statistical Models for Earthquake Occurrences and Residual Analysis for Point Processes , 1988 .

[10]  Eric R. Ziegel,et al.  An Introduction to Generalized Linear Models , 2002, Technometrics.

[11]  K. Rose Deterministic annealing for clustering, compression, classification, regression, and related optimization problems , 1998, Proc. IEEE.

[12]  Wei Ren,et al.  Information consensus in multivehicle cooperative control , 2007, IEEE Control Systems.

[13]  C. Granger Investigating causal relations by econometric models and cross-spectral methods , 1969 .

[14]  Y. Ogata Space-Time Point-Process Models for Earthquake Occurrences , 1998 .

[15]  Tsuhan Chen,et al.  Hierarchical Semantics of Objects (hSOs) , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[16]  R. Kass,et al.  Multiple neural spike train data analysis: state-of-the-art and future challenges , 2004, Nature Neuroscience.

[17]  Yunwen Xu,et al.  On reduction of graphs and Markov chain models , 2011, IEEE Conference on Decision and Control and European Control Conference.

[18]  Peter Guttorp,et al.  Nonparametric estimation of intensities for sampled counting processes , 1990 .

[19]  Daoud Berkani,et al.  Low-order model for speech signals , 2004, Signal Processing.