Evolutionary Clustering via Message Passing

We are often interested in clustering objects that evolve over time and identifying solutions to the clustering problem for every time step. Evolutionary clustering provides insight into cluster evolution and temporal changes in cluster memberships while enabling performance superior to that achieved by independently clustering data collected at different time points. In this paper we introduce evolutionary affinity propagation (EAP), an evolutionary clustering algorithm that groups data points by exchanging messages on a factor graph. EAP promotes temporal smoothness of the solution to clustering time-evolving data by linking the nodes of the factor graph that are associated with adjacent data snapshots, and introduces consensus nodes to enable cluster tracking and identification of cluster births and deaths. Unlike existing evolutionary clustering methods that require additional processing to approximate the number of clusters or match them across time, EAP determines the number of clusters and tracks them automatically. A comparison with existing methods on simulated and experimental data demonstrates effectiveness of the proposed EAP algorithm.

[1]  Aidong Zhang,et al.  Analysis on Community Variational Trend in Dynamic Networks , 2014, CIKM.

[2]  Liu Fengqi,et al.  Application of the clustering method in analysing shallow water masses and modified water masses in the Huanghai Sea and East China Sea , 1983 .

[3]  B. Yin,et al.  Analysis of seasonal variation of water masses in East China Sea , 2014, Chinese Journal of Oceanology and Limnology.

[4]  Francesco Folino,et al.  An Evolutionary Multiobjective Approach for Community Discovery in Dynamic Networks , 2014, IEEE Transactions on Knowledge and Data Engineering.

[5]  Haris Vikalo,et al.  Semi-Supervised Affinity Propagation with Soft Instance-Level Constraints , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Machiko Toyoda,et al.  Adaptive Message Update for Fast Affinity Propagation , 2015, KDD.

[7]  Ian Paul McCarthy,et al.  The Star Treatment , 2013, The Journal of Human Resources.

[8]  Nick S. Jones,et al.  Dynamic communities in multichannel data: an application to the foreign exchange market during the 2007-2008 credit crisis. , 2008, Chaos.

[9]  Brendan J. Frey,et al.  A Binary Variable Model for Affinity Propagation , 2009, Neural Computation.

[10]  T. Hazen,et al.  The Unique Chemistry of Eastern Mediterranean Water Masses Selects for Distinct Microbial Communities by Depth , 2015, PloS one.

[11]  Brendan J. Frey,et al.  Semi-Supervised Affinity Propagation with Instance-Level Constraints , 2009, AISTATS.

[12]  Jiawei Han,et al.  A Particle-and-Density Based Evolutionary Clustering Method for Dynamic Networks , 2009, Proc. VLDB Endow..

[13]  Mark A. Moline,et al.  Bioinformatic approaches for objective detection of water masses on continental shelves , 2004 .

[14]  L. Talley Chapter 9 – Atlantic Ocean , 2011 .

[15]  Przemyslaw Kazienko,et al.  GED: the method for group evolution discovery in social networks , 2012, Social Network Analysis and Mining.

[16]  Young-Min Kim,et al.  Temporal Multinomial Mixture for Instance-Oriented Evolutionary Clustering , 2015, ECIR.

[17]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[18]  William Chen,et al.  Antihypertensive medication adherence and subsequent healthcare utilization and costs. , 2010, The American journal of managed care.

[19]  P. Deb,et al.  Association between Medicare Advantage plan star ratings and enrollment. , 2013, JAMA.

[20]  Tanya Y. Berger-Wolf,et al.  A framework for community identification in dynamic social networks , 2007, KDD '07.

[21]  Brent H. Meyer,et al.  The Budget and Economic Outlook , 2007 .

[22]  Yun Chi,et al.  Evolutionary spectral clustering by incorporating temporal smoothness , 2007, KDD '07.

[23]  Deepayan Chakrabarti,et al.  Evolutionary clustering , 2006, KDD '06.

[24]  Yan Li,et al.  Multiparameter cluster analysis of seasonal variation of water masses in the eastern Beibu Gulf , 2011 .

[25]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[26]  Alfred O. Hero,et al.  Adaptive evolutionary clustering , 2011, Data Mining and Knowledge Discovery.

[27]  Dragomir Anguelov,et al.  Mining The Stock Market : Which Measure Is Best ? , 2000 .

[28]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Yihong Gong,et al.  Detecting communities and their evolutions in dynamic social networks—a Bayesian approach , 2011, Machine Learning.

[30]  L. Talley Chapter 11 – Indian Ocean , 2011 .

[31]  Vicenç Quera,et al.  Determining shoal membership using affinity propagation , 2013, Behavioural Brain Research.

[32]  Yifan Li,et al.  Clustering moving objects , 2004, KDD.

[33]  Haris Vikalo,et al.  Evolutionary affinity propagation , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[34]  Chonghui Guo,et al.  Incremental Affinity Propagation Clustering Based on Message Passing , 2014, IEEE Transactions on Knowledge and Data Engineering.

[35]  Michele Leone,et al.  Clustering by Soft-constraint Affinity Propagation: Applications to Gene-expression Data , 2022 .

[36]  Thomas Seidl,et al.  Tracing Evolving Subspace Clusters in Temporal Climate Data , 2011, Data Mining and Knowledge Discovery.

[37]  Yun Chi,et al.  On evolutionary spectral clustering , 2009, TKDD.

[38]  Brendan J. Frey,et al.  Hierarchical Affinity Propagation , 2011, UAI.

[39]  L. Kazis,et al.  The Role of Geography in the Assessment of Quality: Evidence from the Medicare Advantage Program , 2016, PloS one.

[40]  Derek Greene,et al.  Tracking the Evolution of Communities in Dynamic Social Networks , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[41]  Kanad Ghose,et al.  Detecting and Tracking Spatio-temporal Clusters with Adaptive History Filtering , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[42]  Chang-Dong Wang,et al.  Multi-Exemplar Affinity Propagation , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Eric P. Xing,et al.  Timeline: A Dynamic Hierarchical Dirichlet Process Model for Recovering Birth/Death and Evolution of Topics in Text Stream , 2010, UAI.

[44]  Kevin S. Xu,et al.  Tracking Communities of Spammers by Evolutionary Clustering , 2010 .

[45]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[46]  Dean Roemmich,et al.  The 2004-2008 mean and annual cycle of temperature, salinity, and steric height in the global ocean from the Argo Program , 2009 .

[47]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[48]  J. Foody,et al.  Adherence to statins, subsequent healthcare costs, and cardiovascular hospitalizations. , 2011, The American journal of cardiology.

[49]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[50]  Lizhu Zhou,et al.  Mining Naturally Smooth Evolution of Clusters from Dynamic Data , 2007, SDM.

[51]  Philip S. Yu,et al.  Evolutionary Clustering by Hierarchical Dirichlet Process with Hidden Markov State , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[52]  M. Weigt,et al.  Unsupervised and semi-supervised clustering by message passing: soft-constraint affinity propagation , 2007, 0712.1165.

[53]  S. Speich,et al.  Interocean exchanges and the spreading of Antarctic Intermediate Water south of Africa , 2012 .

[54]  Brendan J. Frey,et al.  Non-metric affinity propagation for unsupervised image categorization , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[55]  M Christopher Roebuck,et al.  Medication adherence leads to lower health care use and costs despite increased drug spending. , 2011, Health affairs.

[56]  Eric P. Xing,et al.  Dynamic Non-Parametric Mixture Models and the Recurrent Chinese Restaurant Process: with Applications to Evolutionary Clustering , 2008, SDM.

[57]  Yiannis Kompatsiaris,et al.  Community detection in Social Media , 2012, Data Mining and Knowledge Discovery.

[58]  L. Talley Chapter 13 – Southern Ocean , 2011 .

[59]  Ruiyuan Tian,et al.  Tracking Time-Variant Cluster Parameters in MIMO Channel Measurements , 2007, 2007 Second International Conference on Communications and Networking in China.

[60]  Frans Albarillo LibGuides Home: Center for Research in Security Prices: CRSP , 2016 .

[61]  J. Couto,et al.  Geographic Variation in Medication Adherence in Commercial and Medicare Part D Populations , 2014, Journal of managed care & specialty pharmacy.

[62]  B. Stuart,et al.  Does medication adherence lower Medicare spending among beneficiaries with diabetes? , 2011, Health services research.