Tracking and Visualization of Cluster Dynamics by Sequence-based SOM

Since events and physical phenomena change with time, it is important to capture the main transitions and elements of such events and phenomena. Such transitions can be seen to occur in the World Wide Web (Levene & Poulovassilis, 2004), news topics (Allan, 2002), a person’s health condition, and the state of an instrument or a plant. Transition, or change, refers to a sequential increase/decrease or generation/extinction of the feature of the object. Visualization of such transitions of dynamic clusters is helpful in understanding such phenomena instinctively and plays a useful role in many application domains, such as fault diagnosis and medial examinations. Although a number of clustering methods have been proposed (Jain et al., 1999), most conventional clustering methods deal with static data and cannot handle sequential changes of the cluster explicitly. Tracing the trajectory within clusters that have been collectively processed and a sliding window-based method to generate separate clusters can be considered as simple methods. Although the former method cannot trace changes of clusters, it can trace changes in the number of data that belong to each cluster. The latter method can handle changes of clusters to a certain degree. However, there are some problems, such as setting an appropriate window size, the inevitable decrease in the number of data within a window, and the correspondence relationships of clusters between windows. The present study considers a window-based approach using the temporal neighborhood to address the above-described problems. Kohonen's Self-Organizing Map (SOM) (Kohonen, 2000) is considered to be an appropriate technique for visualizing clusters and their similarity relationships. The SOM is an unsupervised neural network learning technique that produces clusters and subsequently projects them onto a low-dimensional (normally two-dimensional) topology map. The conventional SOM deals with static data. However, we have extended the SOM learning model by introducing the Sequencing Weight Function (SWF), so that the model can visualize the transition of dynamics clusters. This model is referred to herein as the Sequence-based SOM (SbSOM) (Fukui et al., 2008). A SOM-based method was selected because the SOM has a neuron topology in the feature space and that is associated with topology (visualization) space. The introduction of temporal order into the topology is natural. The proposed method mitigates the problems of appropriate 7

[1]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[2]  Masayuki Numao,et al.  Sequence-based SOM: Visualizing transition of dynamic clusters , 2008, 2008 8th IEEE International Conference on Computer and Information Technology.

[3]  Péter András Kernel-Kohonen Networks , 2002, Int. J. Neural Syst..

[4]  D. M. Hutton,et al.  Web Dynamics - Adapting to Change in Content, Size, Topology and Use , 2006 .

[5]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[6]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[7]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[8]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[9]  Yoshiharu Ishikawa,et al.  T-Scroll: Visualizing Trends in a Time-Series of Documents for Interactive User Exploration , 2007, ECDL.

[10]  Fabrice Rossi,et al.  Batch kernel SOM and related Laplacian methods for social network analysis , 2008, Neurocomputing.

[11]  Fabrice Rossi,et al.  Visualization methods for metric studies , 2006 .

[12]  Masayuki Numao,et al.  Visualization Architecture Based on SOM for Two-Class Sequential Data , 2006, KES.

[13]  Masayuki Numao,et al.  Combining Burst Extraction Method and Sequence-Based SOM for Evaluation of Fracture Dynamics in Solid Oxide Fuel Cell , 2007 .

[14]  Jaideep Srivastava,et al.  Simultaneously Finding Fundamental Articles and New Topics Using a Community Tracking Method , 2009, PAKDD.

[15]  Hujun Yin,et al.  Kernel self-organising maps for classification , 2006, Neurocomputing.

[16]  Andrew McCallum,et al.  Topics over time: a non-Markov continuous-time model of topical trends , 2006, KDD '06.

[17]  Ah Chung Tsoi,et al.  A supervised self-organizing map for structures , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[18]  M. Kimura,et al.  Multinomial PCA for extracting major latent topics from document streams , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[19]  Bernd Fritzke,et al.  Growing cell structures--A self-organizing network for unsupervised and supervised learning , 1994, Neural Networks.

[20]  James Allan,et al.  Topic detection and tracking: event-based information organization , 2002 .

[21]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[22]  David Jensen,et al.  TimeMines: Constructing Timelines with Statistical Models of Word Usage , 2000, KDD 2000.