Paper information - A Latent Space Approach to Dynamic Embedding of Co-occurrence Data

A Latent Space Approach to Dynamic Embedding of Co-occurrence Data

We consider dynamic co-occurrence data, such as author-word links in papers published in successive years of the same conference. For static co-occurrence data, researchers often seek an embedding of the entities (authors and words) into a low-dimensional Euclidean space. We generalize a recent static co-occurrence model, the CODE model of Globerson et al. (2004), to the dynamic setting: we seek coordinates for each entity at each time step. The coordinates can change with time to explain new observations, but since large changes are improbable, we can exploit data at previous and subsequent steps to find a better explanation for current observations. To make inference tractable, we show how to approximate our observation model with a Gaussian distribution, allowing the use of a Kalman filter for tractable inference. The result is the first algorithm for dynamic embedding of co-occurrence data which provides distributional information for its coordinate estimates. We demonstrate our model both on synthetic data and on author-word data from the NIPS corpus, showing that it produces intuitively reasonable embeddings. We also provide evidence for the usefulness of our model by its performance on an author-prediction task.
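As a rough illustration of the filtering idea in the abstract, the sketch below tracks one entity's coordinates with a Kalman filter. It is not the paper's algorithm: it assumes a random-walk transition (small changes between steps are likely, large ones are not) and assumes each time step already supplies a noisy Gaussian observation of the coordinates, whereas the paper derives a Gaussian approximation to a co-occurrence observation model. It also only filters forward, so it does not use data from subsequent steps. The names `kalman_step`, `filter_track`, `process_var`, `obs_var`, and `init_var` are hypothetical.

```python
def kalman_step(mean, var, obs, process_var, obs_var):
    """One predict/update step for a random-walk state observed directly.

    Assumes isotropic noise, so a single scalar variance is shared by all
    coordinates and every coordinate can be updated independently.
    """
    # Predict: a random walk keeps the mean and inflates the variance.
    pred_var = var + process_var
    # Update: the gain weighs the new observation against the prediction.
    gain = pred_var / (pred_var + obs_var)
    new_mean = [m + gain * (o - m) for m, o in zip(mean, obs)]
    new_var = (1.0 - gain) * pred_var
    return new_mean, new_var


def filter_track(observations, process_var=0.01, obs_var=0.25, init_var=1.0):
    """Filter a sequence of noisy coordinate observations for one entity.

    Returns a list of (mean, variance) pairs, one per time step, which is
    the distributional information attached to each coordinate estimate.
    """
    mean = [0.0] * len(observations[0])  # assumed prior: origin
    var = init_var                       # assumed prior variance
    track = []
    for obs in observations:
        mean, var = kalman_step(mean, var, obs, process_var, obs_var)
        track.append((mean, var))
    return track
```

Each returned pair holds a coordinate estimate and its variance. A smaller `process_var` encodes a stronger belief that coordinates change slowly, so the estimates lean more heavily on earlier time steps.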

Purnamrita Sarkar | Geoffrey J. Gordon | Sajid M. Siddiqi

[1] M.W. Berry, et al. Computational Methods for Intelligent Information Access, 1995, Proceedings of the IEEE/ACM SC95 Conference.

[2] P. Groenen, et al. Modern multidimensional scaling, 1996.

[3] J. Tenenbaum, et al. A global geometric framework for nonlinear dimensionality reduction, 2000, Science.

[4] A. Moore, et al. Dynamic social network analysis using latent space models, 2005, SKDD.

[5] Michael I. Jordan, et al. On Spectral Clustering: Analysis and an algorithm, 2001, NIPS.

[6] Guobiao Mei, et al. Visualization of Collaborative Data, 2006, UAI.

[7] S. T. Roweis, et al. Nonlinear dimensionality reduction by locally linear embedding, 2000, Science.

[8] Peter D. Hoff, et al. Latent Space Approaches to Social Network Analysis, 2002.

[9] T. Başar, et al. A New Approach to Linear Filtering and Prediction Problems, 2001.

[10] R. Sibson. Studies in the Robustness of Multidimensional Scaling: Perturbational Analysis of Classical Scaling, 1979.

[11] Gal Chechik, et al. Euclidean Embedding of Co-occurrence Data, 2004, J. Mach. Learn. Res.