Hyperlink Prediction in Hypernetworks Using Latent Social Features

Predicting the existence of links between pairs of objects is a key problem in the study of social networks. However, relationships among objects are often more complex than simple pairwise relations, and restricting attention to dyads can discard information that is valuable for many learning tasks. The hypernetwork relaxes the assumption that only two nodes can participate in a link, instead permitting an arbitrary number of nodes to participate in so-called hyperlinks or hyperedges, which is a more natural representation for complex, multi-party relations. However, the hyperlink prediction problem has yet to be studied. In this paper, we propose HPLSF (Hyperlink Prediction using Latent Social Features), a hyperlink prediction algorithm for hypernetworks. By exploiting the homophily property of social networks, HPLSF uses social features for hyperlink prediction. Because social features are not always observable, we develop a latent social feature learning scheme. To cope with the arbitrary cardinality of hyperlinks, we design a feature-embedding scheme that maps the a priori arbitrarily-sized feature set associated with each hyperlink into a uniformly-sized auxiliary space. Because observed features and latent features may not be independent, we generalize a structural SVM to learn from both observed and latent features. In experiments, we evaluate HPLSF on three large-scale, diverse hypernetwork datasets, and the results demonstrate its effectiveness. Although developed in the context of social networks, HPLSF is a general methodology and applies to arbitrary hypernetworks.
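
To make the variable-cardinality issue concrete, the following is a minimal sketch of one way to map the feature set of a hyperlink with any number of member nodes to a vector of fixed length. It is not the HPLSF embedding scheme, whose details are not given here; mean and max pooling are swapped in as a simple stand-in, and the name `embed_hyperlink` and the toy feature table are hypothetical.

```python
from typing import Dict, FrozenSet, List, Sequence

# Hypothetical toy data: node id -> feature vector (observed or latent).
NodeFeatures = Dict[str, Sequence[float]]


def embed_hyperlink(members: FrozenSet[str], features: NodeFeatures) -> List[float]:
    """Map a variable-size set of node feature vectors to a fixed-size vector.

    Assumption: pooling (per-dimension mean and max) stands in for the
    paper's embedding. The output length is 2 * d for d-dimensional node
    features, regardless of how many nodes the hyperlink contains.
    """
    if not members:
        raise ValueError("a hyperlink needs at least one node")
    vectors = [features[node] for node in sorted(members)]
    dim = len(vectors[0])
    means = [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]
    maxes = [max(v[i] for v in vectors) for i in range(dim)]
    return means + maxes


features = {
    "a": [1.0, 0.0],
    "b": [0.0, 1.0],
    "c": [1.0, 1.0],
}

# A 2-node and a 3-node hyperlink both yield vectors of length 4.
pair = embed_hyperlink(frozenset({"a", "b"}), features)
triple = embed_hyperlink(frozenset({"a", "b", "c"}), features)
```

Because both outputs have the same length, hyperlinks of different sizes can be fed to a single downstream classifier; a learned embedding would replace the fixed pooling used in this sketch.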
