Exploiting rank-learning models to predict the diffusion of preferences on social networks

This work tries to bring a marriage between two areas of computer science, social network analysis and machine learning, by exploiting ranking-based learning models for preference prediction on social networks. In the field of social network analysis, the diffusion of information on social networks has been studied for decades. This paper proposes the study of diffusion of preference on social networks. In general, there are two types of approaches proposed to predict the diffusion of information on a network, model-driven and data-driven approaches. The former assumes an underlying mechanism for diffusion while the latter tries to learn a more flexible model with the given data. This paper first proposes a simple modification on the existing model-driven binary diffusion approaches for preference list diffusion, and then addresses some concerns by proposing a rank-learning based data-driven approach. To evaluate the approaches, we propose two scenarios which data can be obtained from publicly available sources, namely predicting the preference propagation about the citation behavior and the microblogging behavior. The experiments show that the proposed ranking-based data-driven method outperforms all the other competitors significantly in both evaluation scenarios.

[1]  Hongliang Fei,et al.  Content based social behavior prediction: a multi-task learning approach , 2011, CIKM '11.

[2]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[3]  Masahiro Kimura,et al.  Selecting Information Diffusion Models over Social Networks for Behavioral Analysis , 2010, ECML/PKDD.

[4]  Ying Zhang,et al.  Statistically Modeling the Effectiveness of Disaster Information in Social Media , 2011, 2011 IEEE Global Humanitarian Technology Conference.

[5]  Miles Osborne,et al.  RT to Win! Predicting Message Propagation in Twitter , 2011, ICWSM.

[6]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.

[7]  Shou-De Lin,et al.  Modeling the Diffusion of Preferences on Social Networks , 2013, SDM.

[8]  Jon M. Kleinberg,et al.  Overview of the 2003 KDD Cup , 2003, SKDD.

[9]  Michael R. Lyu,et al.  DiffusionRank: a possible penicillin for web spamming , 2007, SIGIR.

[10]  J. Leskovec,et al.  Cascading Behavior in Large Blog Graphs Patterns and a model , 2006 .

[11]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[12]  Tie-Yan Liu,et al.  Learning to rank for information retrieval , 2009, SIGIR.

[13]  Peter A. Flach,et al.  Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28 - July 1, 2009 , 2009, KDD.

[14]  Mark S. Granovetter Threshold Models of Collective Behavior , 1978, American Journal of Sociology.

[15]  Michael R. Lyu,et al.  Mining social networks using heat diffusion processes for marketing candidates selection , 2008, CIKM '08.

[16]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[17]  Shou-De Lin,et al.  Assessing the Quality of Diffusion Models Using Real-World Social Network Data , 2011, 2011 International Conference on Technologies and Applications of Artificial Intelligence.

[18]  Wolfgang Kellerer,et al.  Outtweeting the Twitterers - Predicting Information Cascades in Microblogs , 2010, WOSN.

[19]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[20]  W. O. Kermack,et al.  A contribution to the mathematical theory of epidemics , 1927 .

[21]  Jacob Goldenberg,et al.  Using Complex Systems Analysis to Advance Marketing Theory Development , 2001 .