Proposing a new friend recommendation method, FRUTAI, to enhance social media providers' performance

Social media, such as Facebook and Twitter, have grown rapidly in recent years. Friend recommendation systems, as an important emerging component of social media, may efficiently expand social media networks by proactively recommending new and potentially high-quality friends to users. Literature review has shown that prior research work on friend recommendation mainly focuses on the linking relation between users in social media but largely neglects the influence of users' attributes. In this study, we have systematically reviewed and evaluated the existing state-of-the-art friend recommendation algorithms. We introduce a new Friend Recommendation system using a User's Total Attributes Information (FRUTAI) based on the law of total probability. The proposed method can be easily extended according to the increasing number of a user's attributes with low computation cost. Furthermore, the FRUTAI is a universal friend recommendation method and can be applied in different types of social media because it does not distinguish the structure of the network. We have collected 7 million users' public information and their friend relationships from RenRen, commonly regarded as the Facebook of China. Using the real-world data from a dominant social media provider, we extensively evaluate the proposed method with other existing friend recommendation algorithms. Our experimental results have demonstrated the comparatively better performance of FRUTAI. In our empirical studies, we have observed that the performance of FRUTAI is related to the number of a user's friends. In particular, when a user has a small number of friends, the proposed FRUTAI algorithm performs better than other algorithms; when a user has a large number of friends, the overall performance of FRUTAI becomes less competitive but is still comparable to those of other providers, and its precision rate is quite outstanding. Our findings may provide some important practical implications to social media design and performance. Propose a new friend recommendation method and algorithm, FRUTAIFRUTAI has a flexible format that can be easily extended to adding users' additional important attributes.Use the real-world data from a dominant social media provider in China

[1]  Marcus Vaska,et al.  Creating and Assessing a Subject-Based Blog for Current Awareness within a Cancer Care Environment , 2013 .

[2]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[4]  Hsinchun Chen,et al.  Link prediction approach to collaborative filtering , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[5]  Philip M. Gravinese,et al.  Corals on Acid: An Inquiry-based Activity Leading Students to a Better Understanding of Ocean Acidification Impacts , 2013 .

[6]  Lothar Fritsch,et al.  Trustworthy and Inclusive Identity Management for Applications in Social Media , 2013, HCI.

[7]  Grant Cairncross,et al.  Student Tourism and Destination Choice: Exploring the Influence of Traditional, New, And Social Media: an Australian Case Study , 2013 .

[8]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[9]  Sepandar D. Kamvar,et al.  An Analytical Comparison of Approaches to Personalizing PageRank , 2003 .

[10]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[11]  Taher H. Haveliwala Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search , 2003, IEEE Trans. Knowl. Data Eng..

[12]  V. Kumar,et al.  Practice Prize Winner - Creating a Measurable Social Media Marketing Strategy: Increasing the Value and ROI of Intangibles and Tangibles for Hokey Pokey , 2013, Mark. Sci..

[13]  Ed Urban,et al.  Using Scientific Meetings to Enhance the Development of Early Career Scientists , 2013 .

[14]  AdomaviciusGediminas,et al.  Toward the Next Generation of Recommender Systems , 2005 .

[15]  Jiawei Han,et al.  A Unified Framework for Link Recommendation Using Random Walks , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[16]  R. Huckfeldt,et al.  Citizens, Politics and Social Communication: Information and Influence in an Election Campaign , 1995 .

[17]  A. Barabasi,et al.  Evolution of the social network of scientific collaborations , 2001, cond-mat/0104162.

[18]  O. A. Sawy,et al.  Digital business strategy: toward a next generation of insights , 2013 .

[19]  M. Hallinan,et al.  Sex differences in children's friendships. , 1978, American sociological review.

[20]  Michael J. Pazzani,et al.  A Framework for Collaborative, Content-Based and Demographic Filtering , 1999, Artificial Intelligence Review.

[21]  Ludmila I. Kuncheva,et al.  On the optimality of Naïve Bayes with dependent binary features , 2006, Pattern Recognit. Lett..

[22]  Shuchuan Lo,et al.  WMR--A Graph-Based Algorithm for Friend Recommendation , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[23]  A. Kaplan,et al.  Users of the world, unite! The challenges and opportunities of Social Media , 2010 .

[24]  Michael J. Pazzani,et al.  Workshop on recommender systems: algorithms and evaluation , 1999, SIGF.

[25]  Michael J. Muller,et al.  Make new friends, but keep the old: recommending people on social networking sites , 2009, CHI.

[26]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[27]  Christopher C. Yang,et al.  Harnessing Social Media for Drug-Drug Interactions Detection , 2013, 2013 IEEE International Conference on Healthcare Informatics.

[28]  Michael J. Pazzani,et al.  Content-Based Recommendation Systems , 2007, The Adaptive Web.

[29]  David Godes,et al.  Introduction to the Special Issue - Social Media and Business Transformation: A Framework for Research , 2013, Inf. Syst. Res..

[30]  Rynson W. H. Lau,et al.  Knowledge and Data Engineering for e-Learning Special Issue of IEEE Transactions on Knowledge and Data Engineering , 2008 .

[31]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[32]  M Girvan,et al.  Structure of growing social networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[33]  Harry Zhang,et al.  Naive Bayesian Classifiers for Ranking , 2004, ECML.

[34]  Mark Claypool,et al.  Combining Content-Based and Collaborative Filters in an Online Newspaper , 1999, SIGIR 1999.

[35]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[36]  Harry Zhang,et al.  Naive Bayes for optimal ranking , 2008, J. Exp. Theor. Artif. Intell..

[37]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[38]  P. V. Marsden,et al.  Core Discussion Networks of Americans , 1987 .

[39]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[40]  Kathleen M. Carley,et al.  Communication at A Distance : The Influence of Print on Sociocultural Organization and Change , 1994 .

[41]  Barry Wellman,et al.  Examining Community in the Digital Neighborhood: Early Results from Canada's Wired Suburb , 1999, Digital Cities.

[42]  Paula England,et al.  Theory on gender/feminism on theory , 1993 .

[43]  Ian Soboroff. Charles Nicholas Combining Content and Collaboration in Text Filtering , 1999 .

[44]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[45]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[46]  Kathleen M. Carley,et al.  Communication at a Distance: The Influence of Print on Sociocultural Organization and Change , 1994 .

[47]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[48]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[49]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[50]  Karl Rihaczek,et al.  1. WHAT IS DATA MINING? , 2019, Data Mining for the Social Sciences.