Social Network Extraction: A Review of Automatic Techniques

The advent of Web 2.0 has been instrumental in paradigm shift of how people communicate? These communications are a rich source of relationship data. Analyzing such a vast amount of relationship data is not a trivial task. Social Network Analysis is a promising field of research to take advantage of this huge pool of relationship data. But before this data is analyzed from Social Network Analysis perspective, Social Networks have to be extracted from this data. Social network extraction deals with the extraction of online social networks from a wide variety of online resources. These resources include web documents, e-mail communication, Internet relay chats, web usage logs, event logs, instant messenger logs, online blogs etc. Social network extraction is beneficial for many Web mining and social network applications such as expert finding for research guidance, potential speakers and contributors for conferences, journals, workshops, product recommendation, targeted advertising etc. In the last decade, many efforts have been made in the area of social network extraction. As a result, a good number of social network extraction methods have been proposed in the literature. These social network extraction methods use different sources for social network extraction. Some of these systems also use data from more than one resource. Although there are some social network extraction methods which construct a social network manually and as such cannot be considered in this work, as we deal with automatic methods only. In this paper, we classify automatic methods for social network extraction on the basis of information source they use. We also outline a general framework for social network extraction and give some future directions.

[1]  I-Hsien Ting,et al.  Analyzing Multi-source Social Data for Extracting and Mining Social Networks , 2009, 2009 International Conference on Computational Science and Engineering.

[2]  Kôiti Hasida,et al.  POLYPHONET: An advanced social network extraction system from the Web , 2007, J. Web Semant..

[3]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[4]  Rashid Ali,et al.  Scientific Co-authorship Social Networks: A Case Study of Computer Science Scenario in India , 2012 .

[5]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[6]  Pasquale De Meo,et al.  Analyzing the Facebook Friendship Graph , 2010, ArXiv.

[7]  Jie Tang,et al.  Social Network Extraction of Academic Researchers , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[8]  Mitsuru Ishizuka,et al.  Extracting Social Networks Among Various Entities on the Web , 2007, ESWC.

[9]  Paul Mutton,et al.  Inferring and visualizing social networks on Internet relay chat , 2004, Proceedings. Eighth International Conference on Information Visualisation, 2004. IV 2004..

[10]  Junghwan Kim,et al.  Extraction and Visualization of Implicit Social Relations on Social Networking Services , 2010, AAAI.

[11]  Dennis M. Wilkinson,et al.  A method for finding communities of related genes , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Krishna P. Gummadi,et al.  Measurement and analysis of online social networks , 2007, IMC '07.

[13]  Ji Wang,et al.  Measuring the influence of social networks on information diffusion on blogspheres , 2009, 2009 International Conference on Machine Learning and Cybernetics.

[14]  I-Hsien Ting,et al.  A Dynamic and Task-Oriented Social Network Extraction System Based on Analyzing Personal Social Data , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[15]  Andrew McCallum,et al.  Extracting social networks and contact information from email and the Web , 2004, CEAS.

[16]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[17]  Ankur Teredesai,et al.  Extracting Social Networks from Instant Messaging Populations , 2004 .

[18]  Michael Gertz,et al.  Mining email social networks , 2006, MSR '06.

[19]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[20]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[21]  Kôiti Hasida,et al.  Social Network Extraction of Conference Participants , 2003, WWW.

[22]  Denilson Barbosa,et al.  Extracting Information Networks from the Blogosphere: State-of-the-Art and Challenges , 2010 .

[23]  Denilson Barbosa,et al.  Adaptive record extraction from web pages , 2007, WWW '07.

[24]  Hendrik Blockeel,et al.  Web mining research: a survey , 2000, SKDD.

[25]  Oka Yutaka Matsuo Weighting Relations in Social Networks Using the Web Mizuki , 2009 .

[26]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[27]  Katarzyna Musial,et al.  Multidimensional Social Network: Model and Analysis , 2011, ICCCI.

[28]  Bernardo A. Huberman,et al.  Email as spectroscopy: automated discovery of community structure within organizations , 2003 .

[29]  Daniel Neagu,et al.  Online social network profile data extraction for vulnerability analysis , 2011 .

[30]  Chaomei Chen,et al.  Mining the Web: Discovering knowledge from hypertext data , 2004, J. Assoc. Inf. Sci. Technol..

[31]  Jian Su,et al.  Discovering Relations Between Named Entities from a Large Raw Corpus Using Tree Similarity-Based Clustering , 2005, IJCNLP.

[32]  Maria Teresa Gomez Lopez,et al.  COMPETITIVE INTELLIGENCE BASED ON SOCIAL NETWORKS FOR DECISION MAKING , 2009 .

[33]  K Xu Measurement and Analysis of Online Social Networks , 2014 .

[34]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[35]  Jon Oberlander,et al.  Identifying more bloggers: Towards large scale personality classification of personal weblogs , 2007, ICWSM.

[36]  I-Hsien Ting,et al.  Web mining techniques for on-line social networks analysis , 2008, 2008 International Conference on Service Systems and Service Management.

[37]  Juan-Zi Li,et al.  Extraction and mining of an academic social network , 2008, WWW.

[38]  Bart Selman,et al.  The Hidden Web , 1997, AI Mag..

[39]  Peter Mika,et al.  Flink: Semantic Web technology for the extraction and analysis of social networks , 2005, J. Web Semant..

[40]  Heng Huang,et al.  Link prediction of multimedia social network via unsupervised face recognition , 2009, ACM Multimedia.

[41]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[42]  Tanev Hristo,et al.  Extracting and Learning Social Networks out of Multilingual News , 2008 .