Mining social media: key players, sentiments, and communities

Social media is the key component of social networks and organizational social applications. The emergence of new systems and services has created a number of novel social and ubiquitous environments for mining information, data, and, finally, knowledge. This connects but also transcends private and business applications featuring a range of different types of networks and organizational contexts. Important structures concern subgroups emerging in those applications as communities (connecting people), roles and key actors in the networks and communities, and opinions, beliefs, and sentiments of the set of actors. Collective intelligence can then be considered as an emerging phenomenon of the different interactions. This focus article considers mining approaches concerning social media in social networks and organizations and the analysis of such data. We first summarize important terms and concepts. Next, we describe and discuss key actor identification and characterization, sentiment mining and analysis, and community mining. In the sequel we consider different application areas and briefly discuss two exemplary ubiquitous and social applications—the social conference guidance system Conferator, and the MyGroup system for supporting working groups. Furthermore, we describe the VIKAMINE system for mining communities and subgroups in social media in the sketched application domains. Finally, we conclude with a discussion and outlook. © 2012 Wiley Periodicals, Inc.

[1]  Florian Lemmerich,et al.  Modeling Location-Based Profiles of Social Image Media Using Explorative Pattern Mining , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[2]  Jure Leskovec,et al.  Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters , 2008, Internet Math..

[3]  Julia Heidemann,et al.  Online Social Networks – Ein sozialer und technischer Überblick , 2010, Informatik-Spektrum.

[4]  Florian Lemmerich,et al.  VIKAMINE - Open-Source Subgroup Discovery, Pattern Mining, and Analytics , 2012, ECML/PKDD.

[5]  Peter Brusilovsky,et al.  Community-based Conference Navigator , 2007 .

[6]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[7]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Ciro Cattuto,et al.  What's in a crowd? Analysis of face-to-face behavioral networks , 2010, Journal of theoretical biology.

[9]  Joaquin Vanschoren,et al.  EUROPEAN CONFERENCE ON MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES , 2012 .

[10]  Chirayu Wongchokprasitti,et al.  Conference Navigator 2.0: Community-Based Recommendation for Academic Conferences , 2010 .

[11]  Einoshin Suzuki,et al.  Discovering Community-Oriented Roles of Nodes in a Social Network , 2010, DaWak.

[12]  Dominik Benz,et al.  Enhancing Social Interactions at Conferences , 2011, it Inf. Technol..

[13]  Dominik Benz,et al.  Community Assessment Using Evidence Networks , 2010, MSM/MUSE.

[14]  Pierangela Samarati,et al.  Protecting Respondents' Identities in Microdata Release , 2001, IEEE Trans. Knowl. Data Eng..

[15]  A. Kaplan,et al.  Users of the world, unite! The challenges and opportunities of Social Media , 2010 .

[16]  Dominik Benz,et al.  Towards Mining Semantic Maturity in Social Bookmarking Systems , 2011, SDoW@ISWC.

[17]  Sreeram V Ramagopalan,et al.  Risk of venous thromboembolism in people admitted to hospital with selected immune-mediated diseases: record-linkage study , 2011, BMC medicine.

[18]  Dominik Benz,et al.  The social bookmark and publication management system bibsonomy , 2010, The VLDB Journal.

[19]  Dino Pedreschi,et al.  Anonymity preserving pattern discovery , 2008, The VLDB Journal.

[20]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[21]  Chrysanthos Dellarocas,et al.  Harnessing Crowds: Mapping the Genome of Collective Intelligence , 2009 .

[22]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[23]  Hsinchun Chen,et al.  AI and Opinion Mining , 2010, IEEE Intelligent Systems.

[24]  Tom M Mitchell,et al.  Mining Our Reality , 2009, Science.

[25]  Alex Pentland,et al.  Reality mining: sensing complex social systems , 2006, Personal and Ubiquitous Computing.

[26]  Gerd Stumme,et al.  Profile Mining in CVS-Logs and Face-to-Face Contacts for Recommending Software Developers , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[27]  Andreas Hotho,et al.  Face-to-Face Contacts at a Conference: Dynamics of Communities and Roles , 2011, MSM/MUSE.

[28]  Jure Leskovec,et al.  Empirical comparison of algorithms for network community detection , 2010, WWW '10.

[29]  Michalis Faloutsos,et al.  Online social networks , 2010, IEEE Network.

[30]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[31]  Martin Atzmüller,et al.  Efficient Descriptive Community Mining , 2011, FLAIRS.

[32]  Tom A. B. Snijders,et al.  Social Network Analysis , 2011, International Encyclopedia of Statistical Science.

[33]  A. Barrat,et al.  Simulation of an SEIR infectious disease model on the dynamic contact network of conference attendees , 2011, BMC medicine.

[34]  Andrea Esuli,et al.  Sentiment Quantification , 2010, IEEE Intell. Syst..

[35]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[36]  Bing Liu,et al.  Opinion spam and analysis , 2008, WSDM '08.

[37]  Vahab S. Mirrokni,et al.  Large-Scale Community Detection on YouTube for Topic Discovery and Exploration , 2011, ICWSM.

[38]  Gerd Stumme,et al.  Anatomy of a conference , 2012, HT '12.

[39]  Andreas Hotho,et al.  Privacy-aware spam detection in social bookmarking systems , 2011, i-KNOW '11.

[40]  Alessandro Chessa,et al.  Group Recommendation with Automatic Identification of Users Communities , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[41]  Andreas Hotho,et al.  Analysis of Social Media and Ubiquitous Data , 2011, Lecture Notes in Computer Science.

[42]  Oleg E. Melnik,et al.  Encyclopedia of Complexity and Systems Science , 2008 .

[43]  Pang-Ning Tan,et al.  Exploration of Link Structure and Community-Based Node Roles in Network Analysis , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).