Clustering and Classification of Like-Minded People from their Tweets

Several challenges accompanied the growth of online social networks, such as grouping people with similar interest. Grouping like-minded people is of a high importance. Indeed, it leads to many applications like link prediction and friend or product suggestion, and explains various social phenomenon. In this paper, we present two methods of grouping like-minded people based on their textual posts. Compared to three baseline methods K-Means, LDA and the Scalable Multistage Clustering algorithm (SMSC), our algorithms achieves relative improvements on two corpora of tweets.

[1]  Xin Li,et al.  Tag-based social interest discovery , 2008, WWW.

[2]  S. Jaffali,et al.  Principal component analysis neural network for textual document categorization and dimension reduction , 2012, 2012 6th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT).

[3]  Omar Boussaïd,et al.  Community Extraction Based on Topic-Driven-Model for Clustering Users Tweets , 2012, ADMA.

[4]  Matthew Michelson,et al.  Tweet Disambiguate Entities Retrieve Folksonomy SubTree Step 1 : Discover Categories Generate Topic Profile from SubTrees Step 2 : Discover Profile Topic Profile : “ English Football ” “ World Cup ” , 2010 .

[5]  Lada A. Adamic,et al.  The political blogosphere and the 2004 U.S. election: divided they blog , 2005, LinkKDD '05.

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[8]  Iadh Ounis,et al.  Overview of the TREC 2011 Microblog Track , 2011, TREC.

[9]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[10]  Bamshad Mobasher,et al.  Personalizing Navigation in Folksonomies Using Hierarchical Tag Clustering , 2008, DaWaK.

[11]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[12]  Abdelmajid Ben Hamadou,et al.  Grouping Like-Minded Users Based on Text and Sentiment Analysis , 2014, ICCCI.

[13]  Yiannis Kompatsiaris,et al.  Community detection in Social Media , 2012, Data Mining and Knowledge Discovery.

[14]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Marco Pennacchiotti,et al.  Investigating topic models for social media user recommendation , 2011, WWW.

[16]  Huan Liu,et al.  Connecting users with similar interests via tag network inference , 2011, CIKM '11.

[17]  L. Venkata Subramaniam,et al.  Using content and interactions for discovering communities in social networks , 2012, WWW.

[18]  Ari Rappoport,et al.  Efficient Clustering of Short Messages into General Domains , 2013, ICWSM.

[19]  Lekha Bhambhu,et al.  DATA CLASSIFICATION USING SUPPORT VECTOR MACHINE , 2009 .

[20]  Lylia Abrouk,et al.  Découverte de communautés par analyse des usages , 2010 .

[21]  Jian Liu,et al.  Comparative Analysis for k-Means Algorithms in Network Community Detection , 2010, ISICA.

[22]  Yun Chi,et al.  Discovery of Blog Communities based on Mutual Awareness , 2006 .

[23]  Wei-keng Liao,et al.  User-Interest based Community Extraction in Social Networks , 2012, KDD 2012.

[24]  Christos Faloutsos,et al.  Statistical Properties of Social Networks , 2011, Social Network Data Analytics.

[25]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[26]  VakaliAthena,et al.  Community detection in Social Media , 2012 .

[27]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[28]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email , 2007, J. Artif. Intell. Res..

[29]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.