Why are these people there? An analysis based on Twitter

Social mining techniques enable gathering huge amount of data and allows inferring relevant information about Social Network Sites users such as social ties, interests, emotions, habits, etc. However, applying these techniques to tweets entail several difficulties consequence of their own characteristics: short messages written in informal language. In this paper we deal with this issues with the aim of finding out the reason why an unexpected group of people is in the same location at the same time. We show the results of applying our methodology in Madrid City during May Day.

[1]  Helena Ahonen-Myka,et al.  Simple Semantics in Topic Detection and Tracking , 2004, Information Retrieval.

[2]  Gang Liu,et al.  Short text similarity based on probabilistic topics , 2009, Knowledge and Information Systems.

[3]  Sandra Servia Rodríguez,et al.  Is There a Crowd? Experiences in Using Density-Based Clustering and Outlier Detection , 2014, MIKE.

[4]  Andreas Rauber,et al.  Uncovering hierarchical structure in data using the growing hierarchical self-organizing map , 2002, Neurocomputing.

[5]  Fang Wu,et al.  Social Networks that Matter: Twitter Under the Microscope , 2008, First Monday.

[6]  George Karypis,et al.  A Comparison of Document Clustering Techniques , 2000 .

[7]  Fionn Murtagh,et al.  Ward’s Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward’s Criterion? , 2011, Journal of Classification.

[8]  David R. Karger,et al.  Scatter/Gather: a cluster-based approach to browsing large document collections , 1992, SIGIR '92.

[9]  Somnath Banerjee,et al.  Clustering short texts using wikipedia , 2007, SIGIR.

[10]  James P. Callan,et al.  Automatically labeling hierarchical clusters , 2006, DG.O.

[11]  Young-Ho Park,et al.  Finding Core Topics: Topic Extraction with Clustering on Tweet , 2012, 2012 Second International Conference on Cloud and Green Computing.

[12]  Latifur Khan,et al.  Tweets mining using WIKIPEDIA and impurity cluster measurement , 2010, 2010 IEEE International Conference on Intelligence and Security Informatics.

[13]  Oren Etzioni,et al.  Web document clustering: a feasibility demonstration , 1998, SIGIR '98.

[14]  Hanan Samet,et al.  TwitterStand: news in tweets , 2009, GIS.

[15]  James Allan,et al.  Introduction to topic detection and tracking , 2002 .

[16]  Zhi Lu,et al.  Short text clustering by finding core terms , 2011, Knowledge and Information Systems.

[17]  Oren Etzioni,et al.  Grouper: A Dynamic Clustering Interface to Web Search Results , 1999, Comput. Networks.

[18]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.