Event detection in Twitter using text and image fusion

In this paper, we describe an accurate and effective event detection method to detect events from Twitter stream. It detects events using visual information as well as textual information to improve the performance of the mining. It monitors Twitter stream to pick up tweets having texts and photos and stores them into database. Then it applies mining algorithm to detect the event. Firstly, it detects event based on text only by using the feature of the bag-of-words which is calculated using the term frequency-inverse document frequency (TF-IDF) method. Secondly, it detects the event based on image only by using visual features including histogram of oriented gradients (HOG) descriptors, grey-level co-occurrence matrix (GLCM), and color histogram. K nearest neighbours (Knn) classification is used in the detection. Finally, the final decision of the event detection is made based on the reliabilities of text only detection and image only detection. The experiment result showed that the proposed method achieved high accuracy of 0.93, comparing with 0.89 with texts only, and 0.86 with images only.

[1]  Mohan S. Kankanhalli,et al.  Multimedia data mining: state of the art and challenges , 2010, Multimedia Tools and Applications.

[2]  Raleigh North Haewoon, Kwak, Changhyun, Lee, Park, Hosung, and Moon, Sue. . What is Twitter, a Social Network or a News Media?. 19th International World Wide Web (WWW) Conference.April. , 2010 .

[3]  SaltonGerard,et al.  Term-weighting approaches in automatic text retrieval , 1988 .

[4]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[5]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[6]  Xindong Wu,et al.  The Top Ten Algorithms in Data Mining , 2009 .

[7]  Jennifer Xu,et al.  Data Mining for Social Network Data , 2010, Annals of Information Systems.

[8]  A. Kaplan,et al.  Users of the world, unite! The challenges and opportunities of Social Media , 2010 .

[9]  A. Stefanidis,et al.  Harvesting ambient geospatial information from social media feeds , 2011, GeoJournal.

[10]  Edward Y. Chang,et al.  Support vector machine active learning for image retrieval , 2001, MULTIMEDIA '01.

[11]  John Mingers,et al.  The paucity of multimethod research: a review of the information systems literature , 2003, Inf. Syst. J..

[12]  Huan Liu,et al.  Community Detection and Mining in Social Media , 2010, Community Detection and Mining in Social Media.

[13]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Reza Zafarani,et al.  Social Media Mining: An Introduction , 2014 .

[15]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.