Comparison of Measurements by the Betweenness Centrality and Subjective Experiment on the Word Priority of Tweets

In recent years, it is very popular to use Twitter as an interaction tool and for information transmission and sharing as well. It is more difficult for us to understand the interaction flow on Twitter than the usual dialogue and conversation. But it will become easier to catch the interaction flow if we can focus on the priority words of Tweets. As we know, the word priority of Tweets can be measured by the betweenness centrality based on the word co-occurrence network. However, it is unknown whether the word priority measured by the betweenness centrality is the same to what the Twitter users think or not, which has not yet been well studied. In this study, we design an experiment to compare the measurement calculated by the betweenness centrality and the evaluation given by subjects. We further analyze their difference by statistical methods. Our result shows that the measurement determined by the betweenness centrality does not always match the subjective evaluation, but they tend to have a positive correlation. Particularly, proper nouns and words with long characters are more likely to be regarded higher priority than the betweenness centrality.

[1]  Dmitry Paranyushkin,et al.  Identifying the Pathways for Meaning Circulation using Text Network Analysis , 2011 .

[2]  Peter van der Weerd,et al.  Conceptual Grouping in Word Co-Occurrence Networks , 1999, IJCAI.

[3]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .

[4]  J. Bordogna Network analysis: Second edition , 1965 .

[5]  Masakazu Takahashi,et al.  Comparison of Centrality Indexes in Network Japanese Text Analysis , 2013 .

[6]  Pei-Chun Lee,et al.  Mapping knowledge structure by keyword co-occurrence: a first look at journal papers in Technology Foresight , 2010, Scientometrics.

[7]  Detlef Schoder,et al.  Web Science 2.0: Identifying Trends through Semantic Social Network Analysis , 2008, 2009 International Conference on Computational Science and Engineering.

[8]  Philip Edmonds Choosing the word most typical in context using a lexical co-occurrence network , 1997 .

[9]  Dekuan Xu,et al.  An exploration on the word co-occurrence network of Chinese popular song titles , 2016, 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[10]  Tetsuya Nasukawa,et al.  Tweeting about the tsunami?: mining twitter for information on the tohoku earthquake and tsunami , 2012, WWW.

[11]  Shalin Hai-Jew Analyzing Social Media Networks with NodeXL: Insights from a Connected World , 2012 .

[12]  Timothy Baldwin,et al.  On-line Trend Analysis with Topic Models: #twitter Trends Detection Topic Model Online , 2012, COLING.

[13]  Daniel D. Suthers,et al.  Social and semantic network analysis of chat logs , 2011, LAK.