An Analysis of Twitter Users From The Perspective of Their Behavior, Language, Region and Development Indices - A Study of 80 Million Tweets

The need for a comprehensive study to explore various aspects of online social media has been instigated by many researchers. This paper gives an insight into the social platform, Twitter. In this present work, we have illustrated stepwise procedure for crawling the data and discuss the key issues related to extracting associated features that can be useful in Twitter-related research while crawling these data from Application Programming Interfaces (APIs). Further, the data that comprises of over 86 million tweets have been analysed from various perspective including the most used languages, most frequent words, most frequent users, countries with most and least tweets and re-tweets, etc. The analysis reveals that the users’ data associated with Twitter has a high affinity for researches in the various domain that includes politics, social science, economics, and linguistics, etc. In addition, the relation between Twitter users of a country and its human development index has been identified. It is observed that countries with very high human development indices have a relatively higher number of tweets compared to low human development indices countries. It is envisaged that the present study shall open many doors of researches in information processing and data science.

[1]  H. Varian,et al.  Predicting the Present with Google Trends , 2009 .

[2]  Rashid Mehmood,et al.  Sentiment Analysis of Arabic Tweets in Smart Cities: A Review of Saudi Dialect , 2019, 2019 Fourth International Conference on Fog and Mobile Edge Computing (FMEC).

[3]  Dana Al-Ghadhban,et al.  Arabic sarcasm detection in Twitter , 2017, 2017 International Conference on Engineering & MIS (ICEMIS).

[4]  Helen Margetts,et al.  Political behaviour and the acoustics of social media , 2017, Nature Human Behaviour.

[5]  Rashid Ali,et al.  Aggregating Subjective and Objective Measures of Web Search Quality using Modified Shimura Technique , 2006, 9th International Conference on Information Technology (ICIT'06).

[6]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[7]  S. Ye Measuring message propagation and social influence on Twitter , 2013 .

[8]  E. Nsoesie,et al.  Social media captures demographic and regional physical activity , 2019, BMJ Open Sport & Exercise Medicine.

[9]  Dayou Li,et al.  Analysis of the relationship between Saudi twitter posts and the Saudi stock market , 2015, 2015 IEEE Seventh International Conference on Intelligent Computing and Information Systems (ICICIS).

[10]  Shyhtsun Felix Wu,et al.  Measuring message propagation and social influence on Twitter.com , 2010, Int. J. Commun. Networks Distributed Syst..

[11]  Ram Gopal Raj,et al.  Sentiment Analysis for Arabic in Social Media Network: A Systematic Mapping Study , 2019, ArXiv.

[12]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[13]  A Note on Modified Rank Correlation , 1994 .

[14]  Hernán A. Makse,et al.  CUNY Academic Works , 2022 .

[15]  P. Gloor,et al.  Predicting Stock Market Indicators Through Twitter “I hope it is not as bad as I fear” , 2011 .

[16]  Hsinchun Chen,et al.  Textual analysis of stock market prediction using breaking financial news: The AZFin text system , 2009, TOIS.

[17]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[18]  Liwei Wang,et al.  Exploring demographic information in social media for product recommendation , 2015, Knowledge and Information Systems.

[19]  James P. Bagrow,et al.  Information flow reveals prediction limits in online social activity , 2017, Nature Human Behaviour.

[20]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[21]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[22]  Krishna P. Gummadi,et al.  Who Makes Trends? Understanding Demographic Biases in Crowdsourced Recommendations , 2017, ICWSM.

[23]  Maurice Vergeer,et al.  Campaigning on Twitter: Microblogging and Online Social Networking as Campaign Tools in the 2010 General Elections in the Netherlands , 2013, J. Comput. Mediat. Commun..

[24]  Gilad Mishne,et al.  Capturing Global Mood Levels using Blog Posts , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[25]  Xiaohui Yu,et al.  ARSA: a sentiment-aware model for predicting sales performance using blogs , 2007, SIGIR.

[26]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[27]  Jin-Cheon Na,et al.  Demographics Analysis of Twitter Users who Tweeted on Psychological Articles and Tweets Analysis , 2018, INNS Conference on Big Data.

[28]  Alex Leavitt,et al.  The Influentials : New Approaches for Analyzing Influence on Twitter , 2009 .

[29]  Mishari Almishari,et al.  Arabic Twitter Profiling For Arabic-Speaking Users , 2018, 2018 21st Saudi Computer Society National Computer Conference (NCC).

[30]  Ramanathan V. Guha,et al.  The predictive power of online chatter , 2005, KDD '05.

[31]  M. M. Sufyan Beg A subjective measure of web search quality , 2005, Inf. Sci..

[32]  Xin Shuai,et al.  Loose tweets: an analysis of privacy leaks on twitter , 2011, WPES.

[33]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[34]  Akemi Takeoka Chatfield,et al.  Twitter Early Tsunami Warning System: A Case Study in Indonesia's Natural Disaster Management , 2013, 2013 46th Hawaii International Conference on System Sciences.

[35]  Claudio Soriente,et al.  Hummingbird: Privacy at the Time of Twitter , 2012, 2012 IEEE Symposium on Security and Privacy.

[36]  Tahani Almanie,et al.  Saudi Mood: A Real-Time Informative Tool for Visualizing Emotions in Saudi Arabia Using Twitter , 2018, 2018 21st Saudi Computer Society National Computer Conference (NCC).

[37]  Rashid Ali,et al.  Feature-Based Opinion Mining Approach (FOMA) for Improved Book Recommendation , 2018 .

[38]  E. Nsoesie,et al.  Use of social media, search queries, and demographic data to assess obesity prevalence in the United States , 2019, Palgrave Communications.

[39]  Cory L. Armstrong,et al.  Now Tweet This: How News Organizations Use Twitter , 2010 .

[40]  Johan Bollen,et al.  The minute-scale dynamics of online emotions reveal the effects of affect labeling , 2018, Nature Human Behaviour.

[41]  Geetika Gautam,et al.  Sentiment analysis of twitter data using machine learning approaches and semantic analysis , 2014, 2014 Seventh International Conference on Contemporary Computing (IC3).

[42]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[43]  Sune Lehmann,et al.  Understanding the Demographics of Twitter Users , 2011, ICWSM.

[44]  Sounman Hong,et al.  Online news on Twitter: Newspapers' social media adoption and their online readership , 2012, Inf. Econ. Policy.

[45]  Vagelis Hristidis,et al.  Demographic-Based Content Analysis of Web-Based Health-Related Social Media , 2016, Journal of medical Internet research.