#europehappinessmap: A Framework for Multi-Lingual Sentiment Analysis via Social Media Big Data (A Twitter Case Study)

The growth and popularity of social media platforms have generated a new social interaction environment thus a new collaboration and communication network among individuals. These platforms own tremendous amount of data about users’ behaviors and sentiments since people create, share or exchange their information, ideas, pictures or video using them. One of these popular platforms is Twitter, which via its voluntary information sharing structure, provides researchers data potential of benefit for their studies. Based on Twitter data, in this study a multilingual sentiment detection framework is proposed to compute European Gross National Happiness (GNH). This framework consists of a novel data collection, filtering and sampling method, and a newly constructed multilingual sentiment detection algorithm for social media big data, and tested with nine European countries (United Kingdom, Germany, Sweden, Turkey, Portugal, The Netherlands, Italy, France and Spain) and their national languages over a six year period. The reliability of the data is checked with peak/troughs comparison for special days from Wikipedia news lists. The validity is checked with a group of correlation analyses with OECD Life Satisfaction survey reports’, Euro-Dollar and other currency exchanges, and national stock market time series data. After validity and reliability confirmations, the European GNH map is drawn for six years. The main problem addressed is to propose a novel multilingual social media sentiment analysis framework for calculating GNH for countries and change the way of OECD type organizations’ survey and interview methodology. Also, it is believed that this framework can serve more detailed results (e.g., daily or hourly sentiments of society in different languages).

[1]  Barbara Poblete,et al.  Sentiment-based User Profiles in Microblogging Platforms , 2015, HT.

[2]  Xiao Wang,et al.  World Cup 2014 in the Twitter World: A big data analysis of sentiments in U.S. sports fans' tweets , 2015, Comput. Hum. Behav..

[3]  Tadahiko Kumamoto,et al.  Role of Emoticons for Multidimensional Sentiment Analysis of Twitter , 2014, iiWAS.

[4]  Cliff Lampe,et al.  The Benefits of Facebook "Friends: " Social Capital and College Students' Use of Online Social Network Sites , 2007, J. Comput. Mediat. Commun..

[5]  Lina Zhou,et al.  Movie Review Mining: a Comparison between Supervised and Unsupervised Classification Approaches , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[6]  Katrin Weller,et al.  Think before you collect: Setting up a data collection approach for social media studies , 2016, ArXiv.

[7]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[8]  A. Lenhart,et al.  Social networking websites and teens: an overview , 2007 .

[9]  Kerk F. Kee,et al.  Positive Impacts of Social Media at Work: Job Satisfaction, Job Calling, and Facebook Use among Co-Workers , 2017 .

[10]  Jonathan Newell The Strategic Project Leader: Mastering Service‐Based Project Leadership, Second Edition , 2016 .

[11]  Jorge E. Camargo,et al.  Ideological Consumerism in Colombian Elections, 2015: Links Between Political Ideology, Twitter Activity, and Electoral Results , 2016, Cyberpsychology Behav. Soc. Netw..

[12]  Alexandru Adrian Tole,et al.  Big Data Challenges , 2013 .

[13]  Jaehyun Yoo,et al.  Manifestation of Depression and Loneliness on Social Networks: A Case Study of Young Adults on Facebook , 2015, CSCW.

[14]  David Vilares,et al.  Lyapunov filtering of objectivity for Spanish Sentiment Model , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[15]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[16]  Banu Diri,et al.  Twitter verileri ile duygu analizi , 2016 .

[17]  Berkant Barla Cambazoglu,et al.  A Framework for Sentiment Analysis in Turkish: Application to Polarity Detection of Movie Reviews in Turkish , 2012, ISCIS.

[18]  B. Jeong,et al.  Activities on Facebook Reveal the Depressive State of Users , 2013, Journal of medical Internet research.

[19]  Amanda Lenhart,et al.  Adults and social network websites , 2009 .

[20]  Ahmet Onur Durahim,et al.  #iamhappybecause: Gross National Happiness through Twitter analysis and big data , 2015 .

[21]  F. Schweitzer,et al.  Emotional persistence in online chatting communities , 2012, Scientific Reports.

[22]  Pete Burnap,et al.  Machine Classification and Analysis of Suicide-Related Communication on Twitter , 2015, HT.

[23]  Christoph Rosenkranz,et al.  Increasing the Willingness to Collaborate Online: an Analysis of Sentiment-Driven Interactions in Peer Content Production , 2011, ICIS.

[24]  M. Kosinski,et al.  Computer-based personality judgments are more accurate than those made by humans , 2015, Proceedings of the National Academy of Sciences.

[25]  Glen Coppersmith,et al.  Exploratory Analysis of Social Media Prior to a Suicide Attempt , 2016, CLPsych@HLT-NAACL.

[26]  Barbara Poblete,et al.  Do all birds tweet the same?: characterizing twitter around the world , 2011, CIKM '11.

[27]  Saeed Abdullah,et al.  Collective Smile: Measuring Societal Happiness from Geolocated Images , 2015, CSCW.

[28]  Omar Paccagnella,et al.  Do Danes and Italians Rate Life Satisfaction in the Same Way? Using Vignettes to Correct for Individual‐Specific Scale Biases , 2014 .

[29]  Miguel A. Vadillo,et al.  Researching Mental Health Disorders in the Era of Social Media: Systematic Review , 2017, Journal of medical Internet research.

[30]  Stephen Shaoyi Liao,et al.  Combining empirical experimentation and modeling techniques: A design research approach for personalized mobile advertising applications , 2008, Decis. Support Syst..

[31]  W. Duncan A GUIDE TO THE PROJECT MANAGEMENT BODY OF KNOWLEDGE , 1996 .

[32]  Preslav Nakov,et al.  Developing a successful SemEval task in sentiment analysis of Twitter and other social media texts , 2016, Language Resources and Evaluation.

[33]  Mark Dredze,et al.  From ADHD to SAD: Analyzing the Language of Mental Health on Twitter through Self-Reported Diagnoses , 2015, CLPsych@HLT-NAACL.

[34]  Hiroyuki Ohsaki,et al.  Recognizing Depression from Twitter Activity , 2015, CHI.

[35]  John H. L. Hansen,et al.  Dialect Classification on Printed Text using Perplexity Measure and Conditional Random Fields , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[36]  B. Rost,et al.  Prediction of protein secondary structure at better than 70% accuracy. , 1993, Journal of molecular biology.

[37]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[38]  Harith Alani,et al.  Contextual semantics for sentiment analysis of Twitter , 2016, Inf. Process. Manag..

[39]  Lin Qiu,et al.  Do Facebook Status Updates Reflect Subjective Well-Being? , 2015, Cyberpsychology Behav. Soc. Netw..

[40]  Mike Thelwall,et al.  Sentiment in short strength detection informal text , 2010 .

[41]  Frank Schweitzer,et al.  Emotional Divergence Influences Information Spreading in Twitter , 2012, ICWSM.

[42]  Anabel Quan-Haase,et al.  Uses and Gratifications of Social Media: A Comparison of Facebook and Instant Messaging , 2010 .

[43]  Adam D. I. Kramer An unobtrusive behavioral model of "gross national happiness" , 2010, CHI.

[44]  Christophe Giraud-Carrier,et al.  Validating Machine Learning Algorithms for Twitter Data Against Established Measures of Suicidality , 2016, JMIR mental health.

[45]  Tomaso Aste,et al.  When Can Social Media Lead Financial Markets? , 2014, Scientific Reports.

[46]  Winter A. Mason,et al.  Emotional States vs. Emotional Words in Social Media , 2015, WebSci.

[47]  King-wa Fu,et al.  Analyzing Online Sentiment to Predict Telephone Poll Results , 2013, Cyberpsychology Behav. Soc. Netw..

[48]  Khurshid Ahmad,et al.  Visualising sentiments in financial texts? , 2005, Ninth International Conference on Information Visualisation (IV'05).

[49]  Preslav Nakov,et al.  SemEval-2016 Task 4: Sentiment Analysis in Twitter , 2016, *SEMEVAL.

[50]  Fabio Crestani,et al.  Like It or Not , 2016, ACM Comput. Surv..

[51]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[52]  B. Steele,et al.  Subjective Well-Being and Culture Across Time and Space , 2004 .

[53]  Felipe Bravo-Marquez,et al.  From Unlabelled Tweets to Twitter-specific Opinion Words , 2015, SIGIR.

[54]  Gaurav Jain,et al.  An approach to text classification using dimensionality reduction and combination of classifiers , 2004, Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration, 2004. IRI 2004..

[55]  Desheng Dash Wu,et al.  Using text mining and sentiment analysis for online forums hotspot detection and forecast , 2010, Decis. Support Syst..

[56]  Niloy Ganguly,et al.  Understanding the Usage of Idioms in the Twitter Social Network , 2017 .

[57]  Cengiz Acartürk,et al.  Does the Strength of Sentiment Matter? A Regression Based Approach on Turkish Social Media , 2017, NLDB.

[58]  C. Fuchs Social Media: A Critical Introduction , 2013 .

[59]  H. Christensen,et al.  Detecting suicidality on Twitter , 2015 .

[60]  Conal Smith,et al.  Comparing Happiness across the World: Does Culture Matter? , 2015 .

[61]  Eric Horvitz,et al.  Characterizing and predicting postpartum depression from shared facebook data , 2014, CSCW.

[62]  Berkant Barla Cambazoglu,et al.  A large-scale sentiment analysis for Yahoo! answers , 2012, WSDM '12.

[63]  M. Minkov,et al.  Nations With More Dialectical Selves Exhibit Lower Polarization in Life Quality Judgments and Social Opinions , 2009 .

[64]  Jonathan J. H. Zhu,et al.  Discussing Occupy Wall Street on Twitter: Longitudinal Network Analysis of Equality, Emotion, and Stability of Public Discussion , 2013, Cyberpsychology Behav. Soc. Netw..

[65]  R. V. Krejcie,et al.  Determining Sample Size for Research Activities , 1970 .

[66]  Soe-Tsyr Yuan,et al.  A personalized and integrative comparison-shopping engine and its applications , 2003, Decis. Support Syst..

[67]  Mark Dredze,et al.  Measuring Post Traumatic Stress Disorder in Twitter , 2014, ICWSM.

[68]  Munmun De Choudhury,et al.  Quantifying and Predicting Mental Illness Severity in Online Pro-Eating Disorder Communities , 2016, CSCW.

[69]  Timos K. Sellis,et al.  Diversifying User Comments on News Articles , 2012, WISE.

[70]  Veselin Stoyanov,et al.  Evaluation Measures for the SemEval-2016 Task 4 “Sentiment Analysis in Twitter” (Draft: Version 1.13) , 2016 .

[71]  Tingshao Zhu,et al.  Distributed under Creative Commons Cc-by 4.0 Creating a Chinese Suicide Dictionary for Identifying Suicide Risk on Social Media , 2022 .

[72]  Tingshao Zhu,et al.  Identifying Chinese Microblog Users With High Suicide Probability Using Internet-Based Profile and Linguistic Features: Classification Model , 2015, JMIR mental health.

[73]  Daniele Quercia,et al.  Tracking "gross community happiness" from tweets , 2012, CSCW.

[74]  Aixin Sun,et al.  A Survey of Location Prediction on Twitter , 2017, IEEE Transactions on Knowledge and Data Engineering.

[75]  Mike Thelwall,et al.  Topic-based sentiment analysis for the social web: The role of mood and issue-related words , 2013, J. Assoc. Inf. Sci. Technol..

[76]  Erik Cambria,et al.  Fusing audio, visual and textual clues for sentiment analysis from multimodal content , 2016, Neurocomputing.

[77]  E. Diener,et al.  Positivity and the Construction of Life Satisfaction Judgments: Global Happiness is not the Sum of its Parts , 2000 .

[78]  Tingshao Zhu,et al.  Detecting Suicidal Ideation in Chinese Microblogs with Psychological Lexicons , 2014, 2014 IEEE 11th Intl Conf on Ubiquitous Intelligence and Computing and 2014 IEEE 11th Intl Conf on Autonomic and Trusted Computing and 2014 IEEE 14th Intl Conf on Scalable Computing and Communications and Its Associated Workshops.

[79]  Jason J. Jung,et al.  Social big data: Recent achievements and new challenges , 2015, Information Fusion.

[80]  Avi Arampatzis,et al.  Sentiment analysis of greek tweets and hashtags using a sentiment lexicon , 2015, Panhellenic Conference on Informatics.

[81]  Stefan Priesner Gross National Happiness – Bhutan ’ s Vision of Development and its Challenges , 2002 .