String-based Multinomial Naïve Bayes for Emotion Detection among Facebook Diabetes Community

Abstract This paper determined the emotions among the online diabetes community using the string-based Multinomial Naive Bayes algorithm. Facebook posts from official diabetes support groups were crawled, for a total of 15 000 pre-processed posts. Of these, 800 were manually annotated by human experts. The posts were first classified according to Plutchik’s wheel of emotions, comprising of eight dominant emotions: anger, sadness, fear, joy, surprised, trust, anticipation and disgust using the NRC Emotion Lexicon (Emolex). The emotion classifications were then refined using string-based Multinomial Naive Bayes algorithm, with results indicating a 6.3% improvement (i.e. 82% vs. 75.7% for average F-score) when compared to the Emolex-approach, and other machine learning algorithms, namely, Naive Bayes and Multinomial Naive Bayes. The higher accuracy in emotion classification reflects the feasibility of our approach. Further analysis also revealed emotions such as joy, fear and sadness to be of the highest frequencies for the diabetes community.

[1]  Yang Liu,et al.  Multi-class sentiment classification: The experimental comparisons of feature selection and machine learning algorithms , 2017, Expert Syst. Appl..

[2]  D. Blanch-Hartigan Patient satisfaction with physician errors in detecting and identifying patient emotion cues. , 2013, Patient education and counseling.

[3]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[4]  J. Shaw,et al.  IDF Diabetes Atlas: Global estimates for the prevalence of diabetes for 2015 and 2040. , 2011, Diabetes research and clinical practice.

[6]  Fawaz S. Al-Anzi,et al.  Toward an enhanced Arabic text classification using cosine similarity and Latent Semantic Indexing , 2017, J. King Saud Univ. Comput. Inf. Sci..

[7]  Ying Wah Teh,et al.  Text mining of news-headlines for FOREX market prediction: A Multi-layer Dimension Reduction Algorithm with semantics and sentiment , 2015, Expert Syst. Appl..

[8]  Walaa Medhat,et al.  Sentiment analysis algorithms and applications: A survey , 2014 .

[9]  Osmar R. Zaïane,et al.  Current State of Text Sentiment Analysis from Opinion to Emotion Mining , 2017, ACM Comput. Surv..

[10]  Fouzi Harrag,et al.  Improving Arabic Text Categorization Using Neural Network with SVD , 2010, J. Digit. Inf. Manag..

[11]  Timothy Bickmore,et al.  Health dialog systems for patients and consumers , 2006, J. Biomed. Informatics.

[12]  S. Shyam Sundar,et al.  Posting, commenting, and tagging: Effects of sharing news stories on Facebook , 2015, Comput. Hum. Behav..