Sentiment analysis of Bengali comments with Word2Vec and sentiment information of words

Vector representations of Bengali words learned with the word2vec model (Mikolov et al., 2013) play an important role in Bengali sentiment classification. Words that occur in the same context lie closer together in the word2vec vector space and are more similar to one another than to other words. This article presents a new approach to sentiment classification of Bengali comments that uses word2vec together with sentiment information extracted for individual words. Combining the word2vec word co-occurrence score with the sentiment polarity score of the words yields an accuracy of 75.5%.
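
The idea of combining an embedding-based score with a word-level polarity score can be illustrated with a minimal sketch. It is not the authors' implementation: the toy vectors, the romanized tokens, the seed words, the polarity lexicon, and the mixing weight `alpha` are all hypothetical stand-ins, and cosine similarity to seed words is assumed here as a simple proxy for the word2vec score mentioned in the abstract.

```python
import math

# Hypothetical 3-dimensional toy vectors standing in for trained word2vec
# embeddings (romanized Bengali tokens, chosen only for illustration).
EMBEDDINGS = {
    "bhalo": [0.9, 0.1, 0.0],    # "good"
    "sundor": [0.8, 0.2, 0.1],   # "beautiful"
    "kharap": [-0.9, 0.1, 0.0],  # "bad"
    "baje": [-0.8, 0.2, 0.1],    # "awful"
    "cinema": [0.0, 0.9, 0.3],   # "movie"
}

# Hypothetical word-level polarity lexicon: +1 positive, -1 negative.
POLARITY = {"bhalo": 1.0, "sundor": 1.0, "kharap": -1.0, "baje": -1.0}

# Hypothetical seed words that anchor each sentiment class in vector space.
POSITIVE_SEEDS = ["bhalo"]
NEGATIVE_SEEDS = ["kharap"]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_score(word):
    """Closeness to positive seeds minus closeness to negative seeds."""
    vec = EMBEDDINGS.get(word)
    if vec is None:
        return 0.0  # out-of-vocabulary words contribute nothing
    pos = max(cosine(vec, EMBEDDINGS[s]) for s in POSITIVE_SEEDS)
    neg = max(cosine(vec, EMBEDDINGS[s]) for s in NEGATIVE_SEEDS)
    return pos - neg


def classify(tokens, alpha=0.5):
    """Label a tokenized comment by mixing the two per-word scores.

    alpha is an assumed weight: 1.0 uses only the embedding score,
    0.0 uses only the polarity lexicon.
    """
    total = 0.0
    for token in tokens:
        total += alpha * similarity_score(token)
        total += (1.0 - alpha) * POLARITY.get(token, 0.0)
    return "positive" if total >= 0 else "negative"


if __name__ == "__main__":
    print(classify(["cinema", "sundor"]))
    print(classify(["cinema", "baje"]))
```

In this sketch a neutral word such as "cinema" is equally close to both seeds and has no lexicon entry, so the label is decided by the sentiment-bearing words in the comment.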

[1] Owen Rambow, et al. Sentiment Analysis of Twitter Data, 2011.

[2] Md. Saiful Islam, et al. Word embedding with Hellinger PCA to detect the sentiment of Bengali text, 2016, 2016 19th International Conference on Computer and Information Technology (ICCIT).

[3] Christopher D. Manning, et al. Bilingual Word Embeddings for Phrase-Based Machine Translation, 2013, EMNLP.

[4] Vibhu O. Mittal, et al. Comparative Experiments on Sentiment Classification for Online Product Reviews, 2006, AAAI.

[5] Md. Saiful Islam, et al. Supervised approach of sentimentality extraction from Bengali Facebook status, 2016, 2016 19th International Conference on Computer and Information Technology (ICCIT).

[6] Cecilia Ovesdotter Alm, et al. Emotions from Text: Machine Learning for Text-based Emotion Prediction, 2005, HLT.

[7] Dan Klein, et al. How much do word embeddings encode about syntax?, 2014, ACL.

[8] K. M. Azharul Hasan, et al. Sentiment detection from Bangla text using contextual valency analysis, 2014, 2014 17th International Conference on Computer and Information Technology (ICCIT).

[9] Ronan Collobert, et al. Word Embeddings through Hellinger PCA, 2013, EACL.

[10] Wasifa Chowdhury, et al. Performing sentiment analysis in Bangla microblog posts, 2014, 2014 International Conference on Informatics, Electronics & Vision (ICIEV).

[11] Ming Zhou, et al. Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification, 2014, ACL.

[12] P. Lewis. Ethnologue: languages of the world, 2009.

[13] Omer Levy, et al. Improving Distributional Similarity with Lessons Learned from Word Embeddings, 2015, TACL.

[14] Peter Kulchyski and , 2015.

[15] Steven Skiena, et al. The Expressive Power of Word Embeddings, 2013, ArXiv.

[16] K. M. Azharul Hasan, et al. Sentiment Recognition from Bangla Text, 2013.

[17] Omer Levy, et al. Dependency-Based Word Embeddings, 2014, ACL.

[18] Dipankar Das. Analysis and tracking of emotions in English and Bengali texts: a computational approach, 2011, WWW.

[19] Virendrakumar A. Dhotre, et al. SVM and HMM Based Hybrid Approach of Sentiment Analysis for Teacher Feedback Assessment, 2014.

[20] Yoshua Bengio, et al. Word Representations: A Simple and General Method for Semi-Supervised Learning, 2010, ACL.

[21] Jeffrey Dean, et al. Efficient Estimation of Word Representations in Vector Space, 2013, ICLR.