Sentiment analysis for Bangla microblog posts

6 2. RELATED WORKS IN ENGLISH 9 3. LEXICON 11 3.1 METHODOLOGY TO CONSTRUCT THE BANGLA SENTIMENT LEXICON: 11 4. METHODOLOGY 13 4.1 DATASET: 13 4.2 TWEET DESCRIPTION: 14 4.3 PREPROCESSING: 14 4.4 OUR APPROACH 17 4.4.1 TRAINING SET CONSTRUCTION: 17 4.4.1.1 SEMI-SUPERVISED: 17 4.4.1.2 SELF-TRAINING BOOTSTRAPPING: 17 4.4.2 FEATURE EXTRACTION: 20 4.4.3 CLASSIFIER: 23 4.4.3.1 SUPPORT VECTOR MACHINE: 23 4.4.3.2 MAXIMUM ENTROPY: 24 5. EXPERIMENTAL RESULTS AND EVALUATION 25 5.1 EVALUATION METRICS: 25 5.2 RESULTS AND DISCUSSION: 27 6. CONCLUSION AND FUTURE WORKS 33 REFERENCES 43

[1]  Yasuhiro Suzuki,et al.  Application of Semi-supervised Learning to Evaluative Expression Classification , 2006, CICLing.

[2]  Alan F. Smeaton,et al.  Classifying sentiment in microblogs: is brevity an advantage? , 2010, CIKM.

[3]  Luis Alfonso Ureña López,et al.  Sentiment analysis in Twitter , 2012, Natural Language Engineering.

[4]  Takashi Inui,et al.  Latent Variable Models for Semantic Orientations of Phrases , 2006, EACL.

[5]  Rada Mihalcea,et al.  Multilingual Subjectivity Analysis Using Machine Translation , 2008, EMNLP.

[6]  Finn Årup Nielsen,et al.  A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs , 2011, #MSM.

[7]  Khalid Choukri,et al.  The european language resources association , 1998, LREC.

[8]  Rada Mihalcea,et al.  Co-training and Self-training for Word Sense Disambiguation , 2004, CoNLL.

[9]  Lei Zhang,et al.  Combining lexicon-based and learning-based methods for twitter sentiment analysis , 2011 .

[10]  John Carroll,et al.  Automatic Seed Word Selection for Unsupervised Sentiment Classification of Chinese Text , 2008, COLING.

[11]  Matthias Seeger,et al.  Learning from Labeled and Unlabeled Data , 2010, Encyclopedia of Machine Learning.

[12]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[13]  Diana Maynard,et al.  Automatic Detection of Political Opinions in Tweets , 2011, #MSM.

[14]  Soo-Min Kim,et al.  Identifying and Analyzing Judgment Opinions , 2006, NAACL.

[15]  Vincent Ng,et al.  Mine the Easy, Classify the Hard: A Semi-Supervised Approach to Automatic Sentiment Classification , 2009, ACL.

[16]  Ari Rappoport,et al.  Enhanced Sentiment Learning Using Twitter Hashtags and Smileys , 2010, COLING.

[17]  Dipankar Das,et al.  Labeling Emotion in Bengali Blog Corpus – A Fine Grained Tagging at Sentence Level , 2010 .

[18]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[19]  Yuji Matsumoto,et al.  Collecting Evaluative Expressions for Opinion Extraction , 2004, IJCNLP.

[20]  Alistair Kennedy,et al.  SENTIMENT CLASSIFICATION of MOVIE REVIEWS USING CONTEXTUAL VALENCE SHIFTERS , 2006, Comput. Intell..

[21]  Junlan Feng,et al.  Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.

[22]  Sivaji Bandyopadhyay,et al.  Subjectivity Detection in English and Bengali: A CRF-based Approach , 2009 .

[23]  Akshi Kumar,et al.  Sentiment Analysis on Twitter , 2012 .

[24]  Chu-Ren Huang,et al.  Employing Personal/Impersonal Views in Supervised and Semi-Supervised Sentiment Classification , 2010, ACL.

[25]  Rada Mihalcea,et al.  Learning Multilingual Subjective Language via Cross-Lingual Projections , 2007, ACL.

[26]  Hiroshi Kanayama,et al.  Fully Automatic Lexicon Expansion for Domain-oriented Sentiment Analysis , 2006, EMNLP.

[27]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[28]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[29]  Xiaojun Wan,et al.  Using Bilingual Knowledge and Ensemble Techniques for Unsupervised Chinese Sentiment Analysis , 2008, EMNLP.

[30]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[31]  Xiaoming Chen,et al.  A New Method for Sentiment Classification in Text Retrieval , 2005, IJCNLP.

[32]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.