论文信息 - Sentiwordnet for Bangla Sentiwordnet for Bangla

Sentiwordnet for Bangla Sentiwordnet for Bangla

Advances in NLP techniques have led to a great demand for tagging and analysis of the sentiments from unstructured natural language data over the last few years. A typical approach to sentiment analysis is to start with a lexicon of positive and negative words and phrases. In these lexicons, entries are tagged with their prior out of context polarity. Unfortunately all efforts found in literature deal mostly with English texts. In this squib, we propose a computational technique of generating an equivalent SentiWordNet (Bengali) from publicly available English Sentiment lexicons and EnglishBengali bilingual dictionary. The target language for the present task is Bengali, though the methodology could be replicated for any new language. There are two main lexical resources widely used in English for Sentiment analysis: SentiWordNet (Esuli et. al., 2006) and Subjectivity Word List (Wilson et. al., 2005). SentiWordNet is an automatically constructed lexical resource for English which assigns a positivity score and a negativity score to each WordNet synset. The subjectivity lexicon was compiled from manually developed resources augmented with entries learned from corpora. The entries in the Subjectivity lexicon have been labelled for part of speech (POS) as well as either strong or weak subjective tag depending on reliability of the subjective nature of the entry.

Sivaji Bandyopadhyay | Amitava Das

[1] Janyce Wiebe,et al. A Computational Theory of Perspective and Reference in Narrative , 1988, ACL.

[2] Andrea Esuli,et al. SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[3] Sivaji Bandyopadhyay,et al. Theme detection an exploration of opinion subjectivity , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[4] Vasileios Hatzivassiloglou,et al. Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[5] Marshall S. Smith,et al. The general inquirer: A computer approach to content analysis. , 1967 .

[6] Janyce Wiebe,et al. Development and Use of a Gold-Standard Data Set for Subjectivity Classifications , 1999, ACL.

[7] Peter D. Turney. Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[8] Satoshi Morinaga,et al. Mining product reputations on the Web , 2002, KDD.

[9] Janusz S. Bień,et al. Beliefs, Points of View, and Multiple Environments , 1983, Cogn. Sci..

[10] Mike Y. Chen,et al. Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web , 2001 .

[11] Janyce Wiebe,et al. Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.