An opinion mining technique which was developed from document classification in area of data mining now becomes a common interest in domestic as well as international industries. The core of opinion mining is to decide precisely whether an opinion document is a positive or negative one. Although many related approaches have been previously proposed, a classification accuracy was not satisfiable enough to applying them in practical applications. A opinion documents written in Korean are not easy to determine a polarity automatically because they often include various and ungrammatical words in expressing subjective opinions. Proposed in this paper is a new approach of classification of opinion documents, which considers only a frequency of word patterns and excludes the grammatical factors as much as possible. In proposed method, we express a document into a bag of words and then apply a learning algorithm using a frequency of word patterns, and finally decide the polarity of the document using a score function. Additionally, we also present the experiment results for evaluating the accuracy of the proposed method.
[1]
Bing Liu,et al.
Opinion observer: analyzing and comparing opinions on the Web
,
2005,
WWW '05.
[2]
Dongil Han,et al.
Automatic Extraction of Opinion Words from Korean Product Reviews Using the k-Structure
,
2010
.
[3]
Joon Hyung Shim,et al.
The Development of Automatic Ontology Generation System Using Extended Search Keywords
,
2009
.
[4]
Eric Chang,et al.
Red Opal: product-feature scoring from reviews
,
2007,
EC '07.
[5]
Ruwei Dai,et al.
AMAZING: A sentiment mining and retrieval system
,
2009,
Expert Syst. Appl..
[6]
Bing Liu,et al.
The utility of linguistic rules in opinion mining
,
2007,
SIGIR.
[7]
Jae-Young Chang,et al.
A Sentiment Analysis Algorithm for Automatic Product Reviews Classification in On-Line Shopping Mall
,
2009
.
[8]
Sang-goo Lee,et al.
A Korean Product Review Analysis System Using a Semi-Automatically Constructed Semantic Dictionary
,
2008
.