Sentiment analysis of online Chinese comments based on statistical learning combining with pattern matching

Sentiment analysis, as a branch of unstructured data mining, has interested people greatly. Sentiment analysis based on machine learning method usually considers less sentimental feature extraction. This article presents a method based on machine learning combining with pattern matching for sentiment analysis. We conduct basic sub‐word first, and then designed the keyword extraction strategy. We designed some emotional expression patterns. After the success matching to those patterns, we get emotional features, which are in the form of sequence. For each feature pattern, we calculated the value of emotional tendency, and finally to obtain the emotional tendency of the web comment based on machine‐learning method. The experiment result shows the method can improve the classification performances compared to using regular machine‐learning method.

[1]  James Nga-Kwok Liu,et al.  Sentiment classification of online reviews: using sentence-based language model , 2014, J. Exp. Theor. Artif. Intell..

[2]  Chu-Ren Huang,et al.  Chinese sentiment classification based on stacking combination method , 2010 .

[3]  Shi Shuicai An Improved Word Similarity Computing Method Based on HowNet , 2008 .

[4]  Jian Xiao-ya Text Categorization Based on Naive Bayesian , 2013 .

[5]  Songbo Tan,et al.  Optimizing modularity to identify semantic orientation of Chinese words , 2010, Expert Syst. Appl..

[6]  Zhou Hai-yan Study on Web text categorization and algorithm based on machine learning , 2009 .

[7]  Qiang Ye,et al.  Automatically Measuring Subjectivity of Chinese Sentences for Sentiment Analysis to Reviews on the Internet , 2007 .

[8]  Yang Zhihao Text Orientation Identification Based on Semantic Comprehension , 2007 .

[9]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[10]  Wei Zhou,et al.  改进的基于语义理解的文本情感分类方法研究 (Research of Text Sentiment Classification Based on Improved Semantic Comprehension) , 2017, 计算机科学.

[11]  Wei Shi,et al.  Sentiment analysis of Chinese microblogging based on sentiment ontology: a case study of ‘7.23 Wenzhou Train Collision’ , 2013, Connect. Sci..

[12]  Wu Peng Research on the Recognition of Text Valence , 2009 .

[13]  Efim B. Kinber Three-way automata on rectangular tapes over a one-letter alphabet , 1985, Inf. Sci..

[14]  Janyce Wiebe,et al.  Learning Subjective Language , 2004, CL.