In recent years, subjective texts have shown great application value. As a hot research issues in the field of natural language processing, analysis of emotions in text, has attracted attentions from many scholars and also greatly develops research on the emotional polarity of Chinese texts. This paper presents an emotional classification algorithm combining dictionary and ensemble classifier. Firstly, based on the fusion of multiple dictionaries such as emotional dictionary, degree dictionary, and negative dictionary, output the negative and positive scores of each sentence according to the designed emotion calculation algorithm. Defining the difference value between negative and positive scores as the emotional tendency value, the samples are sorted by the amount of emotional tendency value and samples with the highest emotional tendency value are selected as the training samples. Finally, the ensemble classifier is used to classify the text emotions. Based on six machine learning algorithms including polynomial Bayes, decision tree, random forest, k-nearest neighbor, SVM, and logistic regression, the ensemble classifier aims to achieve the best classification effect and minimize the disadvantages of individual classifiers. The results show that the classification accuracy of the ensemble classifier is better than that of individual classifiers.
[1]
Ioannis Hatzilygeroudis,et al.
Recognizing emotions in text using ensemble of classifiers
,
2016,
Eng. Appl. Artif. Intell..
[2]
Wang Xiaohao,et al.
Chinesemicro-blog Sentiment Analysis using Both Explicit and Implicit Text Features
,
2016
.
[3]
He Feng-ying.
Orientation analysis for Chinese blog text based on semantic comprehension
,
2011
.
[4]
Rui Xia,et al.
Ensemble of feature sets and classification algorithms for sentiment classification
,
2011,
Inf. Sci..
[5]
Estevam R. Hruschka,et al.
Tweet sentiment analysis with classifier ensembles
,
2014,
Decis. Support Syst..
[6]
Yuan Xu,et al.
基于语义理解和机器学习的混合的中文文本情感分类算法框架 (Hybrid Algorithm Framework for Sentiment Classification of Chinese Based on Semantic Comprehension and Machine Learning)
,
2015,
计算机科学.