Latent information mining for semi-supervised sentiment classification in catering reviews

In this paper, we aim to address the issue that semi-supervised learning is prone to be influenced by the quality and quantity of initial seeds. In order to expand the initial labeled data, we select credible samples from unlabeled data by a proposed bilateral latent information miner. The miner can extract information from unlabeled data for both positive and negative class respectively. Then we train a semi-supervised sentiment learner with the expanded labeled data. Experiments show that the proposed method can achieve a better and more robust performance than other methods, especially when initial labeled data is randomly selected in a small size.

[1]  Alexei A. Efros,et al.  Ensemble of exemplar-SVMs for object detection and beyond , 2011, 2011 International Conference on Computer Vision.

[2]  Zoubin Ghahramani,et al.  Learning from labeled and unlabeled data with label propagation , 2002 .

[3]  Xiaojun Wan,et al.  Co-Training for Cross-Lingual Sentiment Classification , 2009, ACL.

[4]  Janyce Wiebe,et al.  Effects of Adjective Orientation and Gradability on Sentence Subjectivity , 2000, COLING.

[5]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[6]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[7]  Cheng Wang,et al.  Co-training for Semi-supervised Sentiment Classification Based on Dual-view Bags-of-words Representation , 2015, ACL.

[8]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .

[9]  Meng Wang,et al.  Semi-supervised Sentiment Classification with Self-training on Feature Subspaces , 2014, CLSW.

[10]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[11]  Vikas Sindhwani,et al.  Document-Word Co-regularization for Semi-supervised Sentiment Analysis , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[12]  Patrick Pérez,et al.  Exemplar SVMs as visual feature encoders , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[14]  Michael Gamon,et al.  Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis , 2004, COLING.