Opinion Mining of Movie Review using Hybrid Method of Support Vector Machine and Particle Swarm Optimization

Nowadays, online social media is online discourse where people contribute to create content, share it, bookmark it, and network at an impressive rate. The faster message and ease of use in social media today is Twitter. The messages on Twitter include reviews and opinions on certain topics such as movie, book, product, politic, and so on. Based on this condition, this research attempts to use the messages of twitter to review a movie by using opinion mining or sentiment analysis. Opinion mining refers to the application of natural language processing, computational linguistics, and text mining to identify or classify whether the movie is good or not based on message opinion. Support Vector Machine (SVM) is supervised learning methods that analyze data and recognize the patterns that are used for classification. This research concerns on binary classification which is classified into two classes. Those classes are positive and negative. The positive class shows good message opinion; otherwise the negative class shows the bad message opinion of certain movies. This justification is based on the accuracy level of SVM with the validation process uses 10-Fold cross validation and confusion matrix. The hybrid Partical Swarm Optimization (PSO) is used to improve the election of best parameter in order to solve the dual optimization problem. The result shows the improvement of accuracy level from 71.87% to 77%.

[1]  W. B. Cavnar,et al.  N-gram-based text categorization , 1994 .

[2]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[3]  Chien-Liang Liu,et al.  Movie Rating and Review Summarization in Mobile Environment , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[4]  Rudy Prabowo,et al.  Sentiment analysis: A combined approach , 2009, J. Informetrics.

[5]  Mung Chiang,et al.  Why watching movie tweets won't tell the whole story? , 2012, WOSN '12.

[6]  Azah Kamilah Muda,et al.  PSO and Computationally Inexpensive Sequential Forward Floating Selection in acquiring significant features for handwritten authorship , 2011, 2011 11th International Conference on Hybrid Intelligent Systems (HIS).

[7]  Joan-Andreu Sánchez,et al.  A hybrid language model based on a combination of N-grams and stochastic context-free grammars , 2004, TALIP.

[8]  Lina Zhou,et al.  Movie Review Mining: a Comparison between Supervised and Unsupervised Classification Approaches , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[9]  Lin Pan,et al.  Sentiment Analysis in Chinese , 2012 .

[10]  Yung-Chih Chen,et al.  A PSO- SVM Lips Recognition Method Based on Active Basis Model , 2010, 2010 Fourth International Conference on Genetic and Evolutionary Computing.

[11]  Sheng-wei Fei,et al.  Chinese Grain Production Forecasting Method Based on Particle Swarm Optimization-based Support Vector Machine , 2009 .

[12]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[13]  Nouman Azam,et al.  Comparison of term frequency and document frequency based feature selection metrics in text categorization , 2012, Expert Syst. Appl..

[14]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[15]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[16]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[17]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[18]  Jian Zhu,et al.  Sentiment classification using the theory of ANNs , 2010 .

[19]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[20]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[21]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[22]  Marie-Francine Moens,et al.  Automatic Sentiment Analysis in On-line Text , 2007, ELPUB.

[23]  Luis Alfonso Ureña López,et al.  Experiments with SVM to classify opinions in different domains , 2011, Expert Syst. Appl..

[24]  Xiangji Huang,et al.  Mining Online Reviews for Predicting Sales Performance: A Case Study in the Movie Domain , 2012, IEEE Transactions on Knowledge and Data Engineering.

[25]  Steven Salzberg,et al.  On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach , 1997, Data Mining and Knowledge Discovery.