Feature Extraction and Classification of Movie Reviews

Sentiment analysis identifies a user’s attitude towards a service, a topic or an event and it is very useful for companies that receive many written reviews of their services. We investigate the effect of feature extraction techniques on supervised machine learning classifiers using four different performance metrics using a publicly available movie review dataset. Our objective is to explore different classification algorithms as well as utilizing diverse feature extractors and compare outcomes and finally select the trio of feature extraction technique, classification algorithm and performance metric with the best result for the movie review classification use case.

[1]  Ruppa K. Thulasiram,et al.  Twitter sentiment classification using machine learning techniques for stock markets , 2015, 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[2]  Paolo Rosso,et al.  On the difficulty of automatically detecting irony: beyond a simple case of negation , 2014, Knowledge and Information Systems.

[3]  Mehdi Yousefi,et al.  Feature selection for an SVM based webpage classifier , 2017, 2017 IEEE 4th International Conference on Soft Computing & Machine Intelligence (ISCMI).

[4]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[5]  Andie L. Knutson,et al.  JAPANESE OPINION SURVEYS: THE SPECIAL NEED AND THE SPECIAL DIFFICULTIES , 1945 .

[6]  O. Chapelle Multi-Class Feature Selection with Support Vector Machines , 2008 .

[7]  Vincent A. Knight,et al.  Tweeting the terror: modelling the social media reaction to the Woolwich terrorist attack , 2014, Social Network Analysis and Mining.

[8]  Pasquale De Meo,et al.  Web Data Extraction , Applications and Techniques : A Survey , 2010 .

[9]  Lada A. Adamic,et al.  Internet: Growth dynamics of the World-Wide Web , 1999, Nature.

[10]  Ross Stagner,et al.  The Cross-Out Technique as a Method in Public Opinion Analysis , 1940 .

[11]  Akshi Kumar,et al.  Sentiment Analysis: A Perspective on its Past, Present and Future , 2012 .

[12]  Yvan Saeys,et al.  Robust Feature Selection Using Ensemble Feature Selection Techniques , 2008, ECML/PKDD.

[13]  Abderrahim Beni-Hssane,et al.  Prediction of Suicidal Ideation in Twitter Data using Machine Learning algorithms , 2016 .

[14]  Ying Wah Teh,et al.  Text mining for market prediction: A systematic review , 2014, Expert Syst. Appl..

[15]  Steve R. Gunn,et al.  Design and Analysis of the NIPS2003 Challenge , 2006, Feature Extraction.

[16]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[17]  Geoffrey Zweig,et al.  Syntactic Clustering of the Web , 1997, Comput. Networks.

[18]  Paolo Gastaldo,et al.  An ELM-based model for affective analogical reasoning , 2015, Neurocomputing.

[20]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[21]  Aditya K. Ghose,et al.  Automatic feature learning for vulnerability prediction , 2017, ArXiv.

[22]  J. Keziya Rani,et al.  Mining Opinion Features in Customer Reviews. , 2016 .

[23]  Santanu Kumar Rath,et al.  Classification of Sentimental Reviews Using Machine Learning Techniques , 2015 .

[24]  Uzay Kaymak,et al.  Multi-lingual support for lexicon-based sentiment analysis guided by semantics , 2014, Decis. Support Syst..

[25]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[26]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[27]  Jianyu Miao,et al.  A Survey on Feature Selection , 2016 .

[28]  Vivek Kumar,et al.  Implementation of n-gram Methodology for Rotten Tomatoes Review Dataset Sentiment Analysis , 2017, Int. J. Knowl. Discov. Bioinform..

[29]  Katharina Morik,et al.  Decision Tree and Random Forest Implementations for Fast Filtering of Sensor Data , 2018, IEEE Transactions on Circuits and Systems I: Regular Papers.

[30]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[31]  Jeff Heaton,et al.  An empirical analysis of feature engineering for predictive modeling , 2016, SoutheastCon 2016.

[32]  V. Smrithi Rekha,et al.  Recommending products to customers using opinion mining of online product reviews and features , 2015, 2015 International Conference on Circuits, Power and Computing Technologies [ICCPCT-2015].