Cooking Is Creating Emotion: A Study on Hinglish Sentiments of Youtube Cookery Channels Using Semi-Supervised Approach

The success of Youtube has attracted a lot of users, which results in an increase of the number of comments present on Youtube channels. By analyzing those comments we could provide insight to the Youtubers that would help them to deliver better quality. Youtube is very popular in India. A majority of the population in India speak and write a mixture of two languages known as Hinglish for casual communication on social media. Our study focuses on the sentiment analysis of Hinglish comments on cookery channels. The unsupervised learning technique DBSCAN was employed in our work to find the different patterns in the comments data. We have modelled and evaluated both parametric and non-parametric learning algorithms. Logistic regression with the term frequency vectorizer gave 74.01% accuracy in Nisha Madulika’s dataset and 75.37% accuracy in Kabita’s Kitchen dataset. Each classifier is statistically tested in our study.

[1]  Usman Qamar,et al.  A semi-supervised approach to sentiment analysis using revised sentiment strength based on SentiWordNet , 2017, Knowledge and Information Systems.

[2]  F. Z. Laallam,et al.  Opinion Extraction and Classification of Real-Time YouTube Cooking Recipes Comments , 2018, AMLTA.

[3]  Alper Kürşat Uysal,et al.  Feature Selection for Comment Spam Filtering on YouTube , 2018 .

[4]  Philemon Bantimaroudis,et al.  Hybrid salience: Examining the role of traditional and digital media in the rise of the Greek radical left , 2018, Journalism.

[5]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[6]  Joseph Timoney,et al.  Nostalgic Sentiment Analysis of YouTube Comments for Chart Hits of the 20th Century , 2018, AICS.

[7]  Rui Xia,et al.  Ensemble of feature sets and classification algorithms for sentiment classification , 2011, Inf. Sci..

[8]  Timothy W. Finin,et al.  Delta TFIDF: An Improved Feature Space for Sentiment Analysis , 2009, ICWSM.

[9]  Tiago A. Almeida,et al.  TubeSpam: Comment Spam Filtering on YouTube , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[10]  Valeria De Antonellis,et al.  PREFer: A prescription-based food recommender system , 2017, Comput. Stand. Interfaces.

[11]  Eduardo R. Hruschka,et al.  A Survey and Comparative Study of Tweet Sentiment Analysis via Semi-Supervised Learning , 2016, ACM Comput. Surv..

[12]  Gui Xiaolin,et al.  Comparison Research on Text Pre-processing Methods on Twitter Sentiment Analysis , 2017, IEEE Access.

[13]  Abhishek Kaushik,et al.  A Comprehensive Study of Text Mining Approach , 2016 .

[14]  Pakawan Pugsee,et al.  Comment analysis for food recipe preferences , 2015, 2015 12th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON).

[15]  Desislava Zhekova,et al.  Do Good Recipes Need Butter ? Predicting User Ratings of Online Recipes , 2013 .

[16]  Abhishek Kaushik,et al.  A Study on Sentiment Analysis: Methods and Tools , 2015 .

[17]  Siti Mariyam Shamsuddin,et al.  Deep learning-based sentiment classification of evaluative text based on Multi-feature fusion , 2019, Inf. Process. Manag..

[18]  Lei Zhang,et al.  Combining lexicon-based and learning-based methods for twitter sentiment analysis , 2011 .

[19]  Suad Alhojely,et al.  Sentiment Analysis and Opinion Mining: A Survey , 2016 .

[20]  Sourabh Joshi,et al.  Comparative Study of Classification Algorithms used in Sentiment Analysis , 2014 .

[21]  Nafis Irtiza Trinto,et al.  Detecting Multilabel Sentiment and Emotions from Bangla YouTube Comments , 2018, 2018 International Conference on Bangla Speech and Language Processing (ICBSLP).

[22]  Kumar Ravi,et al.  Sentiment classification of Hinglish text , 2016, 2016 3rd International Conference on Recent Advances in Information Technology (RAIT).

[23]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[24]  Abhay Sharma,et al.  An Investigation of Supervised Learning Methods for Authorship Attribution in Short Hinglish Texts using Char & Word N-grams , 2018, ArXiv.

[25]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[26]  R. Rajasree,et al.  Sentiment analysis in twitter using machine learning techniques , 2013, 2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT).

[27]  Muhammad Shahid,et al.  Sentiment classification of Roman-Urdu opinions using Naïve Bayesian, Decision Tree and KNN classification techniques , 2016, J. King Saud Univ. Comput. Inf. Sci..

[28]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[29]  Estevam R. Hruschka,et al.  Tweet sentiment analysis with classifier ensembles , 2014, Decis. Support Syst..

[30]  Cheri Ketchum,et al.  The Essence of Cooking Shows: How the Food Network Constructs Consumer Fantasies , 2005 .

[31]  Veenu Mangat,et al.  Dictionary based Sentiment Analysis of Hinglish Text , 2017 .

[32]  Vineet Kansal,et al.  Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis , 2019, Social Network Analysis and Mining.