Amrita-CEN-SentiDB 1: Improved Twitter Dataset for Sentimental Analysis and Application of Deep learning

Powerful weapon in today's world is ones emotion is social media. They have the power to make an individual trending overnight or even may pull down anyone reputation. In this paper, the sentiment analysis task has been performed by collecting the dataset from the publically available sources and by merging them together to form a new dataset for the sentiment analysis task that is positive or negative sentiment based on the context of the subject. A new reliable dataset is subjected to various pre-processing techniques and then the feature extraction techniques aftermath they are passed to the deep learning techniques out of which by using the text representation method, global vectors (glovec) with the long short-term memory (lstm) has the highest accuracy of 75%, which is the benchmark accuracy for this dataset. For the research purpose the dataset used in this paper is made available publically for research purpose.

[1]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[2]  Sanjeev Arora,et al.  A Simple but Tough-to-Beat Baseline for Sentence Embeddings , 2017, ICLR.

[3]  Ronald M. Summers,et al.  Deep Learning in Medical Imaging: Overview and Future Promise of an Exciting New Technique , 2016 .

[4]  Saif Mohammad,et al.  Emotion Intensities in Tweets , 2017, *SEMEVAL.

[5]  Khairullah Khan,et al.  A Review of Machine Learning Algorithms for Text-Documents Classification , 2010 .

[6]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[7]  Erik Cambria,et al.  Context-Dependent Sentiment Analysis in User-Generated Videos , 2017, ACL.

[8]  K. P. Soman,et al.  Detecting malicious domain names using deep learning approaches at scale , 2018, J. Intell. Fuzzy Syst..

[9]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[10]  K. P. Soman,et al.  Deep Learning Approach for Intelligent Intrusion Detection System , 2019, IEEE Access.

[11]  K. P. Soman,et al.  Robust Intelligent Malware Detection Using Deep Learning , 2019, IEEE Access.

[12]  Prabaharan Poornachandran,et al.  Scalable Framework for Cyber Threat Situational Awareness Based on Domain Name Systems Data Analysis , 2018 .

[13]  K. P. Soman,et al.  Evaluating deep learning approaches to characterize and classify malicious URL's , 2018, J. Intell. Fuzzy Syst..

[14]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.

[15]  Saif Mohammad,et al.  SemEval-2018 Task 1: Affect in Tweets , 2018, *SEMEVAL.

[16]  Haji Binali,et al.  A new significant area: Emotion detection in E-learning using opinion mining techniques , 2009, 2009 3rd IEEE International Conference on Digital Ecosystems and Technologies.

[17]  Saif Mohammad,et al.  Sentiment Analysis of Short Informal Texts , 2014, J. Artif. Intell. Res..

[18]  Junlan Feng,et al.  Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.

[19]  Akshi Kumar,et al.  Sentiment Analysis on Twitter , 2012 .

[20]  P SomanK.,et al.  TeamCEN at SemEval-2018 Task 1: Global Vectors Representation in Emotion Detection , 2018, SemEval@NAACL-HLT.

[21]  Prema Nedungadi,et al.  Enhanced sentiment analysis of informal textual communication in social media by considering objective words and intensifiers , 2014, International Conference on Recent Advances and Innovations in Engineering (ICRAIE-2014).

[22]  Preslav Nakov,et al.  SemEval-2016 Task 4: Sentiment Analysis in Twitter , 2016, *SEMEVAL.

[23]  Saif Mohammad,et al.  Stance and Sentiment in Tweets , 2016, ACM Trans. Internet Techn..

[24]  Véronique Hoste,et al.  Emotion detection in suicide notes , 2013, Expert Syst. Appl..

[25]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .