Sentiment Analysis in Turkish with Deep Learning

Social media (such as Twitter) helps users express their opinions easily through writings. With the increase in the use of social media, the amount of textual data coming from direct users has grown enormously. These data can help to extract meanings of the specific situations using text mining. For example, in moments of disaster, social media can be used to get important information about the current situation. Understanding the emotions of the users can help humanitarian aid to be allocated more effectively. Sentiment analysis is a method used to get emotion rank from the textual data. It helps get the numerical scale of the emotion from the unstructured data. This study aims to create sentiment analysis, also known as opinion mining, from the given data by applying both machine learning and deep learning methods for disaster moments. We intend to see which technique is more successful in terms of sentiment analysis through comparison of the methods. We created a Turkish sentiment analysis about the coup attempt that happened on July 15, 2016. In this work, the coup attempt is considered a disaster as it consisted of violence and happened suddenly. The dataset used is comprised of unstructured sentences and labels with sentiment points that are either positive or negative. In this work, we pre-process the dataset and apply Turkish Natural Language Processing (NLP). Three text-features (Bag of Words, Tf-Idf, Word 2Vec) are used to vectorize and prepare the dataset for the application of the models. Then, we apply machine learning and deep learning methods and compare them while looking at the results. Examining the results and comparison, we aim to see whether social media can help disaster management. In this comparison, we observed that deep learning has higher accuracy. Our findings indicated that social media can be used to understand sudden situations like disasters and may help to decide how to act.

[1]  Darcy Reynard,et al.  Harnessing the power of machine learning: Can Twitter data be useful in guiding resource allocation decisions during a natural disaster? , 2019 .

[2]  Serkan Savaş,et al.  Data analysis through social media according to the classified crime , 2019, Turkish J. Electr. Eng. Comput. Sci..

[3]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[4]  Rong Jin,et al.  Understanding bag-of-words model: a statistical framework , 2010, Int. J. Mach. Learn. Cybern..

[5]  Sayyada Hajera Begum,et al.  Sentiment analysis to predict election results using Python , 2018, 2018 2nd International Conference on Inventive Systems and Control (ICISC).

[6]  Lei Zou,et al.  Mining Twitter Data for Improved Understanding of Disaster Resilience , 2018 .

[7]  Theus Hossmann,et al.  Twitter in disaster mode: opportunistic communication and distribution of sensor data in emergencies , 2011 .

[8]  Kiarash Ghazvini,et al.  Predictors of tuberculosis: Application of a logistic regression model , 2019 .

[9]  Serkan Ayvaz,et al.  Sentiment analysis on Twitter: A text mining approach to the Syrian refugee crisis , 2018, Telematics Informatics.

[10]  Rahim Dehkharghani,et al.  Sentiment Analysis in Turkish : Towards a Complete Framework , 2015 .

[11]  Mehmet Ulas Cakir,et al.  Text Mining Analysis in Turkish Language Using Big Data Tools , 2016, 2016 IEEE 40th Annual Computer Software and Applications Conference (COMPSAC).

[12]  Xiao Huang,et al.  Geospatial Assessment of Wetness Dynamics in the October 2015 SC Flood with Remote Sensing and Social Media , 2018 .

[13]  Hayri Sever,et al.  Developing Turkish sentiment lexicon for sentiment analysis using online news media , 2016, 2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA).

[14]  Gary L. Frankwick,et al.  Effects of big data analytics and traditional marketing analytics on new product success: A knowledge fusion perspective , 2016 .

[15]  Kathleen M. Carley,et al.  Using tweets to support disaster planning, warning and response , 2016 .

[16]  Bruno Trstenjak,et al.  on Intelligent Manufacturing and Automation , 2013 KNN with TF-IDF Based Framework for Text Categorization , 2014 .

[17]  Kamal Eddine El Kadiri,et al.  Random Forest and Support Vector Machine based Hybrid Approach to Sentiment Analysis , 2018 .