Social Media Engineering for Issues Feature Extraction using Categorization Knowledge Modelling and Rule-based Sentiment Analysis

A company maintains and improves its quality services by paying attention to reviews and complaints from users. The complaints from users are commonly written using human natural language expression so that their messages are computationally difficult to extract and proceed. To overcome this difficulty, in this study, we presented a new system for issues feature extraction from users’ reviews and complaints from social media data. This system consists of four main functions: (1) Data Crawling and Preprocessing, (2) Categorization Knowledge Modelling, (3) Rule-based Sentiment Analysis, and (4) Application Environment. Data Crawling and Preprocessing provides data acquisition from users’ tweets on social media, crawls the data and applies the data preprocessing. Categorization Knowledge Modelling provides text mining of textual data, vector space transformation to create knowledge metadata, context recognition of keyword queries to the knowledge metadata, and similarity measurement for categorization. In the Rule-based Sentiment Analysis, we developed our own rules of computatioal linguistics to measure polarity of sentiment. Application Environment consists of 3 layers: database management, back-end services and front-end services. For applicability of our proposed system, we conducted two kinds of experimental study: (1) categorization performance, and (2) sentiment analysis performance. For categorization performance, we used 8743 tweet data and performed 82% of accuracy. For categorization performance, we made experiments on 217 tweet data and performed 92% of accuracy.

[1]  Ayu Purwarianti,et al.  InaNLP: Indonesia natural language processing toolkit, case study: Complaint tweet classification , 2016, 2016 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA).

[2]  Ali Ridho Barakbah,et al.  Temporal sentiment analysis for opinion mining of ASEAN free trade area on social media , 2016, 2016 International Conference on Knowledge Creation and Intelligent Computing (KCIC).

[3]  Khin Zezawar Aung,et al.  Sentiment analysis of students' comment using lexicon based approach , 2017, 2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS).

[4]  Kuspuji Catur Bagus Wicaksono Mengukur Efektivitas Social Media Bagi Perusahaan , 2013 .

[5]  Iuliana Cetina,et al.  The Effects of Social Media Marketing on Online Consumer Behavior , 2013 .

[6]  Choochart Haruechaiyasak,et al.  Social Media Text Classification by Enhancing Well-Formed Text Trained Model , 2016 .

[7]  Alex Clark Pre-processing very noisy text , 2003 .

[8]  Aldo Erianda,et al.  Improvement of Email And Twitter Classification Accuracy Based On Preprocessing Bayes Naive Classifier Optimization In Integrated Digital Assistant , 2017 .

[9]  Muhammad Faheem Mushtaq,et al.  Efficient processing of GRU based on word embedding for text classification , 2019, JOIV : International Journal on Informatics Visualization.

[10]  Bracha Shapira,et al.  CoBAn : A Context Based Approach for Text Classification , 2014 .

[11]  Mitsuru Ishizuka,et al.  SentiFul: A Lexicon for Sentiment Analysis , 2011, IEEE Transactions on Affective Computing.

[12]  Siddhaling Urolagin,et al.  Airline Sentiment Visualization, Consumer Loyalty Measurement and Prediction using Twitter Data , 2018 .

[13]  Sarada Prasad Gochhayat,et al.  Sentiment Analysis for Airlines Services Based on Twitter Dataset , 2019, Social Network Analytics.

[14]  Qigang Gao,et al.  An Ensemble Sentiment Classification System of Twitter Data for Airline Services Analysis , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[15]  Rey-Long Liu Context-Based Term Frequency Assessment for Text Classification , 2008, PRICAI.

[16]  Hiroki Takikawa,et al.  Political polarization in social media: Analysis of the “Twitter political field” in Japan , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[17]  Ayu Purwarianti,et al.  Sentiment classification for Indonesian message in social media , 2011, Proceedings of the 2011 International Conference on Electrical Engineering and Informatics.

[18]  Basabi Chakraborty,et al.  Topic extraction from millions of tweets using singular value decomposition and feature selection , 2015, 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).

[19]  William Ford,et al.  Numerical Linear Algebra with Applications: Using MATLAB , 2014 .

[20]  Rey-Long Liu Context recognition for hierarchical text classification , 2009, J. Assoc. Inf. Sci. Technol..

[21]  John Elder,et al.  Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications , 2012 .