Evaluation of Machine Learning Performance Based on BERT Data Representation with LSTM Model to Conduct Sentiment Analysis in Indonesian for Predicting Voices of Social Media Users in the 2024 Indonesia Presidential Election

In this era, the innovation of science and technology has changed rapidly, such as Artificial Intelligence (A.I.) which has helped a lot in human life. Deep learning (DL) as part of A.I. is the development of one of the machine learning models, namely the neural network. With many neural network layers, deep learning models can perform feature extraction and classification processes in a single architecture. This model has proven to perform state-of-the-art machine learning techniques in text classification, pattern recognition, speech, and imagery. Various text classification tasks, including sentiment analysis, have gone beyond AI-based approaches in Deep Learning models. Text data can come from multiple sources, including social media, such as comments on videos on YouTube. Sentiment analysis, one of the opinion mining, is a computational study that analyzes people's opinions from texts. In this research, machine learning performance analysis is carried out on a deep learning method based on BERT data representation with the LSTM method. The implementation of the model uses youtube commentary data on political videos related to the 2024 presidential election in Indonesia; performance analysis is carried out using accuracy, precision, and recall metrics. In this study, the accuracy of the BERT-LSTM model outperformed the BERT model with an accuracy of 0.8783.

[1]  H. Murfi,et al.  The performance of BERT as data representation of text clustering , 2021, Journal of Big Data.

[2]  Peter J. Verovšek,et al.  Promoting Democracy in the Digital Public Sphere: Applying Theoretical Ideals to Online Political Communication , 2021, Javnost - The Public.

[3]  Abheek Nandi,et al.  Comparative Study of Sentiment Analysis Techniques , 2021, Interdisciplinary Research in Technology and Management.

[4]  A. Bustamam,et al.  High Accuracy Conversational AI Chatbot Using Deep Recurrent Neural Networks Based on BiLSTM Model , 2020, 2020 3rd International Conference on Information and Communications Technology (ICOIACT).

[5]  Ayu Purwarianti,et al.  IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding , 2020, AACL.

[6]  Ao Feng,et al.  Target-Dependent Sentiment Classification With BERT , 2019, IEEE Access.

[7]  Hendri Murfi,et al.  Analysis of convolution neural network for transfer learning of sentiment analysis in Indonesian tweets , 2018, DSIT '18.

[8]  Shuai Wang,et al.  Deep learning for sentiment analysis: A survey , 2018, WIREs Data Mining Knowl. Discov..

[9]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[10]  M. D. Carpini Gen.com: Youth, Civic Engagement, and the New Information Environment , 2000 .

[11]  Theresia Gowandi,et al.  Performance Analysis of Hybrid Architectures of Deep Learning for Indonesian Sentiment Analysis , 2021, SCDS.

[12]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[13]  B. Usharani,et al.  Analysis of Supervised and Unsupervised Learning Classifiers for Online Sentiment Analysis , 2018 .

[14]  Pooja Dinkar Shinde,et al.  A Comparative Study of Sentiment Analysis Techniques , 2018 .

[15]  Beth Filar Williams,et al.  Getting Started with Google , 2012 .