Using Machine Learning to Detect Events on the Basis of Bengali and Banglish Facebook Posts

In modern times, ensuring social security has become the prime concern for security administrators. The widespread and recurrent use of social media sites is creating a huge risk for the lives of the general people, as these sites are frequently becoming potential sources of the organization of various types of immoral events. For protecting society from these dangers, a prior detection system which can effectively detect events by analyzing these social media data is essential. However, automating the process of event detection has been difficult, as existing processes must account for diverse writing styles, languages, dialects, post lengths, and et cetera. To overcome these difficulties, we developed an effective model for detecting events, which, for our purposes, were classified as either protesting, celebrating, religious, or neutral, using Bengali and Banglish Facebook posts. At first, the collected posts’ text were processed for language detection, and then, detected posts were pre-processed using stopwords removal and tokenization. Features were then extracted from these pre-processed texts using three sub-processes: filtering, phrase matching of specific events, and sentiment analysis. The collected features were ultimately used to train our Bernoulli Naive Bayes classification model, which was capable of detecting events with 90.41% accuracy (for Bengali-language posts) and 70% (for the Banglish-form posts). For evaluating the effectiveness of our proposed model more precisely, we compared it with two other classifiers: Support Vector Machine and Decision Tree.

[1]  Malik Muhammad Saad Missen,et al.  Multiclass Event Classification from Text , 2021, Sci. Program..

[2]  Krishnaprasad Thirunarayan,et al.  Extracting City Traffic Events from Social Streams , 2015, ACM Trans. Intell. Syst. Technol..

[3]  Manuel Mucientes,et al.  STAC: A web platform for the comparison of algorithms using statistical tests , 2015, 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[4]  Guandong Xu,et al.  What’s Happening Around the World? A Survey and Framework on Event Detection Techniques on Twitter , 2019, Journal of Grid Computing.

[5]  Bernd Resch,et al.  Spatiotemporal event detection: a review , 2020, Int. J. Digit. Earth.

[6]  Bernhard Schölkopf,et al.  Learning with Hypergraphs: Clustering, Classification, and Embedding , 2006, NIPS.

[7]  Yue Gao,et al.  Multimedia Social Event Detection in Microblog , 2015, MMM.

[8]  Bernd Resch,et al.  Spatial crime distribution and prediction for sporting events using social media , 2020, Int. J. Geogr. Inf. Sci..

[9]  Ebrahim Bagheri,et al.  Event Identification in Social Networks , 2016, Encycl. Semantic Comput. Robotic Intell..

[10]  Vincent A. Schmidt,et al.  Early detection of heterogeneous disaster events using social media , 2019, J. Assoc. Inf. Sci. Technol..

[11]  Shady Elbassuoni,et al.  Practical extraction of disaster-relevant information from social media , 2013, WWW.

[12]  Nicola Conci,et al.  Natural disasters detection in social media and satellite imagery: a survey , 2019, Multimedia Tools and Applications.

[13]  Yue Gao,et al.  Real-Time Multimedia Social Event Detection in Microblog , 2018, IEEE Transactions on Cybernetics.

[14]  Ajit Jain,et al.  Towards Accurate Event Detection in Social Media: A Weakly Supervised Approach for Learning Implicit Event Indicators , 2016, NUT@COLING.

[15]  Xiaomo Liu,et al.  Real-Time Novel Event Detection from Social Media , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[16]  Manas Gaur,et al.  Unsupervised Detection of Sub-events in Large Scale Disasters , 2019, AAAI.

[17]  Sadia Sharmin,et al.  Attention-based convolutional neural network for Bangla sentiment analysis , 2020, AI & SOCIETY.

[18]  Jeongkyu Lee,et al.  Event detection on large social media using temporal analysis , 2017, 2017 IEEE 7th Annual Computing and Communication Workshop and Conference (CCWC).

[19]  Tat-Seng Chua,et al.  From Tweets to Wellness: Wellness Event Detection from Twitter Streams , 2016, AAAI.

[20]  Sridhar Swaminathan,et al.  SportsBuzzer: Detecting Events at Real Time in Twitter using Incremental Clustering , 2018 .

[21]  Hao Lu,et al.  Wide-grained capsule network with sentence-level feature to detect meteorological event in social network , 2020, Future Gener. Comput. Syst..

[22]  Tabassum Ferdous Mumu,et al.  Depressed People Detection from Bangla Social Media Status using LSTM and CNN Approach , 2021 .

[23]  Md. Nazmus Sakib,et al.  A Machine Learning Approach to Predict Events by Analyzing Bengali Facebook Posts , 2020 .

[24]  Pete Burnap,et al.  Arabic Event Detection in Social Media , 2015, CICLing.

[25]  Rashid Mehmood,et al.  Automatic Event Detection in Smart Cities Using Big Data Analytics , 2017 .

[26]  Lu Liu,et al.  Event detection and identification of influential spreaders in social media data streams , 2018, Big Data Min. Anal..

[27]  Pengfei Wang,et al.  An algorithm for event detection based on social media data , 2017, Neurocomputing.

[28]  Kyoungsoo Bok,et al.  Local Event Detection Scheme by Analyzing Relevant Documents in Social Networks , 2020 .

[29]  A. Asif,et al.  Bangla hate speech detection on social media using attention-based recurrent neural network , 2021, J. Intell. Syst..

[30]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[31]  Edi Winarko,et al.  Event detection in social media: A survey , 2013, International Conference on ICT for Smart Society.

[32]  Zillur Rahman,et al.  Identifying and Categorizing Opinions Expressed in Bangla Sentences using Deep Learning Technique , 2020 .

[33]  Thomas Demeester,et al.  Sub-event detection from twitter streams as a sequence labeling problem , 2019, NAACL.

[34]  Rashid Mehmood,et al.  Sentiment Analysis of Arabic Tweets for Road Traffic Congestion and Event Detection , 2019, Smart Infrastructure and Applications.

[35]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[36]  Rashid Mehmood,et al.  Iktishaf: a Big Data Road-Traffic Event Detection Tool Using Twitter and Spark Machine Learning , 2020 .

[37]  Mateusz Fedoryszak,et al.  Real-time Event Detection on Social Data Streams , 2019, KDD.

[38]  Wenji Mao,et al.  Online event detection and tracking in social media based on neural similarity metric learning , 2017, 2017 IEEE International Conference on Intelligence and Security Informatics (ISI).

[39]  Wei Liang,et al.  Efficient Location-Based Event Detection in Social Text Streams , 2015, IScIDE.

[40]  Firoj Alam,et al.  Bangla Text Classification using Transformers , 2020, ArXiv.

[41]  Sivaji Bandyopadhyay,et al.  A simple approach for Monolingual Event Tracking system in Bengali , 2009, 2009 Eighth International Symposium on Natural Language Processing.

[42]  Dimitrios Gunopulos,et al.  Detecting Events in Online Social Networks: Definitions, Trends and Challenges , 2016, Solving Large Scale Learning Tasks.