Two-Way Feature Extraction Using Sequential and Multimodal Approach for Hateful Meme Classification

Millions of memes are created and shared every day on social media platforms. Memes are a great tool to spread humour. However, some people use it to target an individual or a group generating offensive content in a polite and sarcastic way. Lack of moderation of such memes spreads hatred and can lead to depression like psychological conditions. Many successful studies related to analysis of language such as sentiment analysis and analysis of images such as image classification have been performed. However, most of these studies rely only upon either one of these components. As classifying meme is one problem which cannot be solved by relying upon only any one of these aspects, the present work identifies, addresses, and ensembles both the aspects for analyzing such data. In this research, we propose a solution to the problems in which the classification depends on more than one model. This paper proposes two different approaches to solve the problem of identifying hate memes. The first approach uses sentiment analysis based on image captioning and text written on the meme. The second approach is to combine features from different modalities. These approaches utilize a combination of glove, encoder-decoder, and OCR with Adamax optimizer deep learning algorithms. Facebook Challenge Hateful Meme Dataset is utilized which contains approximately 8500 meme images. Both the approaches are implemented on the live challenge competition by Facebook and predicted quite acceptable results. Both approaches are tested on the validation dataset, and results are found to be promising for both models.

[1]  Soumya K. Ghosh,et al.  Optical Character Recognition Systems , 2017 .

[2]  Selim Akyokus,et al.  Deep Learning- and Word Embedding-Based Heterogeneous Classifier Ensembles for Text Classification , 2018, Complex..

[3]  Travis Wiens,et al.  Engine Speed Reduction for Hydraulic Machinery Using Predictive Algorithms , 2019, International Journal of Hydromechatronics.

[4]  Yanni Liu,et al.  Research on Sentiment Tendency and Evolution of Public Opinions in Social Networks of Smart City , 2020, Complex..

[5]  Vikram Singh,et al.  AN EFFECTIVE TOKENIZATION ALGORITHM FOR INFORMATION RETRIEVAL SYSTEMS , 2014 .

[6]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Tong Lu,et al.  Graphology based handwritten character analysis for human behaviour identification , 2020, CAAI Trans. Intell. Technol..

[8]  Jiebo Luo,et al.  Image Captioning with Semantic Attention , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Frank Puppe,et al.  Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition , 2018, Digit. Humanit. Q..

[10]  Zhijie Xia,et al.  Research on Sentiment Classification Algorithms on Online Review , 2020, Complex..

[11]  Anu Mehra,et al.  A Robust Real Time Object Detection and Recognition Algorithm for Multiple Objects , 2019 .

[12]  Thomas S. Huang,et al.  How deep neural networks can improve emotion recognition on video data , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[13]  Dan Yang,et al.  Topic Modeling Based Image Clustering by Events in Social Media , 2016, Sci. Program..

[14]  Kyungho Lee,et al.  Detecting Potential Insider Threat: Analyzing Insiders' Sentiment Exposed in Social Media , 2018, Secur. Commun. Networks.

[15]  Hechun Yu,et al.  Study on the dynamic and static characteristics of gas static thrust bearing with micro-hole restrictors , 2019, International Journal of Hydromechatronics.

[16]  M Harshali,et al.  Image classification using Deep learning , 2018 .

[17]  Christopher Joseph Pal,et al.  EmoNets: Multimodal deep learning approaches for emotion recognition in video , 2015, Journal on Multimodal User Interfaces.

[18]  Bhupendra Gupta,et al.  Visibility improvement and mass segmentation of mammogram images using quantile separated histogram equalisation with local contrast enhancement , 2019, CAAI Trans. Intell. Technol..

[19]  Lei Zhang,et al.  Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[20]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[21]  Guesh Dagnew,et al.  Deep learning approach for microarray cancer data classification , 2020, CAAI Trans. Intell. Technol..

[22]  Herman Martins Gomes,et al.  Hybrid Parallel Cascade Classifier Training for Object Detection , 2014, CIARP.

[23]  Xiuzhen Zhang,et al.  Social Big Data: Mining, Applications, and Beyond , 2019, Complex..

[24]  Paul A. Viola,et al.  Robust Real-time Object Detection , 2001 .

[25]  Hong Chen,et al.  Hybrid Depth-Separable Residual Networks for Hyperspectral Image Classification , 2020, Complex..

[26]  Jürgen Weber,et al.  Analytical analysis of single-stage pressure relief valves , 2019, International Journal of Hydromechatronics.