A temporal ensembling based semi-supervised ConvNet for the detection of fake news articles

Abstract Internet-based information circulation has given rise to the proliferation of fake and misleading contents, which has extreme hostile effects on individuals and humanity. Supervised artificial intelligence techniques require a huge amount of annotated data which is a time-consuming, expensive and laborious task as the speed and volume of social media news generation is very high. To counter this situation, we propose an innovative Convolutional Neural Network semi-supervised framework built on the self-ensembling concept to take leverage of the linguistic and stylometric information of annotated news articles, at the same time explore the hidden patterns in unlabelled data as well. Self-ensembling provides consensus predictions of the labels of unannotated data using previous epochs outputs of network-in-training. These accumulated ensemble predictions are supposed to be a better predictor for the unknown labels than the output of most recent training epoch, thus suitable to be used as a proxy for the labels of unannotated data. The uniqueness of the framework is that it ensembles all the outputs of previous training epochs of the neural network to use them as an unsupervised target for comparing them with current output prediction of unlabelled articles. The framework is validated with extensive experiments on three datasets for different proportions of labelled and unlabelled data. It can achieve highest 97.45 % fake news classification accuracy using 50% labelled articles on Fake News Data Kaggle dataset. Contemporary baseline methods are placed in juxtaposition with the proposed architecture which demonstrates the robustness of our work compared to the state-of-the-art.

[1]  Holger H. Hoos,et al.  A survey on semi-supervised learning , 2019, Machine Learning.

[2]  Norman W. Paton,et al.  Crowd-sourced Targeted Feedback Collection for Multi-Criteria Data Source Selection , 2018 .

[3]  Mona T. Diab,et al.  Rumor Detection and Classification for Twitter Data , 2015, ArXiv.

[4]  He Jiang,et al.  Combating Fake News , 2019, ACM Trans. Intell. Syst. Technol..

[5]  Filippo Menczer,et al.  Early detection of promoted campaigns on social media , 2017, EPJ Data Science.

[6]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[7]  Muhammad Al-Qurishi,et al.  A Credibility Analysis System for Assessing Information on Twitter , 2018, IEEE Transactions on Dependable and Secure Computing.

[8]  Jiliang Tang,et al.  Learning Hierarchical Discourse-level Structure for Fake News Detection , 2019, NAACL.

[9]  Huan Liu,et al.  FakeNewsNet: A Data Repository with News Content, Social Context, and Spatiotemporal Information for Studying Fake News on Social Media , 2018, Big Data.

[10]  Sabu M. Thampi,et al.  A nature - inspired approach based on Forest Fire model for modeling rumor propagation in social networks , 2019, J. Netw. Comput. Appl..

[11]  Hueiseok Lim,et al.  exBAKE: Automatic Fake News Detection Model Based on Bidirectional Encoder Representations from Transformers (BERT) , 2019, Applied Sciences.

[12]  Ponnurangam Kumaraguru,et al.  SpotFake: A Multi-modal Framework for Fake News Detection , 2019, 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM).

[13]  Antonio Scala,et al.  Polarization and Fake News , 2018, ACM Trans. Web.

[14]  Zion Tsz Ho Tse,et al.  Social Media's Initial Reaction to Information and Misinformation on Ebola, August 2014: Facts and Rumors , 2016, Public health reports.

[15]  Wassim El-Hajj,et al.  Assessing Arabic Weblog Credibility via Deep Co-learning , 2019, WANLP@ACL 2019.

[16]  Alice Patania,et al.  The shape of collaborations , 2017, EPJ Data Science.

[17]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[18]  Issa Traore,et al.  Detecting opinion spams and fake news using text classification , 2018, Secur. Priv..

[19]  Deepayan Bhowmik,et al.  Fake News Identification on Twitter with Hybrid CNN and RNN Models , 2018, SMSociety.

[20]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[21]  Maria Mahmood,et al.  WHITE STAG model: wise human interaction tracking and estimation (WHITE) using spatio-temporal and angular-geometric (STAG) descriptors , 2019, Multimedia Tools and Applications.

[22]  E. Borer,et al.  Soil net nitrogen mineralisation across global grasslands , 2019, Nature Communications.

[23]  Francesco Marcelloni,et al.  A survey on fake news and rumour detection techniques , 2019, Inf. Sci..

[24]  Yang Liu,et al.  FNED: A Deep Network for Fake News Early Detection on Social Media , 2020, ACM Trans. Inf. Syst..

[25]  Mahima Goel,et al.  Comparative Performance of Machine Learning Algorithms for Fake News Detection , 2019, ICACDS.

[26]  Melanie Freeze,et al.  Fake Claims of Fake News: Political Misinformation, Warnings, and the Tainted Truth Effect , 2020 .

[27]  Dinesh Kumar Vishwakarma,et al.  Fake news, rumor, information pollution in social media and web: A contemporary survey of state-of-the-arts, challenges and opportunities , 2020, Expert Syst. Appl..

[28]  H. Marshall,et al.  Post-Truth Politics in the UK's Brexit Referendum , 2018, New Perspectives.

[29]  Davide Cozzolino,et al.  Detection of GAN-Generated Fake Images over Social Networks , 2018, 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[30]  K. Jamieson,et al.  Intentions to use a novel Zika vaccine: the effects of misbeliefs about the MMR vaccine and perceptions about Zika , 2018, Journal of public health.

[31]  Evangelos E. Papalexakis,et al.  Semi-supervised Content-Based Detection of Misinformation via Tensor Embeddings , 2018, 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[32]  Hai Su,et al.  Graph temporal ensembling based semi-supervised convolutional neural network with noisy labels for histopathology image analysis , 2019, Medical Image Anal..

[33]  Ahmad Jalal,et al.  Vision-Based Human Activity Recognition System Using Depth Silhouettes: A Smart Home System for Monitoring the Residents , 2019, Journal of Electrical Engineering & Technology.

[34]  ZhangRichong,et al.  Pairwise Link Prediction Model for Out of Vocabulary Knowledge Base Entities , 2020 .

[35]  Pranav Bharadwaj,et al.  Fake News Detection with Semantic Features and Text Mining , 2019, International Journal on Natural Language Computing.

[36]  Xu Yong,et al.  Three-stage network for age estimation , 2019 .

[37]  Sibel Adali,et al.  Robust Fake News Detection Over Time and Attack , 2019, ACM Trans. Intell. Syst. Technol..

[38]  Ephraim Suhir 'Miracle–on–the–Hudson': quantitative aftermath , 2013 .

[39]  Dinesh Kumar Vishwakarma,et al.  Detection and veracity analysis of fake news via scrapping and authenticating the web search , 2019, Cognitive Systems Research.

[40]  Hernán A. Makse,et al.  CUNY Academic Works , 2022 .

[41]  Sachin Kumar,et al.  Fake news detection using deep learning models: A novel approach , 2019, Trans. Emerg. Telecommun. Technol..

[42]  Amir Nadeem,et al.  Human Actions Tracking and Recognition Based on Body Parts Detection via Artificial Neural Network , 2020, 2020 3rd International Conference on Advancements in Computational Sciences (ICACS).

[43]  Daijin Kim,et al.  Depth Images-based Human Detection, Tracking and Activity Recognition Using Spatiotemporal Features and Modified HMM , 2016 .

[44]  Oluwaseun Ajao,et al.  Sentiment Aware Fake News Detection on Online Social Networks , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[45]  Sinan Aral,et al.  The spread of true and false news online , 2018, Science.

[46]  Kim-Kwang Raymond Choo,et al.  Revisiting Semi-Supervised Learning for Online Deceptive Review Detection , 2017, IEEE Access.

[47]  Jintao Li,et al.  Exploiting Multi-domain Visual Information for Fake News Detection , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[48]  Yongdong Zhang,et al.  Multimodal Fusion with Recurrent Neural Networks for Rumor Detection on Microblogs , 2017, ACM Multimedia.

[49]  Daijin Kim,et al.  Robust human activity recognition from depth video using spatiotemporal multi-fused features , 2017, Pattern Recognit..

[50]  Benno Stein,et al.  A Stylometric Inquiry into Hyperpartisan and Fake News , 2017, ACL.

[51]  Kun Cao,et al.  A Survey of Hierarchical Energy Optimization for Mobile Edge Computing , 2020, ACM Comput. Surv..

[52]  Majid Ali Khan Quaid,et al.  Wearable sensors based human behavioral pattern recognition using statistical features and reweighted genetic algorithm , 2019, Multimedia Tools and Applications.