FakeBERT: Fake news detection in social media with a BERT-based deep learning approach

In the modern era of computing, the news ecosystem has transformed from old traditional print media to social media outlets. Social media platforms allow us to consume news much faster, with less restricted editing results in the spread of fake news at an incredible pace and scale. In recent researches, many useful methods for fake news detection employ sequential neural networks to encode news content and social context-level information where the text sequence was analyzed in a unidirectional way. Therefore, a bidirectional training approach is a priority for modelling the relevant information of fake news that is capable of improving the classification performance with the ability to capture semantic and long-distance dependencies in sentences. In this paper, we propose a BERT-based (Bidirectional Encoder Representations from Transformers) deep learning approach (FakeBERT) by combining different parallel blocks of the single-layer deep Convolutional Neural Network (CNN) having different kernel sizes and filters with the BERT. Such a combination is useful to handle ambiguity, which is the greatest challenge to natural language understanding. Classification results demonstrate that our proposed model (FakeBERT) outperforms the existing models with an accuracy of 98.90%.

[1]  Kai Shu Beyond News Contents: The Role of Social Context for Fake News Detection , 2018 .

[2]  G. Caldarelli,et al.  The spreading of misinformation online , 2016, Proceedings of the National Academy of Sciences.

[3]  William Yang Wang “Liar, Liar Pants on Fire”: A New Benchmark Dataset for Fake News Detection , 2017, ACL.

[4]  Arkaitz Zubiaga,et al.  SemEval-2017 Task 8: RumourEval: Determining rumour veracity and support for rumours , 2017, *SEMEVAL.

[5]  Yuanzhi Li,et al.  Convergence Analysis of Two-layer Neural Networks with ReLU Activation , 2017, NIPS.

[6]  Graham Neubig,et al.  When and Why Are Pre-Trained Word Embeddings Useful for Neural Machine Translation? , 2018, NAACL.

[7]  Bu-Sung Lee,et al.  Unsupervised rumor detection based on users' behaviors using neural networks , 2017, Pattern Recognit. Lett..

[8]  Neil Shah,et al.  False Information on Web and Social Media: A Survey , 2018, ArXiv.

[9]  Jürgen Schmidhuber,et al.  LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[10]  Verónica Pérez-Rosas,et al.  Automatic Detection of Fake News , 2017, COLING.

[11]  Reza Zafarani,et al.  Fake News: A Survey of Research, Detection Methods, and Opportunities , 2018, ArXiv.

[12]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[13]  Xiang Zhang,et al.  Character-level Convolutional Networks for Text Classification , 2015, NIPS.

[14]  Fan Yang,et al.  Attending Sentences to detect Satirical Fake News , 2018, COLING.

[15]  Luca Maria Gambardella,et al.  Max-pooling convolutional neural networks for vision-based hand gesture recognition , 2011, 2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA).

[16]  P. Vigneswara Ilavarasan,et al.  Detection of Spammers in Twitter marketing: A Hybrid Approach Using Social Media Analytics and Bio Inspired Computing , 2017, Information Systems Frontiers.

[17]  Takashi Matsubara,et al.  Neural Architecture Search for Convolutional Neural Networks with Attention , 2021, IEICE Trans. Inf. Syst..

[18]  Jiliang Tang,et al.  Multi-Source Multi-Class Fake News Detection , 2018, COLING.

[19]  Fan Yang,et al.  Automatic detection of rumor on Sina Weibo , 2012, MDS '12.

[20]  Soroush Vosoughi,et al.  Rumor Gauge , 2017, ACM Trans. Knowl. Discov. Data.

[21]  Yang Liu,et al.  Early Detection of Fake News on Social Media Through Propagation Path Classification with Recurrent and Convolutional Networks , 2018, AAAI.

[22]  Isha Ghosh,et al.  Automated Fake News Detection Using Linguistic Analy- sis and Machine Learning , 2017 .

[23]  Dong Yu,et al.  Feature engineering in Context-Dependent Deep Neural Networks for conversational speech transcription , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[24]  Sungyong Seo,et al.  CSI: A Hybrid Deep Model for Fake News Detection , 2017, CIKM.

[25]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[26]  Dipanjan Das,et al.  BERT Rediscovers the Classical NLP Pipeline , 2019, ACL.

[27]  No Value,et al.  Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics , 2000 .

[28]  LiakataMaria,et al.  Detection and Resolution of Rumours in Social Media , 2018 .

[29]  Huan Liu,et al.  FakeNewsNet: A Data Repository with News Content, Social Context, and Spatiotemporal Information for Studying Fake News on Social Media , 2018, Big Data.

[30]  Jiawei Han,et al.  Evaluating Event Credibility on Twitter , 2012, SDM.

[31]  Francesco Marcelloni,et al.  A survey on fake news and rumour detection techniques , 2019, Inf. Sci..

[32]  Pushpak Bhattacharyya,et al.  A Deep Ensemble Framework for Fake News Detection and Classification , 2018, ArXiv.

[33]  Georgios Evangelopoulos,et al.  The Language of Fake News: Opening the Black-Box of Deep Learning Based Detectors , 2018 .

[34]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[35]  Ladislav Lenc,et al.  On the effects of using word2vec representations in neural networks for dialogue act recognition , 2020, Comput. Speech Lang..

[36]  Huan Liu,et al.  dEFEND: Explainable Fake News Detection , 2019, KDD.

[37]  Eric P. Garcia,et al.  Surveying fake news: Assessing university faculty’s fragmented definition of fake news and its impact on teaching critical thinking , 2020, International Journal for Educational Integrity.

[38]  Fabio Crestani,et al.  The Role of Personality and Linguistic Patterns in Discriminating Between Fake News Spreaders and Fact Checkers , 2020, NLDB.

[39]  Bengt Muthén,et al.  Weighted Least Squares Estimation with Missing Data , 2010 .

[40]  Arkaitz Zubiaga,et al.  Detection and Resolution of Rumours in Social Media , 2017, ACM Comput. Surv..

[41]  Peter E.D. Love,et al.  Convolutional neural network: Deep learning-based classification of building quality problems , 2019, Adv. Eng. Informatics.

[42]  Andri Fachrur Rozie,et al.  Text Classification for Sentiment Prediction of Social Media Dataset using Multichannel Convolution Neural Network , 2018, 2018 International Conference on Computer, Control, Informatics and its Applications (IC3INA).

[43]  Issa Traoré,et al.  Detection of Online Fake News Using N-Gram Analysis and Machine Learning Techniques , 2017, ISDDC.

[44]  Paolo Rosso,et al.  Stance Detection in Fake News A Combined Feature Representation , 2018 .

[45]  Rohit Kumar Kaliyar,et al.  FNDNet – A deep convolutional neural network for fake news detection , 2020, Cognitive Systems Research.

[46]  Tiago A. Almeida,et al.  Contributions to the Study of Fake News in Portuguese: New Corpus and Automatic Detection Results , 2018, PROPOR.

[47]  Eunsol Choi,et al.  Truth of Varying Shades: Analyzing Language in Fake News and Political Fact-Checking , 2017, EMNLP.

[48]  Hueiseok Lim,et al.  exBAKE: Automatic Fake News Detection Model Based on Bidirectional Encoder Representations from Transformers (BERT) , 2019, Applied Sciences.

[49]  Chirag Shah,et al.  Towards automatic fake news classification , 2018 .

[50]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[51]  M. Gentzkow,et al.  Social Media and Fake News in the 2016 Election , 2017 .

[52]  Muhammad Abulaish,et al.  A Hybrid Approach for Detecting Automated Spammers in Twitter , 2018, IEEE Transactions on Information Forensics and Security.

[53]  Kevin Driscoll,et al.  The diffusion of misinformation on social media: Temporal pattern, message, and source , 2018, Comput. Hum. Behav..

[54]  Robert K. Brayton,et al.  Retiming and resynthesis: optimizing sequential networks with combinational techniques , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..