Fake News Detection in Facebook Using Machine Learning
暂无分享,去创建一个
This Project comes up with the applications of Random Forest techniques for detecting the 'fake news', that is, misleading news stories that comes from the non-reputable sources. Only by building a model based on a count vectorizer (using word tallies) or a (Term Frequency Inverse Document Frequency) tfidf matrix, (word tallies relative to how often they’re used in other articles in your dataset) can only get you so far. But these models do not consider the important qualities like word ordering and context. It is very possible that two articles that are similar in their word count will be completely different in their meaning. The data science community has responded by taking actions against the problem. There is a Kaggle competition called as the “Fake News Challenge” and Facebook is employing AI to filter fake news stories out of users’ feeds. Combatting the fake news is a classic text classification project with a straight forward proposition. Is it possible for you to build a model that can differentiate between “Real “news and “Fake” news? So a proposed work on assembling a dataset of both fake and real news and employ a Random Forest classifier in order to create a model to classify an article into fake or real based on its words and phrases. The main objective is to detect the fake news, which is a classic text classification problem with a straight forward proposition. It is needed to build a model that can differentiate between “Real” news and “Fake” news.
[1] Yimin Chen,et al. Automatic deception detection: Methods for finding fake news , 2015, ASIST.
[2] Carlo Strapparava,et al. The Lie Detector: Explorations in the Automatic Recognition of Deceptive Language , 2009, ACL.
[3] Paweł Cichosz,et al. Naïve Bayes classifier , 2015 .
[4] Ciro Cattuto,et al. Social spam detection , 2009, AIRWeb '09.