A Clickbait Detection Method on News Sites

The use of internet news sites increases day by day. The internet has gone beyond institutions and organizations to provide a different service and there have been attempts to provide services only through the internet and thus to earn money. It can also be a organization or an individual who opens an account on the social network and provides financial gain on these accounts. The financial gain on the internet is increasing in parallel with the number of people entering the site in general or the number of people reading the content on the site. Clickbait is a click technique in which a user manipulates the curiosity of a person in order to open more pages in a web site, usually by writing exaggerated and unreal headlines. In this study, headlines or subheadings for news were collected. In these news articles, the Clickbait headline identified by TF-IDF has summarized by looking at the content of the Clickbait news with text feature extraction based on ontology method. The summarized version has been shown to the user without having to click on the newsletter. In this study, news in 4 news sites with Turkish and English content were examined. This study is the first Turkish study about Clickbait detection. In the English tests, results have given in comparison with equivalent algorithms and explained in detail.

[1]  Naeemul Hassan,et al.  Diving Deep into Clickbaits: Who Use Them to What Extents in Which Topics with What Effects? , 2017, ASONAM.

[2]  Pierluigi Ritrovato,et al.  Abnormal Event Recognition: A Hybrid Approach Using SemanticWeb Technologies , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[3]  Timos K. Sellis,et al.  A Ranking Mechanism for SemanticWeb Service Discovery , 2007, 2007 IEEE Congress on Services (Services 2007).

[4]  Matthias Hagen,et al.  Clickbait Detection , 2016, ECIR.

[5]  Di Wu,et al.  URLSight: Profiling Mobile Users via Large-Scale Internet Metadata Analytics , 2016, 2016 IEEE Trustcom/BigDataSE/ISPA.

[6]  Amrita A. Manjrekar,et al.  A novel method to summarize and retrieve text documents using text feature extraction based on ontology , 2016, 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT).

[7]  Preslav Nakov,et al.  In Search of Credible News , 2016, AIMSA.

[8]  Dieter Fensel,et al.  Towards SemanticWeb Services in Grid Environments , 2005, 2005 First International Conference on Semantics, Knowledge and Grid.

[9]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[10]  Jukka Riekki,et al.  Enabling user-centered interactions in the Internet of Things , 2016, 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[11]  Barbara J. Grosz,et al.  Natural-Language Processing , 1982, Artificial Intelligence.

[12]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[13]  Tanmoy Chakraborty,et al.  We Used Neural Networks to Detect Clickbaits: You Won't Believe What Happened Next! , 2016, ECIR.

[14]  Maria das Graças Volpe Nunes,et al.  Summarizing Scientific Texts: Experiments with Extractive Summarizers , 2007, Seventh International Conference on Intelligent Systems Design and Applications (ISDA 2007).

[15]  Sebastian Thrun,et al.  Tracking-based semi-supervised learning , 2011, Int. J. Robotics Res..

[16]  Daoud M. Daoud,et al.  Clickbait Detection , 2018, ICSIE '18.

[17]  Wei Li,et al.  Summarising News with Texts and Pictures , 2014, 2014 10th International Conference on Semantics, Knowledge and Grids.

[18]  Pankaj Gupta,et al.  Summarizing text by ranking text units according to shallow linguistic features , 2011, 13th International Conference on Advanced Communication Technology (ICACT2011).

[19]  Mehmet Kaya,et al.  Mining quad closure patterns in Instagram , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[20]  Niloy Ganguly,et al.  Stop Clickbait: Detecting and preventing clickbaits in online news media , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[21]  K. G. Srinivasa,et al.  SentenceRank — A graph based approach to summarize text , 2014, The Fifth International Conference on the Applications of Digital Information and Web Technologies (ICADIWT 2014).

[22]  Mehmet Kaya,et al.  A Novel Method for Event Recommendation in Meetup , 2017, 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[23]  Lucila Ohno-Machado,et al.  Natural language processing: an introduction , 2011, J. Am. Medical Informatics Assoc..