On authorship authentication of Arabic articles

In this work, we consider the authorship authentication problem, a historical problem in linguistics that has been made more difficult with the explosion of the Internet and the increase in the amount of unverified texts and hard to check claims posted online. We focus on the Arabic language for which this problem is still largely understudied despite its importance. We experiment with two classifiers and the results obtained so far are almost perfect.

[1]  F. Mosteller,et al.  Inference and Disputed Authorship: The Federalist , 1966 .

[2]  Rajarathnam Chandramouli,et al.  Author gender identification from text , 2011, Digit. Investig..

[3]  R. H. Baayen,et al.  An experiment in authorship attribution , 2002 .

[4]  Rehab Duwairi,et al.  Machine learning for Arabic text categorization , 2006, J. Assoc. Inf. Sci. Technol..

[5]  Dominique Estival,et al.  TAT: An Author Profiling Tool with Application to Arabic Emails , 2007, ALTA.

[6]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[7]  Vanessa Wei Feng,et al.  Changes in Style in Authors with Alzheimer's Disease , 2012 .

[8]  Luiz Eduardo Soares de Oliveira,et al.  Author Identification Using Compression Models , 2022 .

[9]  H. Sayoud,et al.  Authorship attribution of ancient texts written by ten arabic travelers using a SMO-SVM classifier , 2012, 2012 International Conference on Communications and Information Technology (ICCIT).

[10]  Derek Abbott,et al.  Who wrote the "Letter to the Hebrews"?: data mining for detection of text authorship , 2005, SPIE Micro + Nano Materials, Devices, and Applications.

[11]  María J. Somodevilla,et al.  H-Tree: A data structure for fast path-retrieval in rooted trees. , 2007 .

[12]  Moshe Koppel,et al.  Automatically Classifying Documents by Ideological and Organizational Affiliation , 2009, 2009 IEEE International Conference on Intelligence and Security Informatics.

[13]  Zachary Miller,et al.  Author Gender Prediction in an Email Stream Using Neural Networks , 2012 .

[14]  Rong Zheng,et al.  Authorship Analysis in Cybercrime Investigation , 2003, ISI.

[15]  Olivier de Vel,et al.  Mining E-mail Authorship , 2000 .

[16]  David Corne,et al.  Authorship Attribution in Arabic using a hybrid of evolutionary search and linear discriminant analysis , 2010, 2010 UK Workshop on Computational Intelligence (UKCI).

[17]  Daniel Jurafsky,et al.  Automatic Tagging of Arabic Text: From Raw Text to Base Phrase Chunks , 2004, NAACL.

[18]  Abdulmohsen Al-Thubaity,et al.  Automatic Arabic Text Classification , 2008 .

[19]  David W. Corne,et al.  Investigating hybrids of evolutionary search and linear discriminant analysis for authorship attribution , 2007, 2007 IEEE Congress on Evolutionary Computation.

[20]  Patrick Juola,et al.  Large-Scale Experiments in Authorship Attribution , 2012 .

[21]  Halim Sayoud,et al.  Author discrimination between the Holy Quran and Prophet's statements , 2012, Lit. Linguistic Comput..

[22]  Patrick Juola,et al.  Authorship Attribution , 2008, Found. Trends Inf. Retr..

[23]  Halim Sayoud,et al.  Authorship Attribution of Short Historical Arabic Texts Based on Lexical Features , 2013, 2013 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery.

[24]  Saleh Alsaleem,et al.  Automated Arabic Text Categorization Using SVM and NB , 2011, Int. Arab. J. e Technol..

[25]  Fouzi Harrag,et al.  Improving arabic text categorization using decision trees , 2009, 2009 First International Conference on Networked Digital Technologies.

[26]  Jon Oberlander,et al.  The Identity of Bloggers: Openness and Gender in Personal Weblogs , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[27]  Jennifer Widom,et al.  Exploiting hierarchical domain structure to compute similarity , 2003, TOIS.

[28]  Efstathios Stamatatos,et al.  A survey of modern authorship attribution methods , 2009, J. Assoc. Inf. Sci. Technol..

[29]  Riyad Al-Shalabi,et al.  A comparison of text-classification techniques applied to Arabic text , 2009, J. Assoc. Inf. Sci. Technol..

[30]  Hsinchun Chen,et al.  Applying authorship analysis to extremist-group Web forum messages , 2005, IEEE Intelligent Systems.

[31]  Hsinchun Chen,et al.  Applying Authorship Analysis to Arabic Web Content , 2005, ISI.

[32]  Fouzi Harrag,et al.  Stemming as a feature reduction technique for Arabic Text Categorization , 2011, 2011 10th International Symposium on Programming and Systems.

[33]  Efstathios Stamatatos,et al.  Author identification: Using text sampling to handle the class imbalance problem , 2008, Inf. Process. Manag..

[34]  Luiz Eduardo Soares de Oliveira,et al.  Author identification using writer-dependent and writer-independent strategies , 2008, SAC '08.

[35]  George M. Mohay,et al.  Mining e-mail content for author identification forensics , 2001, SGMD.

[36]  Alaa M. El-Halees,et al.  Arabic Text Classification Using Maximum Entropy , 2015 .

[37]  Abdelwadood Mesleh,et al.  Chi Square Feature Extraction Based Svms Arabic Language Text Categorization System , 2007 .

[38]  Jonathan H. Clark,et al.  An Algorithm for Identifying Authors Using Synonyms , 2007, Eighth Mexican International Conference on Current Trends in Computer Science (ENC 2007).