A Survey on Authorship Profiling Techniques

Authorship analysis is a text analysis technique that is visualized mainly in three different techniques namely Authorship Profiling, Authorship Identification and Plagiarism Detection. In this paper a brief survey on the recent developments in the area of author profiling approaches were presented. Authorship Profiling is to ascertain various authors characteristics like age, gender, native country, native language, degree of education and personality traits by analyzing their writing styles. In recent times, Author Profiling is popular in the fields of forensic analysis, security and marketing. Based on the popularity of the Authorship Profiling problem, multiple solutions were proposed by various researchers across the globe. Several researchers used different types of features to identify the writing style characteristics of authors. The main focus of this survey is to predict the demographic features of authors such as gender, age and personality traits based on the text corpus written by

[1]  Thamar Solorio,et al.  A Simple Approach to Author Profiling in MapReduce , 2014, CLEF.

[2]  T. Raghunadha Reddy,et al.  Author Profiling: Predicting Gender and Age from Blogs, Reviews & Social Media , 2014 .

[3]  Moshe Koppel,et al.  Determining an author's native language by mining a text for errors , 2005, KDD '05.

[4]  Somnath Banerjee,et al.  Automatic Author Profiling Based on Linguistic and Stylistic Features Notebook for PAN at CLEF 2013 , 2013, CLEF.

[5]  Shlomo Argamon,et al.  Automatically profiling the author of an anonymous text , 2009, CACM.

[6]  José Palazzo Moreira de Oliveira,et al.  Exploring Information Retrieval Features for Author Profiling , 2014, CLEF.

[7]  Shlomo Argamon,et al.  Effects of Age and Gender on Blogging , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[8]  Vasudeva Varma,et al.  Author Profiling using LDA and Maximum Entropy Notebook for PAN at CLEF 2013 , 2013, CLEF.

[9]  José Carlos González,et al.  DAEDALUS at PAN 2014: Guessing Tweet Author's Gender and Age , 2014, CLEF.

[10]  Lamia Hadrich Belguith,et al.  Author Profiling Using Style-based Features Notebook for PAN at CLEF 2013 , 2013, CLEF.

[11]  Dominique Estival,et al.  Author Profiling for English and Arabic Emails , 2008 .

[12]  Fermín L. Cruz,et al.  ITALICA at PAN 2013: An Ensemble Learning Approach to Author Profiling Notebook for PAN at CLEF 2013 , 2013, CLEF.

[13]  Hugo Jair Escalante,et al.  Using Intra-Profile Information for Author Profiling , 2014, CLEF.

[14]  Son Bao Pham,et al.  Author Profiling for Vietnamese Blogs , 2009, 2009 International Conference on Asian Language Processing.

[15]  Vasudeva Varma,et al.  Author Profiling: Predicting Age and Gender from Blogs Notebook for PAN at CLEF 2013 , 2013, CLEF.

[16]  Lamia Hadrich Belguith,et al.  Machine Learning for Classifying Authors of Anonymous Tweets, Blogs and Reviews , 2014, CLEF.

[17]  Darnes Vilariño Ayala,et al.  Two Methodologies Applied to the Author Profiling Task , 2013, CLEF.

[18]  Martha-Alicia Rocha,et al.  Semantic-based Features for Author Profiling Identification: First insights Notebook for PAN at CLEF 2013 , 2013, CLEF.

[19]  José Palazzo Moreira de Oliveira,et al.  Using Simple Content Features for the Author Profiling Task Notebook for PAN at CLEF 2013 , 2013, CLEF.

[20]  Michal Meina,et al.  Ensemble-based Classification for Author Profiling Using Various Features Notebook for PAN at CLEF 2013 , 2013, CLEF.

[21]  Vrizlynn L. L. Thing,et al.  Content-centric Age and Gender Profiling Notebook for PAN at CLEF 2013 , 2013, CLEF.

[22]  Marie-Francine Moens,et al.  Age and Gender Identification in Social Media , 2014, CLEF.

[23]  Christopher Baker Proof of Concept Framework for Prediction , 2014, CLEF.

[24]  Shlomo Argamon,et al.  Automatically Categorizing Written Texts by Author Gender , 2002, Lit. Linguistic Comput..

[25]  Lee Gillam Readability for Author Profiling? Notebook for PAN at CLEF 2013 , 2013, CLEF.

[26]  Julia Baquero,et al.  Author Profiling Using Corpus Statistics, Lexicons and Stylistic Features Notebook for PAN at CLEF-2013 , 2013, CLEF.

[27]  Carl Vogel,et al.  Style-based Distance Features for Author Profiling Notebook for PAN at CLEF 2013 , 2013, CLEF.

[28]  Magdalena Jankowska,et al.  CNG Text Classification for Authorship Profiling Task Notebook for PAN at CLEF 2013 , 2013, CLEF.