Author recognition by Abstract Feature Extraction

The purpose of this study is to show the success of Abstract Feature Extraction Method in multi dimensional feature vectors studies. Author recognition study is taken as an application area and word root and 2 gram's are chosen as feature vectors. The success of the Abstract Feature Extraction method in classification is shown on both Turkish and English data sets by comparing with feature extraction methods such as PCA, CFS, chi-square.

[1]  Efstathios Stamatatos,et al.  Computer-Based Authorship Attribution Without Lexical Measures , 2001, Comput. Humanit..

[2]  Efstathios Stamatatos,et al.  Automatic Text Categorization In Terms Of Genre and Author , 2000, CL.

[3]  Yuan-Fang Wang,et al.  The use of bigrams to enhance text categorization , 2002, Inf. Process. Manag..

[4]  Banu Diri,et al.  Impact of a New Attribute Extraction Algorithm on Web Page Classification , 2009, DMIN.

[5]  Banu Diri,et al.  Automatic Author Detection for Turkish Texts , 2003 .

[6]  J. F. Burrows,et al.  Not Unles You Ask Nicely: The Interpretative Nexus Between Analysis and Information , 1992 .

[7]  A. Q. Morton The Authorship of Greek Prose , 1965 .

[8]  Barron Brainerd Weighting Evidence in Language and Literature: A Statistical Approach , 1974 .

[9]  Johannes Fürnkranz,et al.  A Study Using $n$-gram Features for Text Categorization , 1998 .

[10]  Frederick Mosteller,et al.  Applied Bayesian and classical inference : the case of the Federalist papers , 1984 .

[11]  Murat Can Ganiz,et al.  Analysis of preprocessing methods on classification of Turkish texts , 2011, 2011 International Symposium on Innovations in Intelligent Systems and Applications.

[12]  Paul Bratley,et al.  Computers and the Humanities , 1978, Computer.

[13]  I.N. Bozkurt,et al.  Authorship attribution , 2007, 2007 22nd international symposium on computer and information sciences.

[14]  Banu Diri,et al.  A new method for attribute extraction with application on text classification , 2009, 2009 Fifth International Conference on Soft Computing, Computing with Words and Perceptions in System Analysis, Decision and Control.

[15]  R. Harald Baayen,et al.  How Variable May a Constant be? Measures of Lexical Richness in Perspective , 1998, Comput. Humanit..

[16]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.