Improving handwriting based gender classification using ensemble classifiers

A system to predict gender from images of handwriting using textural descriptors. Multiple classifiers to discriminate male and female writings. Classifiers combined using bagging, voting and stacking techniques. Generic and script-independent approach applied to English and Arabic handwriting. Improved results on the QUWI database when compared to state-of-the-art methods.

This paper presents a system to predict the gender of individuals from offline handwriting samples. The technique relies on extracting a set of textural features from handwriting samples of male and female writers and training multiple classifiers to discriminate between the two gender classes. The features include local binary patterns (LBP), histograms of oriented gradients (HOG), statistics computed from gray-level co-occurrence matrices (GLCM) and features extracted through segmentation-based fractal texture analysis (SFTA). For classification, we employ artificial neural networks (ANN), support vector machines (SVM), nearest neighbor classifiers (NN), decision trees (DT) and random forests (RF). The classifiers are then combined using bagging, voting and stacking techniques to enhance the overall system performance. The realized classification rates are significantly better than those of state-of-the-art systems on this problem, validating the ideas put forward in this study.
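As a rough illustration of two of the ingredients named above, the following sketch computes a basic 3x3 LBP histogram from a gray-level image and combines class labels from several classifiers by plain majority voting. It is a minimal sketch under stated assumptions, not the system described here: the function names `lbp_histogram` and `majority_vote` are hypothetical, the LBP variant is the simplest 8-neighbour form rather than any particular configuration used in the study, and the voting rule is unweighted hard voting.

```python
from collections import Counter


def lbp_histogram(image):
    """Return a normalized 256-bin histogram of basic 3x3 LBP codes.

    `image` is a list of equal-length rows of integer gray levels.
    (Hypothetical helper for illustration; border pixels are skipped.)
    """
    # Neighbour offsets in clockwise order, starting at the top-left pixel.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    hist = [0] * 256
    rows, cols = len(image), len(image[0])
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            center = image[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(offsets):
                # Set the bit when the neighbour is at least as bright
                # as the center pixel.
                if image[r + dr][c + dc] >= center:
                    code |= 1 << bit
            hist[code] += 1
    total = sum(hist)
    # Normalize so that images of different sizes are comparable.
    return [h / total for h in hist] if total else hist


def majority_vote(predictions):
    """Combine labels predicted by several classifiers (assumed rule:
    unweighted hard voting; ties are not handled specially)."""
    return Counter(predictions).most_common(1)[0][0]


if __name__ == "__main__":
    flat = [[5, 5, 5], [5, 5, 5], [5, 5, 5]]
    print(lbp_histogram(flat)[255])          # all neighbours >= center
    print(majority_vote(["female", "male", "female"]))
```

In a fuller pipeline, the histogram would serve as one feature vector per handwriting image, and each classifier's predicted label would be passed to the voting step; bagging and stacking would replace the simple vote with resampled or learned combinations.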
