Hyperspectral Image Analysis for Writer Identification using Deep Learning

Handwriting is a behavioral characteristic of human beings that is one of the common idiosyncrasies utilized for litigation purposes. Writer identification is commonly used for forensic examination of questioned and specimen documents. Recent advancements in imaging and machine learning technologies have empowered the development of automated, intelligent and robust writer identification methods. Most of the existing methods based on human defined features and color imaging have limited performance in terms of accuracy and robustness. However, rich spectral information content obtained from hyperspectral imaging (HSI) and suitable spatio-spectral features extracted using deep learning can significantly enhance the performance of writer identification in terms of accuracy and robustness. In this paper, we propose a novel writer identification method in which spectral responses of text pixels in a hyperspectral document image are extracted and are fed to a Convolutional Neural Network (CNN) for writer classification. Different CNN architectures, hyperparameters, spatio-spectral formats, train-test ratios and inks are used to evaluate the performance of the proposed system on the UWA Writing Inks Hyperspectral Images (WIHSI) database and to select the most suitable set of parameters for writer identification. The findings of this work have opened a new arena in forensic document analysis for writer identification using HSI and deep learning.

[1]  Sargur N. Srihari,et al.  A statistical model for writer verification , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[2]  Michael Blumenstein,et al.  An Empirical Study on Writer Identification and Verification From Intra-Variable Individual Handwriting , 2017, IEEE Access.

[3]  Horst Bunke,et al.  Writer identification using text line based features , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[4]  Khurram Khurshid,et al.  Towards Automated Ink Mismatch Detection in Hyperspectral Document Images , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[5]  Ajmal S. Mian,et al.  Hyperspectral Imaging for Ink Mismatch Detection , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[6]  Ajmal S. Mian,et al.  Towards Automated Hyperspectral Document Image Analysis , 2013, AFHA.

[7]  Linjie Xing,et al.  DeepWriter: A Multi-stream Deep CNN for Text-Independent Writer Identification , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[8]  Ajmal S. Mian,et al.  Sparse Spatio-spectral Representation for Hyperspectral Image Super-resolution , 2014, ECCV.

[9]  Maria Fernanda Pimentel,et al.  Near infrared hyperspectral imaging for forensic analysis of document forgery. , 2014, The Analyst.

[10]  Ajmal S. Mian,et al.  Localized forgery detection in hyperspectral document images , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[11]  Sung-Hyuk Cha,et al.  Individuality of handwriting. , 2002, Journal of forensic sciences.

[12]  K. M. bin Abdl,et al.  Handwriting identification: a direction review , 2009, 2009 IEEE International Conference on Signal and Image Processing Applications.

[13]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[14]  Yassine Ruichek,et al.  Block wise local binary count for off-Line text-independent writer identification , 2018, Expert Syst. Appl..

[15]  Bipin Indurkhya,et al.  Text-independent writer identification using convolutional neural network , 2017, Pattern Recognit. Lett..

[16]  Matthias Bethge,et al.  Comparing deep neural networks against humans: object recognition when the signal gets weaker , 2017, ArXiv.

[17]  G. de Bruin,et al.  QUANTITATIVE HYPERSPECTRAL IMAGING OF HISTORICAL DOCUMENTS: TECHNIQUE AND APPLICATIONS , 2008 .

[18]  Robert Sablatnig,et al.  Learning Features for Writer Retrieval and Identification using Triplet CNNs , 2018, 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[19]  Lei Shu,et al.  Learning Spatial–Spectral Features for Hyperspectral Image Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[20]  Mohamed Cheriet,et al.  Constrained Energy Maximization and Self-Referencing Method for Invisible Ink Detection from Multispectral Historical Document Images , 2014, 2014 22nd International Conference on Pattern Recognition.

[21]  Muhammad Imran Razzak,et al.  Writer identification using machine learning approaches: a comprehensive review , 2018, Multimedia Tools and Applications.

[22]  Khurram Khurshid,et al.  A Spatio-Spectral Hybrid Convolutional Architecture for Hyperspectral Document Authentication , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[23]  Tao Li,et al.  Cube-CNN-SVM: A Novel Hyperspectral Image Classification Method , 2016, 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI).

[24]  Tieniu Tan,et al.  Font Recognition Based on Global Texture Analysis , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  A. Rafiee,et al.  Off-Line Writer Recognition for Farsi Text , 2007, 2007 Sixth Mexican International Conference on Artificial Intelligence, Special Session (MICAI).

[26]  Liren Zhang,et al.  The principle and application of hyperspectral imaging technology in detection of handwriting , 2017, 2017 9th International Conference on Advanced Infocomm Technology (ICAIT).

[27]  Luiz Eduardo Soares de Oliveira,et al.  Texture-based descriptors for writer identification and verification , 2013, Expert Syst. Appl..

[28]  Jianjun Lei,et al.  Joint spatial-spectral hyperspectral image classification based on convolutional neural network , 2020, Pattern Recognit. Lett..

[29]  Hamid Saeed Khan,et al.  Modern Trends in Hyperspectral Image Analysis: A Review , 2018, IEEE Access.

[30]  Khurram Khurshid,et al.  Automated Forgery Detection in Multispectral Document Images Using Fuzzy Clustering , 2018, 2018 13th IAPR International Workshop on Document Analysis Systems (DAS).

[31]  Khurram Khurshid,et al.  Deep learning for automated forgery detection in hyperspectral document images , 2018, J. Electronic Imaging.

[32]  David Doermann,et al.  Combining Local Features for Offline Writer Identification , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[33]  Hong Yan,et al.  Hyperspectral document image processing: Applications, challenges and future prospects , 2019, Pattern Recognit..

[34]  Lambert Schomaker,et al.  Deep Adaptive Learning for Writer Identification based on Single Handwritten Word Images , 2018, Pattern Recognit..

[35]  Lianwen Jin,et al.  DeepWriterID: An End-to-End Online Text-Independent Writer Identification System , 2015, IEEE Intelligent Systems.