Handwritten Indic Script Identification in Multi-Script Document Images: A Survey

Script identification is crucial for automating optical character recognition (OCR) in multi-script documents since OCRs are script-dependent. In this paper, we present a comprehensive survey of th...

[1]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Nibaran Das,et al.  An Approach for Automatic Indic Script Identification from Handwritten Document Images , 2015, ACSS.

[3]  Mita Nasipuri,et al.  Offline Script Identification from multilingual Indic-script documents: A state-of-the-art , 2015, Comput. Sci. Rev..

[4]  Jie Ding,et al.  Differential Between Oriental and European Scripts by Statistical Features , 1998, Int. J. Pattern Recognit. Artif. Intell..

[5]  Giuseppe Pirlo,et al.  Script Identification of Multi-Script Documents: A Survey , 2017, IEEE Access.

[6]  Gerhard Rigoll,et al.  Novel script line identification method for script normalization and feature extraction in on-line handwritten whiteboard note recognition , 2009, Pattern Recognit..

[7]  Subhadip Basu,et al.  CMATERdb1: a database of unconstrained handwritten Bangla and Bangla–English mixed script document image , 2011, International Journal on Document Analysis and Recognition (IJDAR).

[8]  Patrick Kelly,et al.  Script and language identification for handwritten document images , 1999, International Journal on Document Analysis and Recognition.

[9]  Sk Md Obaidullah,et al.  A System for Handwritten Script Identification From Indian Document , 2013 .

[10]  A. Lawrence Spitz,et al.  Determination of the Script and Language Content of Document Images , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Subhadip Basu,et al.  Benchmark databases of handwritten Bangla-Roman and Devanagari-Roman mixed-script document images , 2018, Multimedia Tools and Applications.

[12]  Ioannis Pratikakis,et al.  Text line and word segmentation of handwritten documents , 2009, Pattern Recognit..

[13]  Yi Li,et al.  Language identification for handwritten document images using a shape codebook , 2009, Pattern Recognit..

[14]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[15]  Alireza Alaei,et al.  A new scheme for unconstrained handwritten text-line segmentation , 2011, Pattern Recognit..

[16]  Anil K. Jain,et al.  Online handwritten script recognition , 2004 .

[17]  J. Mantas,et al.  An overview of character recognition methodologies , 1986, Pattern Recognit..

[18]  Nibaran Das,et al.  PHDIndic_11: page-level handwritten document image dataset of 11 official Indic scripts for script identification , 2017, Multimedia Tools and Applications.

[19]  Nibaran Das,et al.  Script Identification from Printed Indian Document Images and Performance Evaluation Using Different Classifiers , 2014, Appl. Comput. Intell. Soft Comput..

[20]  Mahantapas Kundu,et al.  A genetic algorithm based region sampling for selection of local features in handwritten digit recognition application , 2012, Appl. Soft Comput..

[21]  Debashis Ghosh,et al.  Script Recognition—A Review , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Nibaran Das,et al.  Numeral Script Identification from Handwritten Document Images , 2015 .

[23]  Ioannis Pratikakis,et al.  Text line detection in handwritten documents , 2008, Pattern Recognit..

[24]  Subhadip Basu,et al.  Text line extraction from multi-skewed handwritten documents , 2007, Pattern Recognit..

[25]  Nibaran Das,et al.  Convolution Based Technique for Indic Script Identification from Handwritten Document Images , 2015 .

[26]  Subhadip Basu,et al.  A novel framework for automatic sorting of postal documents with multi-script address blocks , 2010, Pattern Recognit..

[27]  B. V. Dhandra,et al.  Offline Handwritten Script Identification in Document Images , 2010 .