Local Binary Patterns for Arabic Optical Font Recognition

Optical Font Recognition (OFR) has been proven to increase Optical Character Recognition (OCR) accuracy, but it can also help in harvesting semantic information from documents. It therefore becomes a part of many Document Image Analysis (DIA) pipelines. Our work is based on the hypothesis that Local Binary Patterns (LBP), as a generic texture classification method, can address several distinct DIA problems at the same time such as OFR, script detection, writer identification, etc. In this paper we strip down the Redundant Oriented LBP (RO-LBP) method, previously used in writer identification, and apply it for OFR with the goal of introducing a generic method that classifies text as oriented texture. We focus on Arabic OFR and try to perform a thorough comparison of our method and the leading Gaussian Mixture Model method that is developed specifically for the task. Depending on the nature of proposed OFR method, each method's performance is usually evaluated on different data and with different evaluation protocols. The proposed experimental procedure addresses this problem and allows us to compare OFR methods that are fundamentally different by adapting them to a common measurement protocol. In performed experiments LBP method achieves perfect results on large text blocks generated from the APTI database, while preserving its very broad generic attributes as proven by secondary experiments.

[1]  Rolf Ingold,et al.  Optical Font Recognition Using Typographical Features , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Lambert Schomaker,et al.  Towards Explainable Writer Verification and Identification Using Vantage Writers , 2007 .

[3]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[4]  Lambert Schomaker,et al.  Towards Explainable Writer Verification and Identification Using Vantage Writers , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[5]  Adel M. Alimi,et al.  A study on font-family and font-size recognition applied to Arabic word images at ultra-low resolution , 2013, Pattern Recognit. Lett..

[6]  C. L. Philip Chen,et al.  Arabic font recognition based on diacritics features , 2014, Pattern Recognition.

[7]  Adel M. Alimi,et al.  A New Arabic Printed Text Image Database and Evaluation Protocols , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[8]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  B. K. Julsing,et al.  Face Recognition with Local Binary Patterns , 2012 .

[10]  Xinge You,et al.  Offline Arabic Handwriting Identification Using Language Diacritics , 2010, 2010 20th International Conference on Pattern Recognition.

[11]  Miguel Angel Ferrer-Ballester,et al.  LBP Based Line-Wise Script Identification , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[12]  Antonio Torralba,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 80 Million Tiny Images: a Large Dataset for Non-parametric Object and Scene Recognition , 2022 .

[13]  Adel M. Alimi,et al.  Database and Evaluation Protocols for Arabic Printed Text Recognition , 2009 .

[14]  Matti Pietikäinen,et al.  Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Yaghoub Pourasad,et al.  Farsi Font Recognition Using Holes of Letters and Horizontal Projection Profile , 2011 .

[16]  Marcus Liwicki,et al.  Oriented Local Binary Patterns for Writer Identification , 2013, AFHA.

[17]  Khairuddin Omar,et al.  A novel statistical feature extraction method for textual images: Optical font recognition , 2012, Expert Syst. Appl..

[18]  Yuan Yan Tang,et al.  Wavelet Domain Local Binary Pattern Features For Writer Identification , 2010, 2010 20th International Conference on Pattern Recognition.

[19]  Adel M. Alimi,et al.  Impact of Character Models Choice on Arabic Text Recognition Performance , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.