Arabic Font Recognition Based on Templates

We present an algorithm for a priori Arabic optical Font Recognition (AFR). First, words in the training set of documents for each font are segmented into symbols that are rescaled. Next, templates are constructed, where every new training symbol that is not similar to existing templates is a new template. Templates are sharable between fonts. To classify the font of a word, its symbols are matched to the templates and the fonts of the best matching templates are retained. The most frequent font is the word font.

[1]  I. S. I. Abuhaiba Discrete Script or Cursive Language Identification from Document Images , 2004 .

[2]  Tieniu Tan,et al.  Font Recognition Based on Global Texture Analysis , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Theodosios Pavlidis,et al.  Font recognition and contextual processing for more accurate text recognition , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[4]  I. S. I. Abuhaiba,et al.  Skew Correction of Textural Documents , 2003, J. King Saud Univ. Comput. Inf. Sci..

[5]  Alessandro Sperduti,et al.  Optical font recognition for multi-font OCR and document processing , 1999, Proceedings. Tenth International Workshop on Database and Expert Systems Applications. DEXA 99.

[6]  Rolf Ingold,et al.  Optical Font Recognition Using Typographical Features , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Rolf Ingold,et al.  A study of document image degradation effects on font recognition , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.