Content Based Indexing and Retrieval in a Digital Library of Arabic Scripts and Calligraphy

Due the cursive nature of the Arabic scripts automatic recognition of keywords using computers is very difficult. Content based indexing using textual, graphical and visual information combined provides a more realistic and practical approach to the problem of indexing large collection of calligraphic material. Starting with low level patter recognition and feature extraction techniques, graphical representations of the calligraphic material can be captured to form the low level indexing parameters. These parameters are then enhanced using textual and visual information provided by the users. Through visual feedback and visual interaction, recognized textual information can be used to enhance the indexing parameter and in return improve the retrieval of the calligraphic material. In this paper, we report an implementation of the system and show how visual feedback and visual interaction helps to improve the indexing parameters created using the low-level image feature extraction technologies.

[1]  Roland Wilson,et al.  Curve and corner extraction using the multiresolution Fourier transform , 1992 .

[2]  Tat-Seng Chua,et al.  Applying relevance feedback to a photo archival system , 1992, J. Inf. Sci..

[3]  Duncan Fyfe Gillies,et al.  A Parallel-Hierarchical Method For Grouping Line Segments Into Contours , 1990, Optics & Photonics.

[4]  Duncan Fyfe Gillies,et al.  Extracting contours by perceptual grouping , 1992, Image Vis. Comput..

[5]  Suliman Al-Hawamdeh,et al.  Nearest neighbour searching in a picture archive system , 1991 .

[6]  W. Bruce Croft,et al.  Using Probabilistic Models of Document Retrieval without Relevance Information , 1979, J. Documentation.

[7]  Subramaniam Ganesan,et al.  A robust Hough transform technique for description of multiple line segments in an image , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[8]  Dana H. Ballard,et al.  Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[9]  Hussein Almuallim,et al.  A Method of Recognition of Arabic Cursive Handwriting , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Adnan Amin,et al.  Hand-printed arabic character recognition system using an artificial network , 1996, Pattern Recognit..

[11]  J. Alison Noble,et al.  Finding Corners , 1988, Alvey Vision Conference.

[12]  M C. Maguire An analysis of ISLA and DEVIL—two experimental public information systems , 1984 .

[13]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[14]  Jose A. Ventura,et al.  Segmentation of Planar Curves into Straight-Line Segments and Elliptical Arcs , 1997, CVGIP Graph. Model. Image Process..