Archival image indexing with connectivity features using randomized masks

Connectivity properties capture a natural spatial feature of a binary image. Albeit they are easy to compute, more often than not, they alone fail to provide a good characterization of the image because of the fact that several different images may have the same connectivity features. Typically, other types of parameters, e.g., moments, are used to augment the connectivity feature for efficient indexing and retrieval purposes. In this work, an alternative approach is proposed. Instead of considering other diverse features, which are computationally intensive, only the connectivity features of the image are used iteratively using a novel concept of spatial masking. For this purpose, a greedy algorithm for constructing a spatial feature vector of variable length for a binary image, is proposed. The algorithm is based on XOR-ing the image bit-plane with a few pseudo-random synthetic masks, and its novelty lies in computing the feature vector iteratively, depending on the size and diversity of the image database. The classical Euler number and the two primary connectivity features from which it is derived, namely, the number of connected components and the number of holes, are used to finally generate a unique feature vector for each binary image in the database using a fuzzy membership function customized for the given database. The method is particularly suitable for large-sized image archives of a digital library, where each image contains one or more objects. It is found to converge within only three iterations for a postal stamp database consisting of 2598 images, and also for a logo database of 1034 images. A data structure called discrimination tree has been introduced for supporting efficient storage and indexing of the images using the above feature vector.

[1]  James C. Bezdek,et al.  Fuzzy models—What are they, and why? [Editorial] , 1993, IEEE Transactions on Fuzzy Systems.

[2]  Richard J. Prokop,et al.  A survey of moment-based techniques for unoccluded object representation and recognition , 1992, CVGIP Graph. Model. Image Process..

[3]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[4]  Tim J. Ellis,et al.  Image Difference Threshold Strategies and Shadow Detection , 1995, BMVC.

[5]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Other Conferences.

[6]  Christos Faloutsos,et al.  Efficient and effective Querying by Image Content , 1994, Journal of Intelligent Information Systems.

[7]  Bhargab B. Bhattacharya,et al.  Stacked Euler Vector (SERVE): A Gray-Tone Image Feature Based on Bit-Plane Augmentation , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  James Ze Wang,et al.  Wavelet-based image indexing techniques with partial sketch retrieval capability , 1997, Proceedings of ADL '97 Forum on Research and Technology. Advances in Digital Libraries.

[9]  Fuzzy Models-What Are They , and Why ? , 2004 .

[10]  C. A. Murthy,et al.  A pipeline architecture for computing the Euler number of a binary image , 2005, J. Syst. Archit..

[11]  Yasufumi Takama,et al.  Genetic algorithms for a family of image similarity models incorporated in the relevance feedback mechanism , 2003, Appl. Soft Comput..

[12]  Mark H. Overmars,et al.  The Design of Dynamic Data Structures , 1987, Lecture Notes in Computer Science.

[13]  Nozha Boujemaa,et al.  Embedding fuzzy logic in content based image retrieval , 2000, PeachFuzz 2000. 19th International Conference of the North American Fuzzy Information Processing Society - NAFIPS (Cat. No.00TH8500).

[14]  Sargur N. Srihari Document Image Understanding , 1986, FJCC.

[15]  Brijesh Verma,et al.  A fuzzy-neural approach for interpretation and fusion of colour and texture features for CBIR systems , 2004, Appl. Soft Comput..

[16]  Vassilis Anastassopoulos,et al.  The Euler feature vector , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[17]  Jiří Matas,et al.  Unconstrained licence plate and text localization and recognition , 2005, Proceedings. 2005 IEEE Intelligent Transportation Systems, 2005..

[18]  Fred S. Roberts,et al.  Applied Combinatorics , 1984 .

[19]  Minghua Chen,et al.  A fast algorithm to calculate the Euler number for binary images , 1988, Pattern Recognit. Lett..

[20]  Paul L. Rosin,et al.  Proceedings of the 13th British Machine Vision Conference , 2004, Image Vis. Comput..

[21]  Sethuraman Panchanathan,et al.  Image indexing using vector quantization , 1995, Electronic Imaging.

[22]  Mark de Berg,et al.  Computational geometry: algorithms and applications , 1997 .

[23]  C.-C. Jay Kuo,et al.  Wavelet-compressed image retrieval using successive approximation quantization (SAQ) features , 1997, Other Conferences.

[24]  William I. Grosky,et al.  Spatial similarity-based retrievals and image indexing by hierarchical decomposition , 1997, Proceedings of the 1997 International Database Engineering and Applications Symposium (Cat. No.97TB100166).

[25]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Azriel Rosenfeld,et al.  Digital geometry , 2002, JCIS.

[27]  Suh-Yin Lee,et al.  Retrieval of similar pictures on pictorial databases , 1991, Pattern Recognit..

[28]  C. A. Murthy,et al.  Euler vector for search and retrieval of gray-tone images , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[29]  Alex Pentland,et al.  Photobook: Content-based manipulation of image databases , 1996, International Journal of Computer Vision.

[30]  C. A. Murthy,et al.  Euler vector: a combinatorial signature for gray-tone images , 2002, Proceedings. International Conference on Information Technology: Coding and Computing.

[31]  David Salesin,et al.  Fast multiresolution image querying , 1995, SIGGRAPH.

[32]  B. Pogue,et al.  Image analysis for discrimination of cervical neoplasia. , 2000, Journal of biomedical optics.

[33]  Toshikazu Kato,et al.  Query by Visual Example - Content based Image Retrieval , 1992, EDBT.

[34]  Nabil R. Adam Proceedings of the IEEE international forum on Research and technology advances in digital libraries , 1997 .

[35]  Partha Bhowmick,et al.  CONFERM: Connectivity Features with Randomized Masks and Their Applications to Image Indexing , 2004, ICVGIP.

[36]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.