A unified framework for image database clustering and content-based retrieval

With the proliferation of image data, the need to search and retrieve images efficiently and accurately from a large image database or a collection of image databases has drastically increased. To address such a demand, a unified framework called <i>Markov Model Mediators</i> (MMMs) is proposed in this paper to facilitate conceptual database clustering and to improve the query processing performance by analyzing the summarized knowledge. The unique characteristics of MMMs are that it provides the capabilities of exploring the affinity relations among the images at the database level and among the databases at the cluster level respectively, using an effective data mining process. At the database level, each database is modeled by an intra-database MMM which enables accurate image retrieval within the database. Then the conceptual database clustering is performed and cluster-level knowledge summarization is conducted to reduce the cost of retrieving images across the databases. This framework has been tested using a set of image databases, which contain various numbers of images with different dimensions and concept categories. The experimental results demonstrate that our framework achieves better retrieval accuracy via inter-cluster retrieval than that of intra-cluster retrieval with minimal extra effort.

[1]  Guojun Lu,et al.  Generic Fourier descriptor for shape-based image retrieval , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[2]  Bo Zhang,et al.  An effective region-based image retrieval framework , 2002, MULTIMEDIA '02.

[3]  Anil K. Jain,et al.  A self-organizing network for hyperellipsoidal clustering (HEC) , 1996, IEEE Trans. Neural Networks.

[4]  Min Chen,et al.  Affinity relation discovery in image database clustering and content-based retrieval , 2004, MULTIMEDIA '04.

[5]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[6]  Edward Y. Chang,et al.  Support vector machine active learning for image retrieval , 2001, MULTIMEDIA '01.

[7]  Chengcui Zhang,et al.  Multiple object retrieval for image databases using multiple instance learning and relevance feedback , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[8]  Keiji Yanai,et al.  Generic image classification using visual knowledge on the web , 2003, ACM Multimedia.

[9]  Michalis Vazirgiannis,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. On Clustering Validation Techniques , 2022 .

[10]  Stuart Harvey Rubin,et al.  Stochastic clustering for organizing distributed information sources , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[11]  Nozha Boujemaa,et al.  Image database clustering with SVM-based class personalization , 2003, IS&T/SPIE Electronic Imaging.

[12]  Guojun Lu,et al.  Enhanced Generic Fourier Descriptors for object-based image retrieval , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Donald Kossmann,et al.  The state of the art in distributed query processing , 2000, CSUR.

[14]  Rangasami L. Kashyap,et al.  A probabilistic-based mechanism for video database management systems , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[15]  Marco La Cascia,et al.  Unifying Textual and Visual Cues for Content-Based Image Retrieval on the World Wide Web , 1999, Comput. Vis. Image Underst..

[16]  Jitendra Malik,et al.  Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Masatoshi Yoshikawa,et al.  The A-tree: An Index Structure for High-Dimensional Spaces Using Relative Approximation , 2000, VLDB.

[18]  Choochart Haruechaiyasak,et al.  Mining user access behavior on the WWW , 2001, 2001 IEEE International Conference on Systems, Man and Cybernetics. e-Systems and e-Man for Cybernetics in Cyberspace (Cat.No.01CH37236).

[19]  J. Banerjee,et al.  Clustering a DAG for CAD Databases , 1988, IEEE Trans. Software Eng..

[20]  Minh N. Do,et al.  Rotation invariant texture characterization and retrieval using steerable wavelet-domain hidden Markov models , 2002, IEEE Trans. Multim..

[21]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[22]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[23]  Amarnath Gupta,et al.  Visual information retrieval , 1997, CACM.

[24]  Choochart Haruechaiyasak,et al.  Disjoint Web Document Clustering and Management in Electronic Commerce , 2001 .

[25]  Mei-Ling Shyu,et al.  Affinity-based probabilistic reasoning and document clustering on the WWW , 2000, Proceedings 24th Annual International Computer Software and Applications Conference. COMPSAC2000.

[26]  Cyrus Shahabi,et al.  Image retrieval by shape: a comparative study , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[27]  Deok-Hwan Kim,et al.  QCluster: relevance feedback using adaptive clustering for content-based image retrieval , 2003, SIGMOD '03.

[28]  Mario A. Nascimento,et al.  On “shapes” of colors for content-based image retrieval , 2000, MULTIMEDIA '00.

[29]  Andreas Girgensohn,et al.  Temporal event clustering for digital photo collections , 2003, ACM Multimedia.

[30]  Shu-Ching Chen,et al.  Organizing a network of databases using probabilistic reasoning , 2000, Smc 2000 conference proceedings. 2000 ieee international conference on systems, man and cybernetics. 'cybernetics evolving to systems, humans, organizations, and their complex interactions' (cat. no.0.

[31]  Gerd Stumme,et al.  Computing iceberg concept lattices with T , 2002, Data Knowl. Eng..

[32]  Kyuseok Shim,et al.  WALRUS: A Similarity Retrieval Algorithm for Image Databases , 2004, IEEE Trans. Knowl. Data Eng..

[33]  Hanqing Lu,et al.  A practical SVM-based algorithm for ordinal regression in image retrieval , 2003, MULTIMEDIA '03.

[34]  Guojun Lu,et al.  Techniques and data structures for efficient multimedia retrieval based on similarity , 2002, IEEE Trans. Multim..

[35]  Aidong Zhang,et al.  SemQuery: Semantic Clustering and Querying on Heterogeneous Features for Visual Data , 2002, IEEE Trans. Knowl. Data Eng..

[36]  Yixin Chen,et al.  A Region-Based Fuzzy Feature Matching Approach to Content-Based Image Retrieval , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Wenyin Liu,et al.  Joint semantics and feature based image retrieval using relevance feedback , 2003, IEEE Trans. Multim..

[38]  Min Chen,et al.  Image database retrieval utilizing affinity relationships , 2003, MMDB '03.