Data Resource Selection in Distributed Visual Information Systems

With the advances in multimedia databases and the popularization of the Internet, it is now possible to access large image and video repositories distributed throughout the world. One of the challenging problems in such access is how the information in the respective databases can be summarized to enable an intelligent selection of relevant database sites based on visual queries. This paper presents an approach to solve this problem based on image content-based indexing of a metadatabase at a query distribution server. The metadatabase records a summary of the visual content of the images in each database through image templates and statistical features characterizing the similarity distributions of the images. The selection of the databases is done by searching the metadatabase using a ranking algorithm that uses the query's similarity to a template and the features of the databases associated with the template. Two selection approaches, termed mean-based and histogram-based approaches, are presented. The database selection mechanisms have been implemented in a metaserver, and extensive experiments have been performed to demonstrate the effectiveness of the database selection approaches.

[1]  Aidong Zhang,et al.  Geographical image classification and retrieval , 1997, GIS '97.

[2]  Richard S. Marcus,et al.  An experimental comparison of the effectiveness of computers and humans as search intermediaries , 1983, J. Am. Soc. Inf. Sci..

[3]  Luis Gravano,et al.  Merging Ranks from Heterogeneous Internet Sources , 1997, VLDB.

[4]  Peter A. Lachenbruch,et al.  Classification: Methods for the Exploratory Analythi of Multivariate Data , 1982 .

[5]  Jiawei Han,et al.  Efficient and Effective Clustering Methods for Spatial Data Mining , 1994, VLDB.

[6]  Aidong Zhang,et al.  NetView: Integrating Large-Scale Distributed Visual Databases , 1998, IEEE Multim..

[7]  Aidong Zhang,et al.  Metadatabase and search agent for multimedia database access over Internet , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[8]  Luis Gravano,et al.  Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies , 1995, VLDB.

[9]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[10]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[11]  Ramesh C. Jain,et al.  A Visual Information Management System for the Interactive Retrieval of Faces , 1993, IEEE Trans. Knowl. Data Eng..

[12]  Aidong Zhang,et al.  Efficient resource selection in distributed visual information systems , 1997, MULTIMEDIA '97.

[13]  Brewster Kahle,et al.  An information system for corporate users: wide area information servers , 1991 .

[14]  Oliver A. McBryan,et al.  GENVL and WWWW: Tools for taming the Web , 1994, WWW Spring 1994.

[15]  Aidong Zhang,et al.  NetV iew: A Framework for Integration of Large-Scale Distributed Visual Databases , 1998 .

[16]  R. Ng,et al.  Eecient and Eeective Clustering Methods for Spatial Data Mining , 1994 .

[17]  Peter B. Danzig,et al.  Distributed Indexing of Autonomous Internet Services , 1992, Comput. Syst..

[18]  Kishor S. Trivedi Probability and Statistics with Reliability, Queuing, and Computer Science Applications , 1984 .

[19]  Aidong Zhang,et al.  Metadata for Distributed Visual Database Access , 1997 .

[20]  Ellen M. Voorhees,et al.  The Collection Fusion Problem , 1994, TREC.

[21]  A. D. Gordon,et al.  Classification : Methods for the Exploratory Analysis of Multivariate Data , 1981 .

[22]  Joon Ho Lee,et al.  Combining multiple evidence from different properties of weighting schemes , 1995, SIGIR '95.

[23]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[24]  W. Bruce Croft,et al.  Searching distributed collections with inference networks , 1995, SIGIR '95.

[25]  Raj Acharya,et al.  Color clustering techniques for color-content-based image retrieval from image databases , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[26]  Martijn Koster,et al.  ALIWEB - Archie-like Indexing in the WEB , 1994, Comput. Networks ISDN Syst..

[27]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[28]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[29]  Aidong Zhang,et al.  Approach to clustering large visual databases using wavelet transform , 1997, Electronic Imaging.