Similarity-based ranking and query processing in multimedia databases

Abstract Since media-based evaluation yields similarity values, results to a multimedia database query, Q(Y1,…,Yn), is defined as an ordered list SQ of n-tuples of the form 〈X1,…,Xn〉. The query Q itself is composed of a set of fuzzy and crisp predicates, constants, variables, and conjunction, disjunction, and negation operators. Since many multimedia applications require partial matches, SQ includes results which do not satisfy all predicates. Due to the ranking and partial match requirements, traditional query processing techniques do not apply to multimedia databases. In this paper, we first focus on the problem of “given a multimedia query which consists of multiple fuzzy and crisp predicates, providing the user with a meaningful final ranking”. More specifically, we study the problem of merging similarity values in queries with multiple fuzzy predicates. We describe the essential multimedia retrieval semantics, compare these with the known approaches, and propose a semantics which captures the requirements of multimedia retrieval problem. We then build on these results in answering the related problem of “given a multimedia query which consists of multiple fuzzy and crisp predicates, finding an efficient way to process the query.” We develop an algorithm to efficiently process queries with unordered fuzzy predicates (sub-queries). Although this algorithm can work with different fuzzy semantics, it benefits from the statistical properties of the semantics proposed in this paper. We also present experimental results for evaluating the proposed algorithm in terms of quality of results and search space reduction.

[1]  Ronald Fagin,et al.  Combining Fuzzy Information from Multiple Systems , 1999, J. Comput. Syst. Sci..

[2]  V. S. Subrahmanian,et al.  A multi-similarity algebra , 1998, SIGMOD '98.

[3]  K. Selçuk Candan,et al.  Hierarchical Image Modeling for Object-Based Media Retrieval , 1998, Data Knowl. Eng..

[4]  Surajit Chaudhuri,et al.  Optimization of queries with user-defined predicates , 1996, TODS.

[5]  R. Yager SOME PROCEDURES FOR SELECTING FUZZY SET-THEORETIC OPERATORS , 1982 .

[6]  K. Selçuk Candan,et al.  Query caching and optimization in distributed mediator systems , 1996, SIGMOD '96.

[7]  Clement T. Yu,et al.  Priniples of Database Query Processing for Advanced Applications , 1997 .

[8]  Ronald Fagin,et al.  Incorporating User Preferences in Multimedia Queries , 1997, ICDT.

[9]  Sethuraman Panchanathan,et al.  Review of Image and Video Indexing Techniques , 1997, J. Vis. Commun. Image Represent..

[10]  Chad Carson,et al.  Optimizing queries over multimedia repositories , 1996, SIGMOD '96.

[11]  Clement T. Yu,et al.  Design, implementation and evaluation of SCORE (a system for content based retrieval of pictures) , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[12]  Clement T. Yu,et al.  Techniques and Systems for Image and Video Retrieval , 1999, IEEE Trans. Knowl. Data Eng..

[13]  K. Selçuk Candan,et al.  SEMCOG: a hybrid object-based image database system and its modeling, language, and query processing , 1998, Proceedings 14th International Conference on Data Engineering.

[14]  Ronald Fagin,et al.  Fuzzy queries in multimedia database systems , 1998, PODS '98.

[15]  Ronald Fagin,et al.  Allowing users to weight search terms , 2000, RIAO.

[16]  S. Gottwald,et al.  Fuzzy sets, fuzzy logic, fuzzy methods with applications , 1995 .

[17]  Jonathan Goldstein,et al.  When Is ''Nearest Neighbor'' Meaningful? , 1999, ICDT.

[18]  Atsuo Yoshitaka,et al.  A Survey on Content-Based Retrieval for Multimedia Databases , 1999, IEEE Trans. Knowl. Data Eng..

[19]  Sam Sung A Linear Transform Scheme for Combining Weights into Scores , 1998 .

[20]  Suh-Yin Lee,et al.  Similarity retrieval of iconic image database , 1989, Pattern Recognit..

[21]  Christos Faloutsos,et al.  Searching Multimedia Databases by Content , 1996, Advances in Database Systems.

[22]  K. Selçuk Candan,et al.  Facilitating Multimedia Database Exploration through Visual Interfaces and Perpetual Query Reformulations , 1997, VLDB.

[23]  John Yen,et al.  Fuzzy Logic - A Modern Perspective , 1999, IEEE Trans. Knowl. Data Eng..

[24]  Hiroshi Nakajima,et al.  Context-dependent interpretations of linguistic terms in fuzzy relational databases , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[25]  G. Zipf,et al.  Relative Frequency as a Determinant of Phonetic Change , 1930 .

[26]  Laks V. S. Lakshmanan,et al.  ProbView: a flexible probabilistic database system , 1997, TODS.

[27]  Oren Etzioni,et al.  Sound and Efficient Closed-World Reasoning for Planning , 1997, Artif. Intell..

[28]  King-Lup Liu,et al.  Similarity based Retrieval of Pictures Using Indices on Spatial Relationships , 1995, VLDB.

[29]  John Murphy,et al.  Using WordNet as a Knowledge Base for Measuring Semantic Similarity between Words , 1994 .

[30]  H. Zimmermann,et al.  On the suitability of minimum and product operators for the intersection of fuzzy sets , 1979 .