Semantic caching via query matching for web sources

A semantic caching scheme suitable for wrappers around web sources is presented. Since web sources typically have weaker querying capabilities than conventional databases, existing semantic caching schemes cannot be applied directly. A seamlessly integrated query translation and capability mapping between the wrappers and web sources in semantic caching is described. In addition, an analysis of the match types between the user's input query and the cached queries is presented. Semantic knowledge acquired from the data can be used to avoid unnecessary access to the web sources by turning a cache miss into a cache hit. A polynomial-time algorithm based on the proposed query matching technique is presented to find the best-matched query in the cache. Experimental results demonstrate the effectiveness of the proposed semantic caching scheme.
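To make the idea of match types concrete, the following is a minimal sketch and not the paper's algorithm. It assumes that every query is a conjunction of point predicates (attribute = value), represented as a Python dict. The match-type names, the ranking among them, and the helper names `classify` and `best_match` are all hypothetical choices made for illustration.

```python
from enum import IntEnum


class Match(IntEnum):
    """Hypothetical match types, ordered from least to most useful."""
    DISJOINT = 0     # query and cached query share no answers
    OVERLAPPING = 1  # answers may partially overlap
    CONTAINING = 2   # cached answers are a subset of the query's answers
    CONTAINED = 3    # query's answers are a subset of the cached answers
    EXACT = 4        # identical predicates


def classify(query, cached):
    """Classify how `query` relates to `cached` (both attr -> value dicts)."""
    q, c = set(query.items()), set(cached.items())
    if q == c:
        return Match.EXACT
    if c < q:
        # The query adds predicates, so it can be answered by
        # filtering the cached result locally.
        return Match.CONTAINED
    if q < c:
        # The cached query is more restrictive: only a partial answer.
        return Match.CONTAINING
    # Same attribute bound to different values: no tuple satisfies both.
    if any(a in cached and cached[a] != v for a, v in query.items()):
        return Match.DISJOINT
    return Match.OVERLAPPING


def best_match(query, cache):
    """Linear scan over the cache; ties are broken by shared predicates."""
    best = None
    for cached in cache:
        kind = classify(query, cached)
        shared = len(set(query.items()) & set(cached.items()))
        key = (kind, shared)
        if best is None or key > best[0]:
            best = (key, cached, kind)
    if best is None:
        return None, Match.DISJOINT
    return best[1], best[2]
```

Under these assumptions, an exact or contained match can be served entirely from the cache, while a containing or overlapping match would still need the web source for the remaining answers. The scan does a constant number of set operations per cached query, so it runs in polynomial time in the size of the cache and the queries.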