Locating Web information using Web checkpoints

Conventional search engines locate information by letting users establish a single Web checkpoint. By specifying one or more keywords, users direct search engines to return a set of documents that contain those keywords. From the documents (links) returned by search engines, user proceed to further probe the WWW from there. Hence, these initial set of documents (contingent upon the occurrence of keyword(s)) serve as a Web checkpoint. Generally, these links are numerous and may not result in much fruitful searches. By establishing multiple Web checkpoints, a richer and controllable search procedure can be constructed to obtain more relevant Web information. This paper presents the design and implementation of permitting multiple checkpoints to facilitate improved searching on the WWW. Web checkpointing is performed as part of the WHOWEDA project.

[1]  Jennifer Widom,et al.  The Lorel query language for semistructured data , 1997, International Journal on Digital Libraries.

[2]  Dan Suciu,et al.  A query language and optimization techniques for unstructured data , 1996, SIGMOD '96.

[3]  Sourav S. Bhowmick,et al.  Web warehousing: an algebra for web information , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[4]  Sourav S. Bhowmick,et al.  Web Bags - Are They Useful in A Web Warehouse? , 1998, FODO.

[5]  Alberto O. Mendelzon,et al.  Querying the World Wide Web , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[6]  Sourav S. Bhowmick,et al.  Information Coupling in Web Databases , 1998, ER.

[7]  David Konopnicki,et al.  Information gathering in the World-Wide Web: the W3QL query language and the W3QS system , 1998, TODS.

[8]  David Konopnicki,et al.  W3QS: A Query System for the World-Wide Web , 1995, VLDB.

[9]  Sourav S. Bhowmick,et al.  Join Processing in Web Databases , 1998, DEXA.

[10]  Ee-Peng Lim,et al.  /spl Pi/-web join in a web warehouse , 1999, Proceedings. 6th International Conference on Advanced Systems for Advanced Applications.

[11]  G. Moerkotte,et al.  RAW : a Relational Algebra for the Web , 1997 .