Token-Templates and Logic Programs for Intelligent Web Search

We present a general framework for information extraction from web pages based on a special wrapper language called token-templates. Using token-templates in conjunction with logic programs, we can reason about web page contents, search for and collect facts, and derive new facts from various web pages. We give a formal definition of the semantics of logic programs extended by token-templates and define a general answer-complete calculus for these extended programs. These methods and techniques are used to build intelligent mediators and web information systems.
