Coping with Web Knowledge

The web is arguably the largest information repository in existence. Extracting information from it has attracted the interest of many researchers, who have developed intelligent algorithms (wrappers) that automatically extract structured, syntactic information. In this article, we formalise a new solution for extracting knowledge from today's non-semantic web. It is novel in that it associates semantics with the extracted information, which improves agent interoperability; furthermore, it delegates the knowledge extraction procedure to specialist agents, which eases software development and promotes software reuse and maintainability.
