Automatic Derivation of On-line Document Ontologies

This paper describes a method for constructing an ontology which will represent the set of web pages on a specified site. We are developing a technique that will extract knowledge from digital sources, create ontologies containing reusable knowledge to be shared with software agents, and present a view of this knowledge to users. This method will provide a solution to the problem of classifying information and supporting mechanisms that explore its structure, as well as allowing knowledge to be extracted and shared with other software agents.

[1]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[2]  Nicola Guarino,et al.  OntoSeek: content-based access to the Web , 1999, IEEE Intell. Syst..

[3]  Terence R. Smith,et al.  Browsing large digital library collections using classification hierarchies , 1999, CIKM '99.

[4]  Jenifer S. McCormack,et al.  Harnessing agent technologies for data mining and knowledge discovery , 2000, SPIE Defense + Commercial Sensing.

[5]  Gerard Salton,et al.  Automatic Information Organization And Retrieval , 1968 .

[6]  Lawrence B. Holder,et al.  Cover story: structural Web search using a graph-based discovery system , 2001, INTL.

[7]  Andreas Rauber,et al.  "'Andreas, Rauber'? Conference pages are over there, German documents on the lower left...": an "old-fashioned" approach to Web search results visualization , 2000, Proceedings 11th International Workshop on Database and Expert Systems Applications.

[8]  Dieter Merkl Document Classification with Self-Organizing Maps , 1999 .

[9]  Ronald J. Brachman,et al.  What IS-A Is and Isn't: An Analysis of Taxonomic Links in Semantic Networks , 1983, Computer.

[10]  Rafael Berlanga Llavori,et al.  Gathering metadata from Web-based repositories of historical publications , 1998, Proceedings Ninth International Workshop on Database and Expert Systems Applications (Cat. No.98EX130).

[11]  Andreas Rauber,et al.  parSOM: Using Parallelism to Overcome Memory Latency in Self-Organizing Neural Networks , 2000, HPCN Europe.

[12]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[13]  Samuel Kaski,et al.  Self organization of a massive text document collection , 1999 .

[14]  Andreas Rauber,et al.  The SOMLib Digital Library System , 1999, ECDL.

[15]  Aldo Gangemi,et al.  Ontology integration: Experiences with medical terminologies , 1998 .

[16]  Michael Uschold,et al.  Ontologies: principles, methods and applications , 1996, The Knowledge Engineering Review.