Multilayer annotations in Parmenides

Much of the impetus behind the Semantic Web movement comes from the observation that existing NLP tools are neither sophisticated nor efficient enough to process the full richness of natural language, so machine-understandable annotations must be added to Web resources to make them accessible to remote agents. However, when the target application need not handle a huge number of documents, only more limited sets, it is both conceivable and practical to use NLP tools to pre-process textual documents and generate annotations automatically (to be verified by human editors). We discuss an approach to this problem that combines several natural language processing techniques. Documents are analyzed fully automatically and converted into a semantic annotation, which can then be stored together with the original documents. It is this annotation that constitutes the machine-understandable resource that remote agents can query.
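As a rough illustration of the idea of storing an annotation alongside the original text and letting an agent query it, the sketch below keeps annotations as character-offset spans grouped into named layers. It is only a minimal sketch under stated assumptions: the class names, the layer names ("token", "entity") and the toy analyzer are hypothetical and are not the annotation scheme or the NLP tools used in Parmenides.

```python
import re
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Annotation:
    """One stand-off annotation: a labelled character span in a named layer."""
    layer: str   # hypothetical layer name, e.g. "token" or "entity"
    start: int   # offset of the first character of the span
    end: int     # offset one past the last character of the span
    label: str


@dataclass
class AnnotatedDocument:
    """The original text stored together with its annotations."""
    text: str
    annotations: list = field(default_factory=list)

    def annotate(self, layer, start, end, label):
        self.annotations.append(Annotation(layer, start, end, label))

    def query(self, layer=None, label=None):
        """Return (annotation, covered text) pairs matching the given filters."""
        return [
            (a, self.text[a.start:a.end])
            for a in self.annotations
            if (layer is None or a.layer == layer)
            and (label is None or a.label == label)
        ]


def toy_analyze(text):
    """Stand-in for a real NLP pipeline (an assumption, not the paper's tools).

    Every word becomes a "token" annotation; a capitalised word that is not
    at the very start of the text is also marked as a candidate name.
    """
    doc = AnnotatedDocument(text)
    for match in re.finditer(r"\w+", text):
        doc.annotate("token", match.start(), match.end(), "word")
        if match.start() > 0 and match.group()[0].isupper():
            doc.annotate("entity", match.start(), match.end(), "candidate-name")
    return doc


doc = toy_analyze("The agent queried Parmenides yesterday.")
entity_spans = [covered for _, covered in doc.query(layer="entity")]
```

Because the annotations only point into the text by offset, the original document is left untouched, and a human editor could correct or delete individual annotations before they are exposed to querying agents.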
