Semantic annotation for knowledge management: Requirements and a survey of the state of the art

While much of a company's knowledge can be found in text repositories, current content management systems have limited capabilities for structuring and interpreting documents. In the emerging Semantic Web, search, interpretation and aggregation can be addressed by ontology-based semantic mark-up. In this paper, we examine semantic annotation, identify a number of requirements, and review the current generation of semantic annotation systems. This analysis shows that, while there is still some way to go before semantic annotation tools will be able to address fully all the knowledge management needs, research in the area is active and making good progress.

[1]  Alexiei Dingli,et al.  User-System Cooperation in Document Annotation Based on Information Extraction , 2002, EKAW.

[2]  Marja-Riitta Koivunen Annotea and Semantic Web Supported Collaboration , 2005 .

[3]  Arthur Stutt,et al.  MnM: A Tool for Automatic Support on Semantic Markup , 2004 .

[4]  Jane Hunter,et al.  Vannotea: A collaborative video indexing, annotation and discussion system for broadband networks , 2003 .

[5]  Diana Maynard,et al.  Ontology-based information extraction for market monitoring and technology watch , 2005 .

[6]  Martin Labský,et al.  RDF-Based Retrieval of Information Extracted from Web Product Catalogues , 2004 .

[7]  Steffen Staab,et al.  Towards the self-annotating web , 2004, WWW '04.

[8]  Vojtech Svátek,et al.  Knowledge Modelling for Deductive Web Mining , 2004, EKAW.

[9]  Nicholas Kushmerick,et al.  Wrapper Induction for Information Extraction , 1997, IJCAI.

[10]  Marcelo Tallis Semantic Word Processing for Content , 2003 .

[11]  Carole A. Goble,et al.  Accessibility: a Web engineering approach , 2005, WWW '05.

[12]  Marja-Riitta Koivunen,et al.  Annotea: an open RDF infrastructure for shared Web annotations , 2001, WWW '01.

[13]  Fabio Ciravegna,et al.  Challenges in Information Extraction from Text for Knowledge Management , 2001 .

[14]  Atanas Kiryakov,et al.  Towards Semantic Web Information Extraction , 2003 .

[15]  Peter Gärdenfors,et al.  How to make the Semantic Web more semantic , 2004 .

[16]  Steffen Staab,et al.  Annotation for the semantic web , 2003 .

[17]  Daniela Petrelli,et al.  Semantic Web-Based Document: Editing and Browsing in AktiveDoc , 2005, ESWC.

[18]  Georg Gottlob,et al.  Visual Web Information Extraction with Lixto , 2001, VLDB.

[19]  Paul Buitelaar,et al.  Unsupervised Ontology-based Semantic Tagging for Knowledge Markup , 2005 .

[20]  Oren Etzioni,et al.  Mangrove: Enticing Ordinary People onto the Semantic Web via Instant Gratification , 2003, SEMWEB.

[21]  Steffen Staab,et al.  Authoring and annotation of web pages in CREAM , 2002, WWW.

[22]  Steffen Staab,et al.  Requirements for information extraction for KM , 2003 .

[23]  Enrico Motta,et al.  Opening Up Magpie via Semantic Services , 2004, SEMWEB.

[24]  Doug Downey,et al.  Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..

[25]  Les Carr,et al.  The Distributed Link Service: A Tool for Publishers, Authors, and Readers , 1995, WWW.

[26]  Steffen Staab,et al.  Gimme' the context: context-driven automatic semantic annotation with C-PANKOW , 2005, WWW '05.

[27]  Carole A. Goble,et al.  Towards Annotation Using DAML+OIL , 2001, Semannot@K-CAP 2001.

[28]  Maguelonne Teisseire,et al.  19th International Conference on Applications of Natural Language to Information Systems , 2014 .

[29]  Myra Spiliopoulou,et al.  Coupling Information Extraction and Data Mining for Ontology Learning in PARMENIDES , 2004, RIAO.

[30]  Yorick Wilks,et al.  Designing Adaptive Information Extraction for the Semantic Web in Amilcare , 2003 .

[31]  Alexiei Dingli,et al.  Learning to Harvest Information for the Semantic Web , 2004, ESWS.

[32]  Diana Maynard,et al.  Automatic Creation and Monitoring of Semantic Metadata in a Dynamic Knowledge Portal , 2004, AIMSA.

[33]  Deborah L. McGuinness,et al.  OWL Web ontology language overview , 2004 .

[34]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[35]  Atanas Kiryakov,et al.  KIM – a semantic platform for information extraction and retrieval , 2004, Natural Language Engineering.

[36]  Boris Motik,et al.  Ontologies for Enterprise Knowledge Management , 2003, IEEE Intell. Syst..

[37]  Fabio Rinaldi,et al.  Mining relations in the GENIA corpus , 2004 .

[38]  David R. Karger,et al.  Haystack: A Platform for Creating, Organizing and Visualizing Information Using RDF , 2002, Semantic Web Workshop.

[39]  Paul A. Kogut,et al.  AeroDAML: Applying Information Extraction to Generate DAML Annotations from Web Pages , 2001, Semannot@K-CAP 2001.

[40]  Steffen Staab,et al.  Project Halo: Towards a Digital Aristotle , 2004, AI Mag..

[41]  Vincent Quint,et al.  An introduction to Amaya , 1997, World Wide Web journal.

[42]  David R. Karger,et al.  Thresher: automating the unwrapping of semantic content from the World Wide Web , 2005, WWW '05.

[43]  Nancy Ide,et al.  Using the Right Tools: Enhancing Retrieval from Marked-up Documents , 1999, Comput. Humanit..

[44]  Steffen Staab,et al.  Unveiling the hidden bride: deep annotation for mapping and migrating legacy data to the Semantic Web , 2004, J. Web Semant..

[45]  Ramanathan V. Guha,et al.  A case for automated large-scale semantic annotation , 2003, J. Web Semant..

[46]  Steffen Staab,et al.  Semantic Annotation of Images and Videos for Multimedia Analysis , 2005, ESWC.

[47]  van der Ielka Sluis,et al.  Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC'04) , 2004 .

[48]  Oren Etzioni,et al.  Semantic email: theory and applications , 2004, J. Web Semant..

[49]  William J. Black,et al.  A Suite of Tools for Marking Up Textual Data for Temporal Text Mining Scenarios , 2004, LREC.

[50]  Nigel Collier,et al.  Integrating Deep and Shallow Semantic Structures in Open Ontology Forge , 2004 .

[51]  James A. Hendler,et al.  A Portrait of the Semantic Web in Action , 2001, IEEE Intell. Syst..

[52]  James A. Hendler,et al.  New Tools for the Semantic Web , 2002, EKAW.

[53]  Valentin Tablan,et al.  Web-assisted annotation, semantic indexing and search of television and radio news , 2005, WWW '05.

[54]  Les Carr,et al.  The case for explicit knowledge in documents , 2004, DocEng '04.

[55]  Kalina Bontcheva,et al.  Automatic Report Generation from Ontologies: The MIAKT Approach , 2004, NLDB.

[56]  Piercarlo Slavazza,et al.  Machine Learning for the Semantic Web Putting the user into the cycle , 2005 .

[57]  Jane Hunter,et al.  Using the Semantic Grid to Build Bridges between Museums and Indigenous Communities , 2004 .

[58]  Steffen Staab,et al.  Leveraging Metadata Creation for the Semantic Web with CREAM , 2003, KI.