Metadata generation and consolidation within an ontology-based document management system

A central issue for the benefit and success of an ontology-based personal document management system is its capability to automatically model appropriate and valid document descriptions. The generation process primarily depends on the application context and should be customised accordingly. Furthermore, automatically generated information needs appropriate cleaning and consolidation to maintain a certain level of data quality. Therefore, this paper presents a semantic document management system which applies a stepwise knowledge modelling process. Our approach separately addresses the problems of general translation of diverse information sources, syntax check, normalisation and duplication and conflict handling, based on consecutive and configurable stages.

[1]  Siegfried Handschuh,et al.  Semantic annotation for knowledge management: Requirements and a survey of the state of the art , 2006, J. Web Semant..

[2]  Steffen Staab,et al.  Authoring and annotation of web pages in CREAM , 2002, WWW.

[3]  Raimund Dachselt,et al.  TimeZoom: a flexible detail and context timeline , 2006, CHI EA '06.

[4]  Giovanni Tummarello,et al.  Enabling Semantic Web Communities with DBin: An Overview , 2006, International Semantic Web Conference.

[5]  Marc Ehrig Ontology Alignment: Bridging the Semantic Gap (Semantic Web and Beyond) , 2006 .

[6]  Jane Hunter,et al.  The ABC Ontology and Model , 2001, J. Digit. Inf..

[7]  D. Marples,et al.  The Open Services Gateway Initiative: an introductory overview , 2001, IEEE Commun. Mag..

[8]  Carlo Batini,et al.  Data Quality at a Glance , 2005, Datenbank-Spektrum.

[9]  David R. Karger,et al.  Data unification in personal information management , 2006, CACM.

[10]  Leo Sauermann,et al.  Semantic Desktop 2.0: The Gnowsis Experience , 2006, International Semantic Web Conference.

[11]  David E. Millard,et al.  Artequakt: Generating Tailored Biographies with Automatically Annotated Fragments from the Web , 2002, SAAKM@ECAI.

[12]  Marja-Riitta Koivunen,et al.  Annotea: an open RDF infrastructure for shared Web annotations , 2001, WWW '01.

[13]  J. Carroll,et al.  Jena: implementing the semantic web recommendations , 2004, WWW Alt. '04.

[14]  Klaus Meißner,et al.  CroCo: Ontology-Based, Cross-Application Context Management , 2008, 2008 Third International Workshop on Semantic Media Adaptation and Personalization.

[15]  William E. Winkler,et al.  The State of Record Linkage and Current Research Problems , 1999 .

[16]  David E. Millard,et al.  Ontologies as facilitators for repurposing web documents , 2007, Int. J. Hum. Comput. Stud..

[17]  Eric Prud'hommeaux,et al.  Annotea: an open RDF infrastructure for shared Web annotations , 2002, Comput. Networks.

[18]  Steffen Staab,et al.  Semantic Annotation of Images and Videos for Multimedia Analysis , 2005, ESWC.

[19]  David R. Karger,et al.  Haystack: a user interface for creating, browsing, and organizing arbitrary semistructured information , 2004, CHI EA '04.

[20]  Fabio Ciravegna,et al.  Cross-media document annotation and enrichment , 2006, SAAW@ISWC.

[21]  Siegfried Handschuh,et al.  The NEPOMUK Project - On the way to the Social Semantic Desktop , 2007 .