Semantic Document Architecture for Desktop Data Integration and Management

The paper presents a novel desktop document architecture, namely SDArch, which attempts to integrate data from heterogeneous desktop applications into a unified desktop information space. To achieve this, SDArch introduces a new document representation model, which establishes explicit semantic links between fine-grained units of document data based on the conceptualization of their semantics. The SDArch semantic search and navigation services, which run on this unified desktop information space, aim to improve the search and navigation within desktop data, thus improving the effectiveness and efficiency of desktop users in carrying out their daily tasks. We report on a usability evaluation of the SDArch prototype and an experimental evaluation of the proposed semantic search. The evaluation results are promising. We present an analysis of these results.

[1]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[2]  David R. Karger,et al.  Haystack: A Platform for Authoring End User Semantic Web Applications , 2003, WWW.

[3]  Eero Hyvönen,et al.  MuseumFinland - Finnish museums on the semantic web , 2005, J. Web Semant..

[4]  Vincent Quint Active Documents as a Paradigm for Human-Computer Interaction , 1994 .

[5]  Dragan Gasevic,et al.  Semantic Document Management for Collaborative Learning Object Authoring , 2008, 2008 Eighth IEEE International Conference on Advanced Learning Technologies.

[6]  Amit P. Sheth,et al.  SemRank: ranking complex relationship search results on the semantic web , 2005, WWW '05.

[7]  Michael K. Buckland,et al.  What is a digital document , 1998 .

[8]  Max Völkel From Documents to Knowledge Models , 2007 .

[9]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[10]  Enrico Motta,et al.  Revyu: Linking reviews and ratings into the Web of Data , 2008, J. Web Semant..

[11]  Betty Collis,et al.  Technology and human issues in reusing learning objects , 2004 .

[12]  James A. Hendler,et al.  Web science: an interdisciplinary approach to understanding the web , 2008, CACM.

[13]  Michael F. Smith Software Prototyping: Adoption, Practice and Management , 1991 .

[14]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[15]  Siegfried Handschuh,et al.  The NEPOMUK Project - On the way to the Social Semantic Desktop , 2007 .

[16]  Christoph Mangold,et al.  A survey and classification of semantic search approaches , 2007, Int. J. Metadata Semant. Ontologies.

[17]  Gordon Bell,et al.  MyLifeBits: fulfilling the Memex vision , 2002, MULTIMEDIA '02.

[18]  Alison Kidd,et al.  The marks are on the knowledge worker , 1994, CHI '94.

[19]  Miguel-Ángel Sicilia Metadata, semantics, and ontology: providing meaning to information resources , 2006, Int. J. Metadata Semant. Ontologies.

[20]  Frank M. Shipman,et al.  Which semantic web? , 2003, HYPERTEXT '03.

[21]  Brandon Muramatsu,et al.  Draft Standard for Learning Object Metadata , 2002 .

[22]  H. Lan,et al.  SWRL : A semantic Web rule language combining OWL and ruleML , 2004 .

[23]  Robert Wilensky,et al.  Multivalent documents , 2000, CACM.

[24]  Ramanathan V. Guha,et al.  Semantic search , 2003, WWW '03.

[25]  Jakob Nielsen,et al.  The "magic number 5": is it enough for web testing? , 2002, CHI Extended Abstracts.

[26]  D. Hofstadter Fluid Concepts and Creative Analogies: Computer Models of the Fundamental Mechanisms of Thought, Douglas Hofstadter. 1994. Basic Books, New York, NY. 512 pages. ISBN: 0-465-05154-5. $30.00 , 1995 .

[27]  Rifat Ozcan,et al.  Concept-based information access , 2005, International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II.

[28]  David R. Karger,et al.  Haystack: A General-Purpose Information Management Tool for End Users Based on Semistructured Data , 2005, CIDR.

[29]  Henrik Eriksson The semantic-document approach to combining documents and ontologies , 2007, Int. J. Hum. Comput. Stud..

[30]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[31]  William G. Griswold,et al.  A component architecture for an extensible, highly integrated context-aware computing infrastructure , 2003, 25th International Conference on Software Engineering, 2003. Proceedings..

[32]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[33]  Dragan Gasevic,et al.  Concept-based semantic annotation, indexing and retrieval of office-like document units , 2010, RIAO.

[34]  Clement T. Yu,et al.  Personalized web search by mapping user queries to categories , 2002, CIKM '02.

[35]  H. Penny Nii,et al.  The Handbook of Artificial Intelligence , 1982 .

[36]  Dragan Gasevic,et al.  Extending MS Office for Sharing Document Content Units over the Semantic Web , 2008, 2008 Eighth International Conference on Web Engineering.

[37]  M. Fischetti Working knowledge. , 2003, Scientific American.

[38]  Christopher A. Welty An integrated representation for software development and discovery , 1996 .

[39]  Siegfried Handschuh,et al.  Semantic annotation for knowledge management: Requirements and a survey of the state of the art , 2006, J. Web Semant..

[40]  F. A. Grootjen,et al.  Conceptual query expansion , 2006, Data Knowl. Eng..

[41]  Steffen Staab,et al.  International Handbooks on Information Systems , 2013 .

[42]  Ian Horrocks,et al.  OIL: An Ontology Infrastructure for the Semantic Web , 2001, IEEE Intell. Syst..

[43]  James A. Hendler,et al.  The semantic Web and its languages , 2000 .

[44]  Dragan Gasevic,et al.  Using Semantic Documents and Social Networking in Authoring of Course Material: An Empirical Study , 2010, 2010 10th IEEE International Conference on Advanced Learning Technologies.

[45]  Mehdi Jazayeri,et al.  Towards efficient document content sharing in social networks , 2009, SoSEA '09.

[46]  Kristian Fischer,et al.  The Open Document Architecture: From Standardization to the Market , 1992, IBM Syst. J..

[47]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[48]  D. Kazakov,et al.  Using WordNet Similarity and Antonymy Relations to Aid Document Retrieval , 2005 .

[49]  Heiner Stuckenschmidt,et al.  Handbook on Ontologies , 2004, Künstliche Intell..

[50]  Dov Dori,et al.  The representation of document structure: a generic object-process analysis , 1995 .

[51]  Brian Keith Reid,et al.  Scribe: a document specification language and its compiler , 1981 .

[52]  Simon L. Kendal,et al.  An introduction to knowledge engineering , 2007 .

[53]  Dragan Gasevic,et al.  Ontology-based content model for scalable content reuse , 2007, K-CAP '07.

[54]  Michael K. Buckland The centenary of “Madame Documentation”: Suzanne Briet, 1894–1989 , 1995 .

[55]  Sören Auer,et al.  OntoWiki: A Tool for Social, Semantic Collaboration , 2006, CKC.

[56]  Pablo Castells,et al.  An Ontology-Based Information Retrieval Model , 2005, ESWC.

[57]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[58]  James A. Hendler,et al.  Agents and the Semantic Web , 2001, IEEE Intell. Syst..

[59]  Victor Gaudioso Classes and Interfaces , 2010 .

[60]  P. Berger,et al.  The Social Construction of Reality , 1966 .

[61]  James A. Hendler,et al.  A Portrait of the Semantic Web in Action , 2001, IEEE Intell. Syst..

[62]  David F. Brailsford,et al.  Enhancing composite digital documents using XML-based standoff markup , 2005, DocEng '05.

[63]  Alon Y. Halevy,et al.  A Platform for Personal Information Management and Integration , 2005, CIDR.

[64]  Vladan Devedzic,et al.  Ontology-Based Automatic Annotation of Learning Content , 2006, Int. J. Semantic Web Inf. Syst..

[65]  Stefan Decker,et al.  The Networked Semantic Desktop , 2004, WWW Workshop on Application Design, Development and Implementation Issues in the Semantic Web.

[66]  Erik Duval,et al.  The Ariadne knowledge pool system , 2001, CACM.

[67]  Zhiguo Gong,et al.  Multi-term Web Query Expansion Using WordNet , 2006, DEXA.

[68]  Yun Peng,et al.  Finding and Ranking Knowledge on the Semantic Web , 2005, SEMWEB.

[69]  Vladan Devedzic,et al.  Using Semantic Web Technologies to Analyze Learning Content , 2007, IEEE Internet Computing.

[70]  Daniel J. Weitzner Beyond Secrecy: New Privacy Protection Strategies for Open Information Spaces , 2007, IEEE Internet Computing.

[71]  John Davies,et al.  Squirrel: An Advanced Semantic Search and Browse Facility , 2007, ESWC.

[72]  Joseph S. Dumas,et al.  Usability in practice: formative usability evaluations - evolution and revolution , 2002, CHI Extended Abstracts.

[73]  Jun-feng Song,et al.  Ontology-Based Information Retrieval Model for the Semantic Web , 2005, EEE.

[74]  Martin L. Griss,et al.  Software agents as next generation software components , 2001 .

[75]  L. Cronbach Coefficient alpha and the internal structure of tests , 1951 .

[76]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[77]  Victor Vianu,et al.  Rule-based languages , 2004, Annals of Mathematics and Artificial Intelligence.

[78]  Christian Becker,et al.  Exploring the Geospatial Semantic Web with DBpedia Mobile , 2009, J. Web Semant..

[79]  Zhang Wei-ming,et al.  Ontology-based information retrieval model for the semantic Web , 2005, 2005 IEEE International Conference on e-Technology, e-Commerce and e-Service.

[80]  Ramanathan V. Guha,et al.  A case for automated large-scale semantic annotation , 2003, J. Web Semant..

[81]  D. W. Zimmerman Teacher’s Corner: A Note on Interpretation of the Paired-Samples t Test , 1997 .

[82]  J. M. Cortina,et al.  What Is Coefficient Alpha? An Examination of Theory and Applications , 1993 .

[83]  John Durkin,et al.  Expert systems - design and development , 1994 .

[84]  John G. Breslin,et al.  The Future of Social Networks on the Internet: The Need for Semantics , 2007, IEEE Internet Computing.

[85]  Daniel Schwabe,et al.  A hybrid approach for searching in the semantic web , 2004, WWW '04.

[86]  Kai Ming Ting,et al.  Precision and Recall , 2017, Encyclopedia of Machine Learning and Data Mining.

[87]  Hinrich Schütze,et al.  Personalized search , 2002, CACM.

[88]  James A. Hendler,et al.  A Framework for Web Science , 2006, Found. Trends Web Sci..

[89]  James A. Hendler,et al.  Information accountability , 2008, CACM.

[90]  Eero Hyvönen,et al.  ONKI SKOS Server for Publishing and Utilizing SKOS Vocabularies and Ontologies as Services , 2009, ESWC.

[91]  Eero Hyvönen,et al.  A Method for Determining Ontology-Based Semantic Relevance , 2007, DEXA.

[92]  Peter Smith,et al.  An introduction to knowledge engineering , 1996 .

[93]  W. Shadish,et al.  Experimental and Quasi-Experimental Designs for Generalized Causal Inference , 2001 .

[94]  Jakob Nielsen,et al.  Usability engineering , 1997, The Computer Science and Engineering Handbook.

[95]  Bernhard Haslhofer,et al.  The Sile Model - A Semantic File System Infrastructure for the Desktop , 2009, ESWC.

[96]  Les Carr,et al.  The case for explicit knowledge in documents , 2004, DocEng '04.

[97]  Loren G. Terveen,et al.  Let's Stop Pushing the Envelope and Start Addressing It: A Reference Task Agenda for HCI , 2000, Hum. Comput. Interact..

[98]  R. Likert “Technique for the Measurement of Attitudes, A” , 2022, The SAGE Encyclopedia of Research Design.

[99]  Marcelo Tallis,et al.  Semantic Word Processing for Content Authors , 2003 .

[100]  Jens Dittrich,et al.  A Dataspace Odyssey: The iMeMex Personal Dataspace Management System (Demo) , 2007, CIDR.

[101]  Emile L. Morse Evaluation Methodologies for Information Management Systems , 2002, D Lib Mag..

[102]  Chris Clarke A Resource List Management Tool for Undergraduate Students Based on Linked Open Data Principles , 2009, ESWC.

[103]  Leo Sauermann,et al.  Evaluating Long-Term Use of the Gnowsis Semantic Desktop for PIM , 2008, International Semantic Web Conference.

[104]  Steffen Staab,et al.  WordNet improves text document clustering , 2003, SIGIR 2003.

[105]  Daniela Petrelli,et al.  Semantic Web-Based Document: Editing and Browsing in AktiveDoc , 2005, ESWC.

[106]  Atanas Kiryakov,et al.  Semantic Annotation, Indexing, and Retrieval , 2003, SEMWEB.

[107]  Sasa Nesic,et al.  Semantic Document Model to Enhance Data and Knowledge Interoperability , 2009, Web 2.0 & Semantic Web.

[108]  Marta Mattoso,et al.  Using ontologies for domain information retrieval , 2000, Proceedings 11th International Workshop on Database and Expert Systems Applications.

[109]  Clement T. Yu,et al.  Semantic-Based Grouping of Search Engine Results Using WordNet , 2007, APWeb/WAIM.

[110]  Warren Harrison Eating Your Own Dog Food , 2006, IEEE Softw..

[111]  Thomas R. Gruber,et al.  Collective knowledge systems: Where the Social Web meets the Semantic Web , 2008, J. Web Semant..

[113]  Ivo Düntsch,et al.  Evaluation of software systems , 2002 .

[114]  Karen L. Myers,et al.  Task Management under Change and Uncertainty Constraint Solving Experience with the CALO Project , 2005 .

[115]  James O. Coplien,et al.  Software design patterns , 2003 .

[116]  Dragan Gasevic,et al.  An ontology-based framework for author-learning content interaction , 2007 .

[117]  William Buxton,et al.  Usability evaluation considered harmful (some of the time) , 2008, CHI.

[118]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[119]  Hans-Peter Frei,et al.  Concept based query expansion , 1993, SIGIR.

[120]  Enrico Motta,et al.  SemSearch: A Search Engine for the Semantic Web , 2006, EKAW.

[121]  Fred D. Davis Perceived Usefulness, Perceived Ease of Use, and User Acceptance of Information Technology , 1989, MIS Q..

[122]  Martin Gaedke,et al.  Silk - A Link Discovery Framework for the Web of Data , 2009, LDOW.

[123]  Dragan Gasevic,et al.  An Ontology-Based Framework for Authoring Assisted by Recommendation , 2007, Seventh IEEE International Conference on Advanced Learning Technologies (ICALT 2007).

[124]  Paul A. Kogut,et al.  AeroDAML: Applying Information Extraction to Generate DAML Annotations from Web Pages , 2001, Semannot@K-CAP 2001.