Maintenance of Human and Machine Metadata over the Web Content

Semantics over Web content is crucial for web information systems, e.g., for effective information exploration, navigation, or search. However, the current coverage of the Web by semantics is insufficient. Web information systems mostly create their own content-based metadata (e.g., identified keywords) and user collaboration metadata (e.g., implicit user feedback) in the form of information tags: structured information with semantic relations to the tagged content. With information tags, web information systems build lightweight semantics over Web content, in which they can store knowledge and information about the content and the interconnections between its information artifacts. A crucial problem of information tags lies in the dynamic nature of the Web, whose content is continually modified. Together with the passage of time, this can invalidate information tags, which are closely tied to the tagged content. We address this issue with a maintenance approach based on automatically and semi-automatically generated rules that take into account changes on the Web and the time aspect. The maintenance uses a rule-based engine that watches for changes in the tagged content, identifies dependencies among maintenance rules, and builds an optimal strategy for applying the rules. We evaluate the proposed maintenance approach in two domains, programming repositories and digital libraries, which use a shared repository of information tags.
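The idea of ordering maintenance rules by their dependencies can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the described system: the dictionary-based tag layout, the two rules `revalidate` and `reanchor`, and the `DEPENDENCIES` table are hypothetical. The sketch only orders rules topologically; it does not attempt the optimal strategy mentioned above.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def revalidate(tag, content):
    """Hypothetical rule: a tag stays valid only if its anchor text still occurs."""
    tag["valid"] = tag["anchor"] in content
    return tag


def reanchor(tag, content):
    """Hypothetical rule: recompute the anchor offset; relies on `valid` being fresh."""
    tag["offset"] = content.index(tag["anchor"]) if tag["valid"] else None
    return tag


# Hypothetical rule registry and dependency table (rule -> rules it depends on).
RULES = {"revalidate": revalidate, "reanchor": reanchor}
DEPENDENCIES = {"revalidate": set(), "reanchor": {"revalidate"}}


def order_rules(dependencies):
    """Return rule names so that every rule comes after the rules it depends on."""
    return list(TopologicalSorter(dependencies).static_order())


def maintain(tag, content):
    """Apply all rules to one tag, in dependency order, after a content change."""
    for name in order_rules(DEPENDENCIES):
        tag = RULES[name](tag, content)
    return tag
```

For example, calling `maintain({"anchor": "foo", "valid": True, "offset": 0}, "bar foo")` runs `revalidate` before `reanchor`, so the offset is recomputed only for a tag that is still known to be valid.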
