Managing metadata in open learning repositories and P2P networks

“Now, miraculously, we have the Web. For the documents in our lives, everything is simple and smooth. But for data, we are still pre-Web.” (Tim Berners-Lee, Business Model for the Semantic Web) The successful use and re-use, search, and operation of data, depends on the effective definition, use and management of metadata. The first part of this thesis considers the issues related to learning metadata, which are the nuts and bolts of any application in the field of e-learning. More precisely we investigate learning metadata issues in the context of a “local” open learning repository (OLR for short). Thereby, we stress the pedagogical background in handling metadata, discussing metadata standards, and structuring learning materials. We demonstrate, inter alia, the lack of addressing learning processes and instructional theories in the learning object metadata standard (LOM). Then, we propose an extension of LOM based on the introduction of an abstraction layer and the notion of instructional roles. We also structure several courses based on different instructional models. Our open learning repositories can be considered as a framework and a testbed where metadata modeling languages, learning metadata standards, and metadata management are presented and discussed within an interdisciplinary team. In the second part, we generalize the learning metadata issues, particularly metadata management, to issues related to the broadly used metadata that annotate any resource on the Web. We also expand the metadata management from the local environment of open learning repositories to the distributed environment of peerto-peer networks. The open learning repositories play then the role of special peers, themetadata providers , in the P2P network. Unfortunately, although quite a few database techniques can be re-used in the P2P context, P2P metadata management infrastructures pose additional challenges caused by the open and dynamic nature of these networks. The main task here is to enable an efficient dynamic distributed query processing. For this purpose, we briefly present our super-peer based topology and schema-aware distributed routing indices extended with suitable statistics. Then, we show how these indices facilitate the distribution and dynamic expansion of query plans. After that, we propose a set of transformation rules to optimize query plans and discuss different optimization strategies in detail. In addition to the optimization of complex distributed query processing, we also investigate semantic caching strategies for P2P networks, in order to optimize the query response time and reduce the network load.

[1]  Wolfgang Nejdl,et al.  Building up AI Resources as an AI Testbed , 2001 .

[2]  Chen Li,et al.  Answering queries using views with arithmetic comparisons , 2002, PODS '02.

[4]  Alon Y. Halevy,et al.  Piazza: data management infrastructure for semantic web applications , 2003, WWW '03.

[5]  Matthias Jarke,et al.  ConceptBase: Managing Conceptual Models about Information Systems , 2006, Handbook on Architectures of Information Systems.

[6]  Wolfgang Klafki,et al.  Neue Studien zur Bildungstheorie und Didaktik , 2007 .

[7]  Adam Wierzbicki,et al.  Cache replacement policies revisited: the case of P2P traffic , 2004, IEEE International Symposium on Cluster Computing and the Grid, 2004. CCGrid 2004..

[8]  Wolfgang Nejdl,et al.  Meta-Level Category ‚Role' in Metadata Standards for Learning: Instructional Roles and Instructional Qualities of Learning Objects , 2002 .

[9]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[10]  Wolfgang Nejdl,et al.  Super-peer-based routing and clustering strategies for RDF-based peer-to-peer networks , 2003, WWW '03.

[11]  Mikael Nilsson,et al.  Conzilla - Towards a Concept Browser , 1999 .

[12]  Li Gong Project JXTA: A Technology Overview , 2001 .

[13]  Wolfgang Nejdl,et al.  Processing and Optimization of Complex Queries in Schema-Based P2P-Networks , 2004, DBISP2P.

[14]  M. David Merrill,et al.  First principles of instruction , 2012 .

[15]  Dongwon Lee,et al.  Towards Intelligent Semantic Caching for Web Sources , 2001, Journal of Intelligent Information Systems.

[16]  Gautam Jain Query Optimization for Parallel Execution , 2007 .

[17]  Wolfgang Nejdl,et al.  OTelosRDF: A Resource Description Format with Enhanced MetaModeling Functionalities Based on OTelos , 2001, Semannot@K-CAP 2001.

[18]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[19]  Alfons Kemper,et al.  ObjectGlobe: Ubiquitous query processing on the Internet , 2001, The VLDB Journal.

[20]  Volker Markl,et al.  LEO - DB2's LEarning Optimizer , 2001, VLDB.

[21]  Manfred A. Jeusfeld,et al.  Anderungskontrolle in deduktiven Objektbanken , 1992 .

[22]  C. M. Sperberg-McQueen,et al.  eXtensible Markup Language (XML) 1.0 (Second Edition) , 2000 .

[23]  Christian Süß Adaptive Knowledge Management: A Meta-Modeling Approach and its Binding to XML , 2000, Grundlagen von Datenbanken.

[24]  Sean Reilly,et al.  Handle System Namespace and Service Definition , 2003, RFC.

[25]  Nicola Guarino,et al.  Concepts, attributes and arbitrary relations , 1992, Data Knowl. Eng..

[26]  Michael R. Genesereth,et al.  Answering recursive queries using views , 1997, PODS '97.

[27]  Heiner Stuckenschmidt,et al.  Index structures and algorithms for querying distributed RDF repositories , 2004, WWW '04.

[28]  Miron Livny,et al.  Local Disk Caching for Client-Server Database Systems , 1993, VLDB.

[29]  Ralf Steinmetz,et al.  Ein Kurseditor für modularisierte Lernressourcen auf der Basis von Learning Objects Metadata zur Erstellung von adaptierbaren Kursen , 2001 .

[30]  Wolfgang Nejdl,et al.  Distributed Queries and Query Optimization in Schema-Based P2P-Systems , 2003, DBISP2P.

[31]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[32]  Fausto Giunchiglia,et al.  Data Management for Peer-to-Peer Computing : A Vision , 2002, WebDB.

[33]  Ilia Petrov,et al.  Guide to Web Application and Platform Architectures , 2004, Springer Berlin Heidelberg.

[34]  Wolfgang Nejdl,et al.  How are Learning Objects Used in Learning Processes? Instructional Roles of Learning Objects in LOM , 2002 .

[35]  Wolfgang Nejdl,et al.  The RDF Schema Specification Revisited , 2000 .

[36]  Hector Garcia-Molina,et al.  Improving Search in Peer-to-Peer Systems , 2001 .

[37]  Wolfgang Nejdl,et al.  Intelligent Online-Knowledge-Resources for Intentional Learning , 2001 .

[38]  Rob Koper,et al.  Modeling units of study from a pedagogical perspective: the pedagogical meta-model behind EML , 2001 .

[39]  Donald D. Chamberlin,et al.  W3C World Wide Web Consortium , 2003 .

[40]  Dan Brickley,et al.  Resource description framework (RDF) schema specification , 1998 .

[41]  Prasenjit Mitra An algorithm for answering queries efficiently using views , 2001, ADC.

[42]  Erik Duval,et al.  Metadata Principles and Practicalities , 2002, D Lib Mag..

[43]  Wolfgang Nejdl,et al.  O-Telos-RDF: An Extension of RDF with Enhanced Meta-Modeling and Reification Functionalities , 2001 .

[44]  Laura M. Haas,et al.  Garlic: a new flavor of federated query processing for DB2 , 2002, SIGMOD '02.

[45]  David Maier,et al.  Distributed Query Processing and Catalogs for Peer-to-Peer Systems , 2003, CIDR.

[46]  Steffen Staab,et al.  CREAM: creating relational metadata with a component-based, ontology-driven annotation framework , 2001, K-CAP '01.

[47]  J. Crawford,et al.  Setting the stage. , 2021, The New England journal of medicine.

[48]  Jarek Gryz,et al.  Answering Queries by Semantic Caches , 1999, DEXA.

[49]  Alfons Kemper,et al.  Hyperqueries: Dynamic Distributed Query Processing on the Internet , 2001, VLDB.

[50]  Martin Wolpers,et al.  Kbs hyperbook - a data-driven information system on the web , 1998, WWW 1998.

[51]  Dan Brickley,et al.  Rdf vocabulary description language 1.0 : Rdf schema , 2004 .

[52]  Aris M. Ouksel,et al.  Agents and Peer-to-Peer Computing , 2003, Lecture Notes in Computer Science.

[53]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[54]  Birgitta König-Ries,et al.  Strategies for Semantic Caching , 2001, DEXA.

[55]  Peter A. Boncz,et al.  AmbientDB: Relational Query Processing in a P2P Network , 2003, DBISP2P.

[56]  Alfred V. Aho,et al.  Efficient optimization of a class of relational expressions , 1978, SIGMOD Conference.

[57]  Ashok K. Chandra,et al.  Optimal implementation of conjunctive queries in relational data bases , 1977, STOC '77.

[58]  Wolfgang Nejdl,et al.  Open Learning Repositories and Metadata Modeling , 2001, SWWS.

[59]  Alejandro P. Buchmann,et al.  A Transaction Model for Active Distributed Object Systems , 1992, Database Transaction Models for Advanced Applications.

[60]  Wolfgang Klafki,et al.  Studien zur Bildungstheorie und Didaktik , 1963 .

[61]  David Wiley,et al.  Connecting learning objects to instructional design theory: A definition, a metaphor, and a taxonomy , 2000 .

[62]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[63]  W. Nejdl,et al.  OTelosRDF: A Resource Description Format with Enhanced MetaModeling Functionalities Based on OTelos , 2001, Semannot@K-CAP 2001.

[64]  Brandon Muramatsu,et al.  Draft Standard for Learning Object Metadata , 2002 .

[65]  Heidrun Allert,et al.  Instructional Models and Scenarios for an Open Learning Repository - Instructional Design and Metadata , 2002 .

[66]  Jane Greenberg,et al.  Metadata and the world wide web , 2002 .

[67]  Divesh Srivastava,et al.  Data model and query evaluation in global information systems , 1995, Journal of Intelligent Information Systems.

[68]  Manfred Hauswirth,et al.  Semantic Gossiping , 2002 .

[69]  Alfons Kemper,et al.  Building Dynamic Market Places Using HyperQueries , 2002, EDBT.

[70]  Rafael Alonso,et al.  Data caching issues in an information retrieval system , 1990, TODS.

[71]  William Y. Arms,et al.  The Handle System , 1997 .

[72]  Antony I. T. Rowstron,et al.  Squirrel: a decentralized peer-to-peer web cache , 2002, PODC '02.

[73]  Guido Moerkotte,et al.  Heuristic and randomized optimization for the join ordering problem , 1997, The VLDB Journal.

[74]  Divesh Srivastava,et al.  Semantic Data Caching and Replacement , 1996, VLDB.

[75]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM '01.

[76]  David Thomas,et al.  Pragmatic version control with CVS , 2004 .

[77]  HalevyAlon,et al.  MiniCon: A scalable algorithm for answering queries using views , 2001, VLDB 2001.

[78]  Mario T. Schlosser,et al.  HyperCuP – Hypercubes , Ontologies and Efficient Search on P 2 P Networks , 2002 .

[79]  Tore Risch,et al.  EDUTELLA: a P2P networking infrastructure based on RDF , 2002, WWW.