A flexible content repository to enable a peer‐to‐peer‐based wiki

Wikis—being major applications of the Web 2.0—are used for a large number of purposes, such as encyclopedias, project documentation, and coordination, both in open communities and in enterprises. At the application level, users are targeted as both consumers and producers of dynamic content. Yet, this kind of peer‐to‐peer (P2P) principle is not used at the technical level being still dominated by traditional client–server architectures. What lacks is a generic platform that combines the scalability of the P2P approach with, for example, a wiki's requirements for consistent content management in a highly concurrent environment. This paper presents a flexible content repository system that is intended to close the gap by using a hybrid P2P overlay to support scalable, fault‐tolerant, consistent, and efficient data operations for the dynamic content of wikis. On the one hand, this paper introduces the generic, overall architecture of the content repository. On the other hand, it describes the major building blocks to enable P2P data management at the system's persistent storage layer, and how these may be used to implement a P2P‐based wiki application: (i) a P2P back‐end administrates a wiki's actual content resources. (ii) On top, P2P service groups act as indexing groups to implement a wiki's search index. Copyright © 2009 John Wiley & Sons, Ltd.

[1]  Robert Tappan Morris,et al.  Comparing the Performance of Distributed Hash Tables Under Churn , 2004, IPTPS.

[2]  Jörg P. Müller,et al.  DhtFlex: A Flexible Approach to Enable Efficient Atomic Data Management Tailored for Structured Peer-to-Peer Overlays , 2008, 2008 Third International Conference on Internet and Web Applications and Services.

[3]  Leslie Lamport,et al.  The part-time parliament , 1998, TOCS.

[4]  Mahadev Satyanarayanan,et al.  Scale and performance in a distributed file system , 1987, SOSP '87.

[5]  Ajay D. Kshemkalyani,et al.  Distributed Computing: Index , 2008 .

[6]  Kotagiri Ramamohanarao,et al.  Inverted files versus signature files for text indexing , 1998, TODS.

[7]  GhemawatSanjay,et al.  The Google file system , 2003 .

[8]  Bo Leuf,et al.  The Wiki Way: Quick Collaboration on the Web , 2001 .

[9]  Krishna P. Gummadi,et al.  King: estimating latency between arbitrary internet end hosts , 2002, IMW '02.

[10]  J. Avery,et al.  The long tail. , 1995, Journal of the Tennessee Medical Association.

[11]  David Mazières,et al.  Democratizing Content Publication with Coral , 2004, NSDI.

[12]  Rüdiger Kapitza,et al.  A Generic Infrastructure for Decentralised Dynamic Loading of Platform-Specific Code , 2007, DAIS.

[13]  Philip A. Bernstein Repositories and object oriented databases , 1998, SGMD.

[14]  David R. Karger,et al.  Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web , 1997, STOC '97.

[15]  Abraham Silberschatz,et al.  Distributed file systems: concepts and examples , 1990, CSUR.

[16]  André Schiper,et al.  Lightweight causal and atomic group multicast , 1991, TOCS.

[17]  Tim O'Reilly,et al.  What is Web 2.0: Design Patterns and Business Models for the Next Generation of Software , 2007 .

[19]  Guillaume Pierre,et al.  A Decentralized Wiki Engine for Collaborative Wikipedia Hosting , 2007, WEBIST.

[20]  Larry L. Peterson,et al.  Reliability and Security in the CoDeeN Content Distribution Network , 2004, USENIX Annual Technical Conference, General Track.

[21]  Ben Y. Zhao,et al.  OceanStore: An Extremely Wide-Area Storage System , 2002, ASPLOS 2002.

[22]  Liu Yao,et al.  Research on the Key Technology Based-NLP Of Chinese Medicine Pulse's Mathematical Quantifying , 2008, WI 2008.

[23]  David R. Karger,et al.  Wide-area cooperative storage with CFS , 2001, SOSP.

[24]  Dennis Shasha,et al.  The dangers of replication and a solution , 1996, SIGMOD '96.

[25]  Steffen Rusitschka,et al.  The Resource Management Framework: A System for Managing Metadata in Decentralized Networks Using Peer-to-Peer Technology , 2002, AP2PC.

[26]  Krishna P. Gummadi,et al.  King: estimating latency between arbitrary internet end hosts , 2002, IMW '02.

[27]  Kenneth P. Birman,et al.  Reliable communication in the presence of failures , 1987, TOCS.

[28]  Hans P. Reiser,et al.  A Reconfigurable System Architecture for Consensus-based Group Communication , 2005, IASTED PDCS.

[29]  Wolfgang Müller,et al.  Praxiserfahrungen bei der Einführung dezentraler Wissensmanagement-Lösungen , 2003, Wirtsch..

[30]  Ajay D. Kshemkalyani,et al.  Distributed Computing: Principles, Algorithms, and Systems , 2008 .

[31]  Mary Baker,et al.  Peer-to-Peer Caching Schemes to Address Flash Crowds , 2002, IPTPS.

[32]  Florian Schintke,et al.  Peer-to-Peer Computing , 2010, Euro-Par.

[33]  Harald C. Gall,et al.  Software-Architekturen für verteilte Systeme - Prinzipien, Bausteine und Standardarchitekturen für moderne Software , 2003, Xpert.press.

[34]  Antony I. T. Rowstron,et al.  Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utility , 2001, SOSP.

[35]  Harald C. Gall,et al.  Software-Architekturen für Verteilte Systeme , 2003 .

[36]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[37]  Ben Y. Zhao,et al.  OceanStore: an architecture for global-scale persistent storage , 2000, SIGP.

[38]  Werner Vogels,et al.  Dynamo: amazon's highly available key-value store , 2007, SOSP.