Freelib: a self-sustainable peer-to-peer digital library framework for evolving communities

The need for efficient solutions to the problem of disseminating and sharing of data is growing. Digital libraries provide an efficient solution for disseminating and sharing large volumes of data to diverse sets of users. They enable the use of structured and well defined metadata to provide quality search services. Most of the digital libraries built so far follow a centralized model. The centralized model is an efficient model; however, it has some inherent problems. It is not suitable when content contribution is highly distributed over a very large number of participants. It also requires an organizational support to provide resources (hardware, software, and network bandwidth) and to manage processes for collecting, ingesting, curating, and maintaining the content. In this research, we develop an alternative digital library framework based on peer-to-peer. The framework utilizes resources contributed by participating nodes to achieve self-sustainability. A second key contribution of this research is a significant enhancement of search performance by utilizing the novel concept of community evolution. As demonstrated in this thesis, bringing users sharing similar interest together in a community significantly enhances the search performance. Evolving users into communities is based on a simple analysis of user access patterns in a completely distributed manner. This community evolution process is completely transparent to the user. In our framework, community membership of each node is continuously evolving. This allows users to move between communities as their interest shifts between topics, thus enhancing search performance for users all the time even when their interest changes. It also gives our framework great flexibility as it allows communities to dissolve and new communities to form and evolve over time to reflect the latest user interests. In addition to self-sustainability and performance enhancements, our framework has the potential of building extremely large collections although every node is only maintaining a small collection of digital objects.

[1]  Bengt Carlsson,et al.  The Rise and Fall of Napster - An Evolutionary Approach , 2001, Active Media Technology.

[2]  Gurmeet Singh Manku,et al.  SETS: search enhanced by topic segmentation , 2003, SIGIR.

[3]  Terence R. Smith,et al.  The Alexandria Digital Library architecture , 2000, International Journal on Digital Libraries.

[4]  Mario Kolberg,et al.  Tools for Peer-to-Peer Network Simulation , 2006 .

[5]  Ingeborg Sølvberg,et al.  Metadata Harvesting Framework in P2P-Based Digital Libraries , 2004, Dublin Core Conference.

[6]  John Cocke,et al.  The evolution of RISC technology at IBM , 1990, IBM Journal of Research and Development.