As the amount of user-generated content grows, personal information management has become a challenging problem. Several information management approaches, such as desktop search, document organization, and (collaborative) document tagging, have been proposed to address it, but they are either inappropriate or inefficient. Automated collaborative document tagging approaches mitigate the problems of manual tagging, but they usually rely on centralized settings, which suffer from scalability and privacy problems. To resolve these issues, we present P2PDocTagger, an automated and distributed document tagging system based on classification in P2P networks. P2PDocTagger minimizes the effort required of individual peers and reduces computation and communication costs, while providing high tagging accuracy and easing document organization and retrieval. In addition, we provide a realistic and flexible simulation toolkit, P2PDMT, to facilitate the development and testing of P2P data mining algorithms.