System Design of a Super-Peer Network for Content-Based Image Retrieval

Super-peer networks inherit the advantages of P2P networks, such as pooling together the shared data (images in our system) across peers, self-organizing, and fault-tolerance. In addition, they take advantage of the heterogeneity of capabilities across peers in load-balancing and network adaptation. A super-peer node operates as an equal peer, and as a server/parent to a set of peers. Content-based image retrieval (CBIR) is concerned with using the image visual content, such as color, texture, and shape, to search and compare images from large-scale image database. In this paper, we adopt a super-peer network for search and retrieval of images across a group of peers’ shared image collections. In the proposed system, an image is represented by a multidimensional feature vector, of dimension d, that is extracted from the image color and texture features. We present a novel organization of super-peers as a structured overlay P2P network (specifically CAN). A super-peer maintains a zone in the d-torus virtual space that contains a set of global image categories (clusters). In addition, a super-peer manages the set of peers that hold images belonging to those global categories. That is the proposed super-peer network is both structured and clustered according to the peers’ image collections. A semifuzzy clustering algorithm is used for both clustering a peer image collection and for assigning a peer image cluster to at least two parent super-peers. We illustrate that the use of a structured and clustered super-peer network guarantees the retrieval of the most similar images to a query image. Scalability analysis demonstrates that the extra storage requirement for a peer is less than 0.1% of its total shared images’ size, and less than 9% for a super-peer.

[1]  Ling Guan,et al.  Semi-automated relevance feedback for distributed content based image retrieval , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[2]  Wolfgang Nejdl,et al.  Super-peer-based routing and clustering strategies for RDF-based peer-to-peer networks , 2003, WWW '03.

[3]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[4]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Enrico Gregori,et al.  Revised Papers from the NETWORKING 2002 Workshops on Web Engineering and Peer-to-Peer Computing , 2002 .

[6]  Yong Yang,et al.  Performance of Full Text Search in Structured and Unstructured Peer-to-Peer Systems , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[7]  Shokri Z. Selim,et al.  Soft clustering of multidimensional data: a semi-fuzzy approach , 1984, Pattern Recognit..

[8]  Jie Lu,et al.  Merging retrieval results in hierarchical peer-to-peer networks , 2004, SIGIR '04.

[9]  Irwin King,et al.  Peer Clustering and Firework Query Model in the Peer›to›Peer Network , 2003 .

[10]  Wolfgang Müller,et al.  Clustering-Based Source Selection for Efficient Image Retrieval in Peer-to-Peer Networks , 2006, Eighth IEEE International Symposium on Multimedia (ISM'06).

[11]  Chi-Hang Chan,et al.  Bridging the P2P and WWW Divide with DISCOVIR - DIStributed COntent-based Visual Information Retrieval , 2003, WWW.

[12]  Thu D. Nguyen,et al.  Text-Based Content Search and Retrieval in Ad-hoc P2P Communities , 2002, NETWORKING Workshops.

[13]  Wolfgang Müller,et al.  Fast retrieval of high-dimensional feature vectors in P2P networks using compact peer data summaries , 2003, MIR '03.

[14]  Hector Garcia-Molina,et al.  Designing a super-peer network , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[15]  L. Guan,et al.  Automatic Relevance Feedback for Distributed Content-Based Image Retrieval , 2005 .

[16]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[17]  Divyakant Agrawal,et al.  PRISM: indexing multi-dimensional data in P2P networks using reference vectors , 2005, MULTIMEDIA '05.

[18]  Charu C. Aggarwal,et al.  On the Surprising Behavior of Distance Metrics in High Dimensional Spaces , 2001, ICDT.

[19]  Zhichen Xu,et al.  pSearch: information retrieval in structured overlays , 2003, CCRV.

[20]  Mohamed A. Ismail,et al.  WAY-LOOK4: A CBIR system based on class signature of the images' color and texture features , 2009, 2009 IEEE/ACS International Conference on Computer Systems and Applications.

[21]  Wolfgang Müller,et al.  Fast retrieval of high-dimensional feature vectors in P2P networks using compact peer data summaries , 2003, MIR '03.

[22]  Wolfgang Müller,et al.  Scalable summary based retrieval in P2P networks , 2005, CIKM '05.

[23]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM '01.

[24]  Jie Lu,et al.  Content-based retrieval in hybrid peer-to-peer networks , 2003, CIKM '03.