Experiments with Clustering the Collection at ImageCLEF 2007

We present our participation in the ImageCLEF 2007 ad-hoc photographic retrieval task. This first participation comprised six runs. The main purpose of three of these runs was to evaluate the text and visual retrieval tools, as well as their combination, in the context of the given task. The other purpose of our participation was to experiment with applying clustering techniques to this task, which has rarely been done in previous editions of the ImageCLEF ad-hoc task. We used the preclustered collection to augment the search results of the retrieval engines. For retrieval we used two publicly available libraries: Apache Lucene for text retrieval and LIRE for visual retrieval. The cluster-augmented results had slightly lower precision than the initial runs. While the results we hoped for have not yet been achieved, we note that the task is useful for assessing the validity of the clusters.
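
To make the idea of augmenting search results with a preclustered collection concrete, the following is a minimal sketch of one possible scheme, not the procedure used in the runs. It assumes each image already has a cluster label and appends cluster-mates of the top-ranked hits to the end of the result list. The function name `augment_with_clusters` and the parameters `top_k` and `max_added_per_hit` are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def augment_with_clusters(ranked_ids, cluster_of, top_k=3, max_added_per_hit=2):
    """Append cluster-mates of the top-ranked hits to a ranked result list.

    ranked_ids: image ids returned by a retrieval engine, best first.
    cluster_of: mapping image id -> cluster id (the preclustered collection).
    Both parameters top_k and max_added_per_hit are illustrative assumptions.
    """
    # Invert the mapping so that each cluster lists its member images.
    members = defaultdict(list)
    for doc_id, cluster_id in cluster_of.items():
        members[cluster_id].append(doc_id)

    augmented = list(ranked_ids)
    seen = set(ranked_ids)
    for hit in ranked_ids[:top_k]:
        added = 0
        # Hits without a cluster label simply contribute nothing.
        for mate in members.get(cluster_of.get(hit), []):
            if added >= max_added_per_hit:
                break
            if mate not in seen:
                augmented.append(mate)
                seen.add(mate)
                added += 1
    return augmented


if __name__ == "__main__":
    clusters = {"a": 0, "b": 0, "c": 1, "d": 1, "e": 0}
    print(augment_with_clusters(["a", "c"], clusters))
```

Because the added images are placed after the original hits, this variant leaves the initial ranking untouched; a scheme that interleaves cluster-mates with the original hits would change precision at early ranks.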
