The aim of this document is to describe the methods we used in the Medical Image Modality Classification and Ad-hoc Image Retrieval Tasks of ImageCLEF 2011. The main novelty in medical image modality classification this year was that there were more classes (18 modalities), organized in a hierarchy, and that for some categories only a few annotated examples were available. Therefore, our strategy for image categorization was to use a semi-supervised approach. In our experiments, we investigated mono-modal (text and image) and mixed-modality classification. The image classification was based on Fisher Vectors built on SIFT-like local orientation histograms and local color statistics. For text, we used a binarized bag-of-words representation in which each element indicated whether or not the term appeared in the image caption. In the case of multi-modal classification, we simply averaged the text and image classification scores. For the ad-hoc retrieval task, we used the image captions for text retrieval and Fisher Vectors for visual similarity and modality detection. Our text runs were based on a late fusion of different state-of-the-art text experts and the Lexical Entailment model. This Lexical Entailment model used last year's articles to compute similarities between terms, and it ranked first in the previous challenge. Concerning the submitted runs, we realized that we had inadvertently forgotten to submit our best run from last year [3]. Nor did we submit the improvement over this run that was proposed in [6]. Overall, this explains the medium performance of our submitted runs. In this document, we show that our system from last year and its improvements would have achieved top performance. We have not tuned the parameters of this system for this year's task; we have only evaluated the runs we did not submit.
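The binarized bag-of-words representation and the score averaging described above can be illustrated with a minimal sketch. It is not the system's actual implementation: the function names, the toy vocabulary, the tokenization rule, and the example scores are all assumptions chosen for illustration.

```python
import re


def binary_bow(caption, vocabulary):
    """Return a 0/1 vector: 1 if the vocabulary term occurs in the caption.

    Tokenization (lowercased alphanumeric runs) is an assumption made for
    this sketch, not the preprocessing used in the paper.
    """
    tokens = set(re.findall(r"[a-z0-9]+", caption.lower()))
    return [1 if term in tokens else 0 for term in vocabulary]


def average_fusion(text_scores, image_scores):
    """Late fusion: average the per-class scores of the two classifiers."""
    if len(text_scores) != len(image_scores):
        raise ValueError("both classifiers must score the same classes")
    return [(t + v) / 2.0 for t, v in zip(text_scores, image_scores)]


# Hypothetical vocabulary and caption.
vocabulary = ["ct", "mri", "xray", "ultrasound"]
vec = binary_bow("Axial CT scan of the chest; CT shows a nodule.", vocabulary)

# Hypothetical per-class scores from a text and an image classifier.
text_scores = [0.75, 0.25, 0.0]
image_scores = [0.25, 0.25, 0.5]
fused = average_fusion(text_scores, image_scores)

# The predicted class is the index with the highest fused score.
predicted = max(range(len(fused)), key=fused.__getitem__)
```

Note that a repeated term ("CT" appears twice in the caption) still yields a single 1 in the vector, which is what distinguishes the binarized representation from a term-count representation.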
[1] John D. Lafferty et al. A study of smoothing methods for language models applied to Ad Hoc information retrieval. SIGIR '01, 2001.
[2] Henning Müller et al. Overview of the CLEF 2009 Medical Image Retrieval Track. CLEF, 2009.
[3] Gabriela Csurka et al. Semantic combination of textual and visual information in multimedia retrieval. ICMR, 2011.
[4] Gabriela Csurka et al. Leveraging Image, Text and Cross-media Similarities for Diversity-focused Multimedia Retrieval. ImageCLEF, 2010.
[5] Éric Gaussier et al. Information-based models for ad hoc IR. SIGIR '10, 2010.
[6] Florent Perronnin et al. Fisher Kernels on Visual Vocabularies for Image Categorization. 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
[7] Gabriela Csurka et al. XRCE's Participation in Wikipedia Retrieval, Medical Image Modality Classification and Ad-hoc Retrieval Tasks of ImageCLEF 2010. CLEF, 2010.
[8] Lawrence Carin et al. Sparse multinomial logistic regression: fast algorithms and generalization bounds. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005.
[9] Éric Gaussier et al. Lexical Entailment for Information Retrieval. ECIR, 2006.
[10] Gabriela Csurka et al. XRCE's Participation at Wikipedia Retrieval of ImageCLEF 2011. CLEF, 2011.
[11] Gabriela Csurka et al. Medical image modality classification and retrieval. 2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI), 2011.