This paper investigates whether Web comments can be exploited for cross-media retrieval. Comparing Web items such as texts, images, videos, music, products, or personal profiles can be done at various levels of detail; our focus is on topic similarity. We propose to compare user-supplied comments on Web items in lieu of the commented items themselves. If this approach is feasible, the task of extracting and mapping features between arbitrary pairs of item types can be circumvented, and well-known text retrieval models can be applied instead - given that comments are available. We report on results of a preliminary, but nonetheless large-scale experiment which shows that, if comments on textual items are compared with comments on video items, topically similar pairs achieve a sufficiently high cross-media similarity.
[1]
Nicu Sebe,et al.
Content-based multimedia information retrieval: State of the art and challenges
,
2006,
TOMCCAP.
[2]
Gilad Mishne,et al.
Leave a Reply: An Analysis of Weblog Comments
,
2006
.
[3]
Martin Potthast,et al.
Measuring the descriptiveness of web comments
,
2009,
SIGIR.
[4]
Marie-Francine Moens,et al.
Finding the Best Picture: Cross-Media Retrieval of Content
,
2008,
ECIR.
[5]
Ophir Frieder,et al.
Are Web User Comments Useful for Search?
,
2009,
LSDS-IR@SIGIR.
[6]
Steffen Becker,et al.
Opinion Summarization of Web Comments
,
2010,
ECIR.
[7]
Masashi Inoue.
On the need for annotation-based image retrieval
,
2004
.