Visual islands: intuitive browsing of visual search results

The amount of available digital multimedia has grown exponentially in recent years. While advances have been made in the indexing and searching of images and videos, less attention has been given to helping users interactively explore large datasets. In this paper, a new framework called visual islands is proposed that reorganizes image query results from an initial search, or even a general photo collection, using a fast, non-global feature projection to compute 2D display coordinates. A prototype system is implemented and evaluated against three core goals: fast browsing, intuitive display, and non-linear exploration. Using the TRECVID2005 [15] dataset, 10 users evaluated these goals over 24 topics. Experiments show that users experience improved comprehensibility and achieve a significant page-level precision improvement with the visual islands framework over traditional paged browsing.

[1]  Xiaodi Huang, et al. Force-Transfer: A New Approach to Removing Overlapping Nodes in Graph Layout, 2003, ACSC.

[2]  Xing Xie, et al. A visual attention model for adapting images on small displays, 2003, Multimedia Systems.

[3]  S. T. Dumais, et al. Using latent semantic analysis to improve access to textual information, 1988, CHI '88.

[4]  Paul Over, et al. Evaluation campaigns and TRECVid, 2006, MIR '06.

[5]  Rong Yan, et al. Exploring the Synergy of Humans and Machines in Extreme Video Retrieval, 2006, CIVR.

[6]  Susan T. Dumais, et al. Using latent semantic analysis to improve information retrieval, 1988, CHI '88.

[7]  Qi Tian, et al. Visualization, Estimation and User-Modeling for Interactive Browsing of Image Libraries, 2002, CIVR.

[8]  Marcel Worring, et al. The Mediamill Semantic Video Search Engine, 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[9]  Erkki Oja, et al. PicSOM-self-organizing image retrieval with MPEG-7 content descriptors, 2002, IEEE Trans. Neural Networks.

[10]  Jun-Cheng Chen, et al. Audiovisual slideshow: present your journey by photos, 2006, MM '06.

[11]  Xiaofei He, et al. Regularized Locality Preserving Projections with Two-Dimensional Discretized Laplacian Smoothing, 2006.

[12]  Marcel Worring, et al. Interactive access to large image collections using similarity-based visualization, 2008, J. Vis. Lang. Comput.

[13]  Dong Xu, et al. Columbia University TRECVID-2006 Video Search and High-Level Feature Extraction, 2006, TRECVID.

[14]  Xing Xie, et al. Effective browsing of web image search results, 2004, MIR '04.

[15]  Marcel Worring, et al. MediaMill: semantic video search using the RotorBrowser, 2007, CIVR '07.

[16]  Shih-Fu Chang, et al. Detecting image near-duplicate by stochastic attributed relational graph matching with learning, 2004, MULTIMEDIA '04.

[17]  Lie Lu, et al. A generic framework of user attention model and its application in video summarization, 2005, IEEE Trans. Multim.

[18]  Shih-Fu Chang, et al. Visual Cue Cluster Construction via Information Bottleneck Principle and Kernel Density Estimation, 2005, CIVR.

[19]  Marcel Worring, et al. MediaMill: Semantic Video Browsing using the RotorBrowser, 2007.