Beyond 2D-grids: a dependence maximization view on image browsing

Ideally, one would like to perform image search using an intuitive and friendly approach. Many existing image search engines, however, present users with sets of images arranged in some default order on the screen, typically the relevance to a query, only. While this certainly has its advantages, arguably, a more flexible and intuitive way would be to sort images into arbitrary structures such as grids, hierarchies, or spheres so that images that are visually or semantically alike are placed together. This paper focuses on designing such a navigation system for image browsers. This is a challenging task because arbitrary layout structure makes it difficult -- if not impossible -- to compute cross-similarities between images and structure coordinates, the main ingredient of traditional layouting approaches. For this reason, we resort to a recently developed machine learning technique: kernelized sorting. It is a general technique for matching pairs of objects from different domains without requiring cross-domain similarity measures and hence elegantly allows sorting images into arbitrary structures. Moreover, we extend it so that some images can be preselected for instance forming the tip of the hierarchy allowing to subsequently navigate through the search results in the lower levels in an intuitive way.

[1]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[2]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[3]  T. J. Jankun-Kelly,et al.  MoireGraphs: radial focus+context visualization and interaction for graphs with visual nodes , 2003, IEEE Symposium on Information Visualization 2003 (IEEE Cat. No.03TH8714).

[4]  Vincent Lepetit,et al.  A fast local descriptor for dense matching , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Le Song,et al.  Kernelized Sorting , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Steven F. Roth,et al.  Data characterization for intelligent graphics presentation , 1990, CHI '90.

[8]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[10]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[11]  Mingjing Li,et al.  Color texture moments for content-based image retrieval , 2002, Proceedings. International Conference on Image Processing.

[12]  Kentaro Toyama,et al.  Geographic location tags on digital images , 2003, ACM Multimedia.

[13]  Marcel Worring,et al.  Balancing thread based navigation for targeted video search , 2008, CIVR '08.

[14]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[15]  Marcel Worring,et al.  Interactive access to large image collections using similarity-based visualization , 2008, J. Vis. Lang. Comput..

[16]  Marcel Worring,et al.  Query on demand video browsing , 2007, ACM Multimedia.

[17]  Kerry Rodden,et al.  Does organisation by similarity assist image browsing? , 2001, CHI.

[18]  Geoffrey E. Hinton,et al.  Stochastic Neighbor Embedding , 2002, NIPS.

[19]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Benjamin B. Bederson PhotoMesa: A Zoomable Image Browser Using Quantum Treemaps and Bubblemaps , 2003 .

[21]  Ben Shneiderman,et al.  Meaningful presentations of photo libraries: rationale and applications of bi-level radial quantum layouts , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[22]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[23]  H. Intraub,et al.  Boundary Extension for Briefly Glimpsed Photographs: Do Common Perceptual Processes Result in Unexpected Memory Distortions? , 1996 .

[24]  Desney S. Tan,et al.  Designing Novel Image Search Interfaces by Understanding Unique Characteristics and Usage , 2009, INTERACT.

[25]  Xing Xie,et al.  Effective browsing of web image search results , 2004, MIR '04.

[26]  Jianping Fan,et al.  Semantic Image Browser: Bridging Information Visualization with Automated Intelligent Image Analysis , 2006, 2006 IEEE Symposium On Visual Analytics Science And Technology.

[27]  Kilian Q. Weinberger,et al.  An Introduction to Nonlinear Dimensionality Reduction by Maximum Variance Unfolding , 2006, AAAI.

[28]  Beng Chin Ooi,et al.  Giving meanings to WWW images , 2000, ACM Multimedia.

[29]  Patrick Baudisch,et al.  Time quilt: scaling up zoomable photo browsers for large, unstructured photo collections , 2005, CHI EA '05.

[30]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[31]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[32]  Le Song,et al.  A Hilbert Space Embedding for Distributions , 2007, Discovery Science.

[33]  Ricardo da Silva Torres,et al.  Visual structures for image browsing , 2003, CIKM '03.