Visual search in dynamic 3D visualisations of unstructured picture collections

We present two empirical studies of visual search in dynamic 3D visualisations of large, randomly ordered, photo collections. The aim is to assess the possible effects of geometrical distortions on visual search effectiveness, efficiency and comfort, by comparing the influence of two perspective representations of photo collections on participants' performance results and subjective judgments. Thumbnails of the 1000 or so photographs in each collection are plastered on the lateral surface of a vertical cylinder, either on the inside (inner view, IV) or on the outside (outer view, OV). IV and OV suggest two different interaction metaphors: locomotion in a virtual space (IV) versus manipulation of a virtual object (OV). They also implement different perspective distortions: enlargement and distortion of lateral columns (IV) versus enlargement of central columns and dwindling plus distortion of lateral columns (OV). Presentation of results focus on the second study, S2, which involved 20 participants and offered them strictly identical interaction facilities with the two views, unlike the initial pilot study, S1 (8 participants and slightly different interaction facilities between the two views). Participants in both studies were experienced computer users (average age: 25.15years, SD: 3.13). They performed two types of basic visual tasks that are carried out repeatedly while navigating photo collections: (i) searching for a photo meeting specific, visual and thematic, criteria, the photo and its location in the collection being unknown to participants (ST1) and (ii) looking for a visually familiar photo, the location of the photo being familiar to participants (ST2). According to post-experiment questionnaires and debriefings, all participants in S2 save one judged both 3D views positively in reference to standard 2D visualisations. Half of them preferred IV over OV, four appreciated OV better, and six expressed no clear opinion. Preferences were mainly motivated by the effects of perspective distortions on thumbnail visibility. They were barely influenced by interaction metaphors (e.g., the feeling of immersion induced by IV). Despite large inter-individual differences in performance, a majority of participants carried out ST1 tasks more effectively and efficiently with IV than with OV, as regards error rates (statistically significant difference) and search times (tendency). Performance results for ST2 tasks were similar with the two views, due, probably, to the simplicity and brevity of ST2 tasks. Perspective distortions seem to have exerted less influence on participants' visual strategies than horizontal scrolling, a dynamic feature common to both views. Qualitative analyses of participants' behaviours suggest that IV has the potential to support spatial memory better than OV, presumably thanks to the locomotion metaphor. These results indicate that perspective views have the potential to facilitate and improve visual search in unstructured picture collections provided that distortions are adapted to users' individual visual capabilities. Further research is needed to better understand: (i) the actual relations between visual exploration strategies and geometrical properties of perspective visualisations and (ii) the influence of the manipulation and locomotion metaphors on spatial memory. This knowledge is necessary to further improve the comfort and effectiveness of visual search in large unstructured picture collections, using 3D visualisations.

[1]  Ben Shneiderman,et al.  Image-Browser Taxonomy and Guidelines for Designers , 2003 .

[2]  Eric Lecolinet,et al.  Browsing Zoomable Treemaps: Structure-Aware Multi-Scale Navigation Techniques , 2007, IEEE Transactions on Visualization and Computer Graphics.

[3]  Matt Jones,et al.  Realistic books: a bizarre homage to an obsolete medium? , 2004, JCDL.

[4]  John A. Waterworth,et al.  Effects of desktop 3D world design on user navigation and search performance , 2000, 2000 IEEE Conference on Information Visualization. An International Conference on Computer Visualization and Graphics.

[5]  A. Oliva,et al.  From Blobs to Boundary Edges: Evidence for Time- and Spatial-Scale-Dependent Scene Recognition , 1994 .

[6]  Suzanne Kieffer,et al.  Effects of Display Layout on Gaze Activity During Visual Search , 2005, INTERACT.

[7]  Jeremiah Scholl,et al.  Extending Tree-Maps to Three Dimensions: A Comparative Study , 2004, APCHI.

[8]  Benjamin B. Bederson,et al.  A review of overview+detail, zooming, and focus+context interfaces , 2009, CSUR.

[9]  G. W. Furnas,et al.  Generalized fisheye views , 1986, CHI '86.

[10]  Benjamin B. Bederson,et al.  PhotoMesa: a zoomable image browser using quantum treemaps and bubblemaps , 2001, UIST '01.

[11]  Benjamin B. Bederson,et al.  Fisheye menus , 2000, UIST '00.

[12]  Dylan M. Jones,et al.  Navigating Buildings in "Desk-Top" Virtual Environments: Experimental Investigations Using Extended Navigational Experience , 1997 .

[13]  Jock D. Mackinlay,et al.  The perspective wall: detail and context smoothly integrated , 1991, CHI.

[14]  Weimin Mou,et al.  Roles of egocentric and allocentric spatial representations in locomotion and reorientation. , 2006, Journal of experimental psychology. Learning, memory, and cognition.

[15]  Jan Wielemaker,et al.  Ontology-Based Annotation , 2001 .

[16]  Marin Ferecatu,et al.  Semantic interactive image retrieval combining visual and conceptual content description , 2007, Multimedia Systems.

[17]  Jean-Daniel Fekete,et al.  MatrixExplorer: a Dual-Representation System to Explore Social Networks , 2006, IEEE Transactions on Visualization and Computer Graphics.

[18]  Mark D. Apperley,et al.  A review and taxonomy of distortion-oriented presentation techniques , 1994, TCHI.

[19]  Alycia M. Hund,et al.  Developmental Changes in Young Children's Spatial Memory and Language in Relation to Landmarks , 2008 .

[20]  Ben Shneiderman,et al.  Immediate usability: a case study of public access design for a community photo library , 2004, Interact. Comput..

[21]  David Pye,et al.  SHOEBOX: A DIGITAL PHOTO MANAGEMENT SYSTEM , 2000 .

[22]  Ioana M. Boier-Martin,et al.  Visualization Viewpoints , 2000 .

[23]  Ben Shneiderman,et al.  LifeLines: visualizing personal histories , 1996, CHI.

[24]  Shih-Fu Chang,et al.  Visual information retrieval from large distributed online repositories , 1997, CACM.

[25]  Qi Tian,et al.  Visualization and User-Modeling for Browsing Personal Photo Libraries , 2004, International Journal of Computer Vision.

[26]  D. Bainbridge,et al.  Realistic books: a bizarre homage to an obsolete medium? , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[27]  Ben Shneiderman,et al.  Ordered and quantum treemaps: Making effective use of 2D space to display hierarchies , 2002, TOGS.

[28]  Patrick Baudisch,et al.  Keeping things in context: a comparative evaluation of focus plus context screens, overviews, and zooming , 2002, CHI.

[29]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[30]  Andy Cockburn,et al.  An Evaluation of Cone Trees , 2000, BCS HCI.

[31]  Andy Cockburn,et al.  Evaluating the effectiveness of spatial memory in 2D and 3D physical and virtual environments , 2002, CHI.

[32]  George G. Robertson,et al.  The WebBook and the Web Forager: video use scenarios for a World-Wide Web information workspace , 1996, CHI 1996.

[33]  Mary Czerwinski,et al.  The Contribution of Thumbnail Image, Mouse-over Text and Spatial Location Memory to Web Page Retrieval in 3D , 1999, INTERACT.

[34]  Jock D. Mackinlay,et al.  3Book: a scalable 3D virtual book , 2004, CHI EA '04.

[35]  Ben Shneiderman,et al.  Tree-maps: a space-filling approach to the visualization of hierarchical information structures , 1991, Proceeding Visualization '91.

[36]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[37]  Stuart K. Card,et al.  The effect of information scent on searching information: visualizations of large tree structures , 2000, AVI '00.

[38]  Mark Witkowski,et al.  A comparison of static and moving presentation modes for image collections , 2006, AVI '06.

[39]  Allen Avner,et al.  Interacting with computer users: Design considerations , 1980 .

[40]  Michael G. Christel Establishing the utility of non-text search for news video retrieval with real world users , 2007, ACM Multimedia.

[41]  Jussi Myllymaki,et al.  Visual exploration of large data sets , 1996, Electronic Imaging.

[42]  Olivier Chapuis,et al.  Quantifying degree of goal directedness in document navigation: application to the evaluation of the perspective-drag technique , 2007, CHI.

[43]  Ben Shneiderman,et al.  Meaningful presentations of photo libraries: rationale and applications of bi-level radial quantum layouts , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[44]  Andy Cockburn,et al.  Evaluating spatial memory in two and three dimensions , 2004, Int. J. Hum. Comput. Stud..

[45]  Thomas M. Mann,et al.  Visual information retrieval for the web , 2001 .

[46]  Kerry Rodden,et al.  How do people manage their digital photographs? , 2003, CHI '03.

[47]  David Pye,et al.  Managing photos with ATT shoebox. , 2000, SIGIR 2000.

[48]  Pearl Pu,et al.  An evaluation of semantic fisheye views for opportunistic search in an annotated image collection , 2005, International Journal on Digital Libraries.

[49]  George G. Robertson,et al.  The WebBook and the Web Forager: an information workspace for the World-Wide Web , 1996, CHI.

[50]  Mary Czerwinski,et al.  Data mountain: using spatial memory for document management , 1998, UIST '98.

[51]  Jérôme Dupire,et al.  Experiments Towards 3D Immersive Interaction for Digital Libraries , 2006, Edutainment.

[52]  Jitendra Malik,et al.  Image Retrieval and Classification Using Local Distance Functions , 2006, NIPS.

[53]  Ben Shneiderman,et al.  Visualization methods for personal photo collections: browsing and searching in the PhotoFinder , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[54]  Christopher Fry,et al.  On your marks, get set, browse! , 1997, CHI Extended Abstracts.

[55]  Ben Shneiderman,et al.  A photo history of SIGCHI: evolution of design from personal to public , 2002, INTR.

[56]  Benjamin B. Bederson,et al.  Pocket PhotoMesa: a Zoomable image browser for PDAs , 2004, MUM '04.

[57]  Sally Jo Cunningham,et al.  Navigating the virtual library: A three-dimensional browsing interface for information retrieval , 2007 .

[58]  Alexandre Topol,et al.  Binding browsing and reading activities in a 3D digital library , 2002, JCDL '02.

[59]  Steven M. Drucker,et al.  MediaBrowser: reclaiming the shoebox , 2004, AVI.

[60]  Patrick Baudisch,et al.  Time quilt: scaling up zoomable photo browsers for large, unstructured photo collections , 2005, CHI EA '05.

[61]  David Sinclair,et al.  Managing photos with AT&T Shoebox (demonstration session) , 2000, SIGIR '00.

[62]  Ramana Rao,et al.  A focus+context technique based on hyperbolic geometry for visualizing large hierarchies , 1995, CHI '95.

[63]  Ben Shneiderman,et al.  Why Not Make Interfaces Better than 3D Reality? , 2003, IEEE Computer Graphics and Applications.

[64]  Mary Czerwinski,et al.  The Task Gallery: a 3D window manager , 2000, CHI.

[65]  Mats Lind,et al.  2D vs 3D, implications on spatial memory , 2001, IEEE Symposium on Information Visualization, 2001. INFOVIS 2001..

[66]  Catherine Plaisant,et al.  The challenge of information visualization evaluation , 2004, AVI.

[67]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[68]  Michael L. Creech,et al.  FotoFile: a consumer multimedia organization and retrieval system , 1999, CHI '99.

[69]  Norman G. Vinson,et al.  Design guidelines for landmarks to support navigation in virtual environments , 1999, CHI '99.

[70]  C C Presson,et al.  The development of landmarks in spatial memory: the role of differential experience. , 1987, Journal of experimental child psychology.

[71]  Ramana Rao,et al.  The table lens: merging graphical and symbolic representations in an interactive focus + context visualization for tabular information , 1994, CHI '94.

[72]  Andrian Marcus,et al.  3D representations for software visualization , 2003, SoftVis '03.

[73]  Jock D. Mackinlay,et al.  Cone Trees: animated 3D visualizations of hierarchical information , 1991, CHI.

[74]  Victoria Interrante,et al.  Visualization viewpoints , 2003 .

[75]  Ben Shneiderman,et al.  Tree visualization with tree-maps: 2-d space-filling approach , 1992, TOGS.

[76]  Benjamin B. Bederson,et al.  Does zooming improve image browsing? , 1999, DL '99.

[77]  Bob J. Wielinga,et al.  Ontology-Based Photo Annotation , 2001, IEEE Intell. Syst..

[78]  Brenda K. Wiederhold,et al.  A Review of Virtual Reality as a Psychotherapeutic Tool , 1998, Cyberpsychology Behav. Soc. Netw..