Interaction Design for Mobile Visual Search

Mobile devices are becoming ubiquitous. People take pictures with their phone cameras to explore the world on the go, and in many cases they want information related to what they photograph. Understanding the user intent conveyed by those pictures therefore becomes important. Existing mobile applications employ visual search to connect the captured picture with the physical world. However, they achieve only limited success because user intent in a picture is inherently ambiguous: one picture usually contains multiple objects. By taking advantage of multitouch interactions on mobile devices, this paper presents a prototype of interactive mobile visual search, named TapTell, that helps users formulate their visual intent more conveniently. This kind of search leverages limited yet natural user interactions on the phone to achieve more effective visual search while maintaining a satisfying user experience. We make three contributions in this work. First, we conduct a focus study on usage patterns and the factors users care about in mobile visual search, which in turn leads to an interaction design for expressing visual intent by gesture. Second, we introduce four modes of gesture-based interaction (crop, line, lasso, and tap) and develop a mobile prototype. Third, we perform an in-depth usability evaluation of these modes, which demonstrates the advantage of interaction and shows that lasso is the most natural and effective mode. We show that TapTell provides a natural user experience for exploring the world with the phone camera and gestures. Based on these observations and conclusions, we also suggest design principles for future interactive mobile visual search.
