Automatic organization of large photo collections

Modern digital photography allows users to capture, store, and share thousands of digital photographs at one time. As a result, simply browsing the photo collection becomes a daunting task. A user must see and deal with every single photograph in the collection. Tasks related to browsing, such as searching for a specific photograph, or choosing a few photographs to share become difficult. Organizing the photographs to exploit this organization is one way to simplify these tasks; a user may take advantage of the organization when carrying out any of the above tasks. Unfortunately organizing the photographs by hand often requires more effort than users will apply. In this dissertation I show how using cues from metadata and image content, large collections of photographs can be organized. The photograph collection is automatically partitioned into a tree of related "events" and then a single photograph for each event can be selected to represent that group. For any given node of the tree, the user is shown only the representative photographs from the children of the node, thus reducing the visual information that they must deal with at any one time. Browsing the photographs is equivalent to traversing the tree. Other interactions with the photograph (e.g. tagging) can be carried out on individual photographs or sub-trees. The methods that I developed were informed by two user studies. The first study shows that representative photographs exist in large collections of photographs, and that humans are able to perform such selection. The second study helps illuminate the process that humans carry out when asked to select a representative photograph. The findings of these user studies helped inform the development of new methods for automatic selection of representative photographs. I present a full implementation of these methods. The implementation allows a user to browse, tag, and search photographs either on a desktop PC or over the World Wide Web.

[1]  Benjamin B. Bederson,et al.  Automatic thumbnail cropping and its effectiveness , 2003, UIST '03.

[2]  H. Charles Romesburg,et al.  Cluster analysis for researchers , 1984 .

[3]  Mor Naaman,et al.  Context data in geo-referenced digital photo collections , 2004, MULTIMEDIA '04.

[4]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[5]  Laura A. Dabbish,et al.  Labeling images with a computer game , 2004, AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors.

[6]  Allan Kuchinsky,et al.  Requirements for photoware , 2002, CSCW '02.

[7]  Bongwon Suh Image Management using Pattern Recognition Systems , 2005 .

[8]  Raghu Ramakrishnan,et al.  Data Modeling and Querying in the PIQ Image DBMS. , 1996 .

[9]  Peter Krogh The DAM Book: Digital Asset Management for Photographers (O'Reilly Digital Studio) , 2005 .

[10]  Benjamin B. Bederson,et al.  PhotoMesa: a zoomable image browser using quantum treemaps and bubblemaps , 2001, UIST '01.

[11]  Robert R. Korfhage,et al.  Visualization of a Document Collection: The VIBE System , 1993, Inf. Process. Manag..

[12]  Ben Shneiderman,et al.  Readings in information visualization - using vision to think , 1999 .

[13]  Vijay V. Raghavan,et al.  Content-Based Image Retrieval Systems - Guest Editors' Introduction , 1995, Computer.

[14]  Michael Gleicher,et al.  Automatic image retargeting with fisheye-view warping , 2005, UIST.

[15]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[16]  Andreas Paepcke,et al.  Time as essence for photo browsing through personal digital libraries , 2002, JCDL '02.

[17]  Shingo Uchihashi,et al.  Video Manga: generating semantically meaningful video summaries , 1999, MULTIMEDIA '99.

[18]  Kerry Rodden,et al.  How do people manage their digital photographs? , 2003, CHI '03.

[19]  Gang Wei,et al.  Face detection for image annotation , 1999, Pattern Recognition Letters.

[20]  Steve Krug,et al.  Don't Make Me Think!: A Common Sense Approach to Web Usability , 2000 .

[21]  Yanfeng Sun,et al.  MiAlbum - a system for home photo managemet using the semi-automatic image annotation approach , 2000, MM 2000.

[22]  HongJiang Zhang,et al.  Contrast-based image attention analysis by using fuzzy growing , 2003, MULTIMEDIA '03.

[23]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  I. Jolliffe Principal Component Analysis , 2002 .

[25]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[26]  Andrew Zisserman,et al.  Multi-view Matching for Unordered Image Sets, or "How Do I Organize My Holiday Snaps?" , 2002, ECCV.

[27]  James Ze Wang,et al.  Real-Time Computerized Annotation of Pictures , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Jianping Fan,et al.  Automatic image annotation by incorporating feature hierarchy and boosting to scale up SVM classifiers , 2006, MM '06.

[29]  Manuel Blum,et al.  Peekaboom: a game for locating objects in images , 2006, CHI.

[30]  Steven M. Drucker,et al.  Photo-triage: Rapidly annotating your digital photographs , 2003 .

[31]  John Adcock,et al.  Simplifying the Management of Large Photo Collections , 2003, INTERACT.

[32]  James Fogarty,et al.  Aesthetic information collages: generating decorative displays that contain information , 2001, UIST '01.

[33]  Andreas Girgensohn,et al.  Temporal event clustering for digital photo collections , 2005, ACM Trans. Multim. Comput. Commun. Appl..

[34]  Andreas E. Savakis,et al.  Automatic image event segmentation and quality screening for albuming applications , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[35]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[36]  Jeffrey Rubin,et al.  Handbook of Usability Testing: How to Plan, Design, and Conduct Effective Tests , 1994 .

[37]  R. Manmatha,et al.  Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[38]  Susan T. Dumais,et al.  Milestones in Time: The Value of Landmarks in Retrieving Information from Personal Stores , 2003, INTERACT.

[39]  W. Tobler A Computer Movie Simulating Urban Growth in the Detroit Region , 1970 .

[40]  Andrew Blake,et al.  Digital tapestry [automatic image synthesis] , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[41]  Wen Wu,et al.  SmartLabel: an object labeling tool using iterated harmonic energy minimization , 2006, MM '06.

[42]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[43]  J. C. Platt AutoAlbum: clustering digital photographs using probabilistic model merging , 2000, 2000 Proceedings Workshop on Content-based Access of Image and Video Libraries.

[44]  Mor Naaman,et al.  Generating summaries and visualization for large collections of geo-referenced photographs , 2006, MIR '06.

[45]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[46]  Irfan A. Essa,et al.  Mediating photo collage authoring , 2005, UIST.

[47]  Mary Czerwinski,et al.  Dance your work away: exploring step user interfaces , 2006, CHI Extended Abstracts.

[48]  Ben Shneiderman,et al.  Direct annotation: a drag-and-drop strategy for labeling photos , 2000, 2000 IEEE Conference on Information Visualization. An International Conference on Computer Visualization and Graphics.

[49]  Shingo Uchihashi,et al.  An interactive comic book presentation for exploring video , 2000, CHI.

[50]  Joachim M. Buhmann,et al.  Empirical evaluation of dissimilarity measures for color and texture , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[51]  Steven M. Drucker,et al.  MediaBrowser: reclaiming the shoebox , 2004, AVI.

[52]  Patrick Baudisch,et al.  Time quilt: scaling up zoomable photo browsers for large, unstructured photo collections , 2005, CHI EA '05.

[53]  Rohini K. Srihari,et al.  Show&Tell: A Semi-Automated Image Annotation System , 2000, IEEE Multim..