A Framework for content-adaptive photo manipulation macros: Application to face, landscape, and global manipulations

We present a framework for generating content-adaptive macros that can transfer complex photo manipulations to new target images. We demonstrate applications of our framework to face, landscape, and global manipulations. To create a content-adaptive macro, we make use of multiple training demonstrations. Specifically, we use automated image labeling and machine learning techniques to learn the dependencies between image features and the parameters of each selection, brush stroke, and image processing operation in the macro. Although our approach is limited to learning manipulations where there is a direct dependency between image features and operation parameters, we show that our framework is able to learn a large class of the most commonly used manipulations using as few as 20 training demonstrations. Our framework also provides interactive controls to help macro authors and users generate training demonstrations and correct errors due to incorrect labeling or poor parameter estimation. We ask viewers to compare images generated using our content-adaptive macros with and without corrections to manually generated ground-truth images and find that they consistently rate both our automatic and corrected results as close in appearance to the ground truth. We also evaluate the utility of our proposed macro generation workflow via a small informal lab study with professional photographers. The study suggests that our workflow is effective and practical in the context of real-world photo editing.

[1]  Dani Lischinski,et al.  Personalization of image enhancement , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  William T. Freeman,et al.  Search-and-replace editing for personal photo collections , 2010, 2010 IEEE International Conference on Computational Photography (ICCP).

[3]  Takeo Igarashi,et al.  Generating photo manipulation tutorials by demonstration , 2009, ACM Trans. Graph..

[4]  Terence Sim,et al.  Digital face makeup by example , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Shree K. Nayar,et al.  Face swapping: automatically replacing faces in photographs , 2008, ACM Trans. Graph..

[6]  David A. McAllester,et al.  A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Alexei A. Efros,et al.  Image‐based Shaving , 2008, Comput. Graph. Forum.

[8]  Eser Kandogan,et al.  Koala: capture, share, automate, personalize business processes on the web , 2007, CHI.

[9]  Frédo Durand,et al.  Two-scale tone management for photographic look , 2006, ACM Trans. Graph..

[10]  Rob Miller,et al.  Automation and customization of rendered web pages , 2005, UIST.

[11]  Alexei A. Efros,et al.  Geometric context from a single image , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[12]  Barry Huggins,et al.  Photoshop Retouching Cookbook for Digital Photographers (Cookbooks (O'Reilly)) , 2005 .

[13]  Diemo Schwarz,et al.  Current Research in concatenative sound synthesis , 2005, ICMC.

[14]  Tessa A. Lau,et al.  Sheepdog: learning procedures for technical support , 2004, IUI '04.

[15]  Scott Kelby,et al.  The Adobe Photoshop CS4 Book for Digital Photographers , 2003 .

[16]  Daniel Cohen-Or,et al.  Example-based style synthesis , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[17]  Yi Zhou,et al.  Bayesian tangent shape model: estimating shape and pose parameters via Bayesian inference , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[18]  Nuria Oliver,et al.  Curve Analogies , 2002, Rendering Techniques.

[19]  Adam Finkelstein,et al.  WYSIWYG NPR: drawing strokes directly on 3D models , 2002, SIGGRAPH.

[20]  Erik Reinhard,et al.  Color Transfer between Images , 2001, IEEE Computer Graphics and Applications.

[21]  David Salesin,et al.  Image Analogies , 2001, SIGGRAPH.

[22]  Alexei A. Efros,et al.  Image quilting for texture synthesis and transfer , 2001, SIGGRAPH.

[23]  Zicheng Liu,et al.  Expressive expression mapping with ratio images , 2001, SIGGRAPH.

[24]  James M. Rehg,et al.  Statistical Color Models with Application to Skin Detection , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[25]  David D. Lewis,et al.  Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.

[26]  Amir A. Amini,et al.  Snakes and Splines for Tracking Non-Rigid Heart Motion , 1996, ECCV.

[27]  Brad A. Myers,et al.  Pursuit: graphically representing programs in a demonstrational visual shell , 1994, CHI Conference Companion.

[28]  Henry Lieberman,et al.  Watch what I do: programming by demonstration , 1993 .

[29]  Henry Lieberman,et al.  Mondrian: a teachable graphical editor , 1993, INTERCHI.

[30]  Steven K. Feiner,et al.  A history-based macro by example system , 1992, UIST '92.

[31]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[32]  John Hart,et al.  ACM Transactions on Graphics , 2004, SIGGRAPH 2004.

[33]  Gregory Dudek,et al.  Curve Synthesis from Learned Refinement Models , 2003, Eurographics.

[34]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.