Intelligent Portrait Composition Assistance: Integrating Deep-learned Models and Photography Idea Retrieval

Retrieving photography ideas corresponding to a given location facilitates the usage of smart cameras, where there is a high interest among amateurs and enthusiasts to take astonishing photos at anytime and in any location. Existing research captures some aesthetic techniques and retrieves useful feedbacks based on one technique. However, they are restricted to a particular technique and the retrieved results have room to improve as they are confined to the quality of the query. There is a lack of a holistic framework to capture important aspects of a given scene and help a novice photographer by informative feedback to take a better shot in his/her photography adventure. This work proposes an intelligent framework of portrait composition using our deep-learned models and image retrieval methods. A highly-rated web-crawled portrait dataset is exploited for retrieval purposes. Our framework detects and extracts ingredients of a given scene representing as a correlated semantic model. It then matches extracted semantics with the dataset of aesthetically composed photos to investigate a ranked list of photography ideas, and gradually optimizes the human pose and other artistic aspects of the composed scene supposed to be captured. The conducted user study demonstrates that our approach is more helpful than other feedback retrieval systems.

[1]  William T. Freeman,et al.  The patch transform and its applications to image editing , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[3]  Wei-Ying Ma,et al.  Auto cropping for digital photographs , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[4]  Jan P. Allebach,et al.  Feature design for aesthetic inference on photos with faces , 2013, 2013 IEEE International Conference on Image Processing.

[5]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[6]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Yael Pritch,et al.  Shift-map image editing , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[8]  David Salesin,et al.  Gaze-based interaction for semi-automatic photo cropping , 2006, CHI.

[9]  Ligang Liu,et al.  Realtime Aesthetic Image Retargeting , 2010, CAe.

[10]  Andrew Zisserman,et al.  Return of the Devil in the Details: Delving Deep into Convolutional Nets , 2014, BMVC.

[11]  Shehroz S. Khan,et al.  Evaluating visual aesthetics in photographic portraiture , 2012, CAe '12.

[12]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[13]  Adam Finkelstein,et al.  PatchMatch: a randomized correspondence algorithm for structural image editing , 2009, SIGGRAPH 2009.

[14]  Radomír Mech,et al.  Data‐Driven Automatic Cropping Using Semantic Composition Search , 2015, Comput. Graph. Forum.

[15]  Alice Caplier,et al.  Low Level Features for Quality Assessment of Facial Images , 2015, VISAPP.

[16]  Jiebo Luo,et al.  Aesthetics and Emotions in Images , 2011, IEEE Signal Processing Magazine.

[17]  Zihan Zhou,et al.  Detecting Vanishing Points in Natural Scenes with Application in Photo Composition Analysis , 2016, ArXiv.

[18]  Gabriela Csurka,et al.  Assessing the aesthetic quality of photographs using generic image descriptors , 2011, 2011 International Conference on Computer Vision.

[19]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[20]  Alice Caplier,et al.  How to predict the global instantaneous feeling induced by a facial picture? , 2015, Signal Process. Image Commun..

[21]  Fred Stentiford,et al.  Attention Based Auto Image Cropping , 2007, ICVS 2007.

[22]  Mislav Grgic,et al.  Aesthetic quality assessment of headshots , 2013, Proceedings ELMAR-2013.

[23]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  James Ze Wang,et al.  Studying Aesthetics in Photographic Images Using a Computational Approach , 2006, ECCV.

[25]  Naila Murray,et al.  AVA: A large-scale database for aesthetic visual analysis , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Mu Qiao,et al.  OSCAR: On-Site Composition and Aesthetics Feedback Through Exemplars for Photographers , 2012, International Journal of Computer Vision.

[27]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Zihan Zhou,et al.  Detecting Dominant Vanishing Points in Natural Scenes with Application to Composition-Sensitive Image Retrieval , 2016, IEEE Transactions on Multimedia.

[29]  Bernt Schiele,et al.  2D Human Pose Estimation: New Benchmark and State of the Art Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Kok-Lim Low,et al.  Saliency-enhanced image aesthetics class prediction , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[31]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[32]  Yan Ke,et al.  The Design of High-Level Features for Photo Quality Assessment , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[33]  Miriam Redi,et al.  The beauty of capturing faces: Rating the quality of digital portraits , 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[34]  In-So Kweon,et al.  Modeling photo composition and its application to photo re-arrangement , 2012, 2012 19th IEEE International Conference on Image Processing.

[35]  Yanwen Guo,et al.  Improving Photo Composition Elegantly: Considering Image Similarity During Composition Optimization , 2012, Comput. Graph. Forum.

[36]  Stephen Lin,et al.  Learning the Change for Automatic Image Cropping , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[37]  Bolei Zhou,et al.  Scene Parsing through ADE20K Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Xiaoou Tang,et al.  Photo and Video Quality Evaluation: Focusing on the Subject , 2008, ECCV.

[39]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[40]  Joani Mitro Content-based image retrieval tutorial , 2016, ArXiv.

[41]  Zihan Zhou,et al.  Discovering Triangles in Portraits for Supporting Photographic Creation , 2017, IEEE Transactions on Multimedia.

[42]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[43]  Mubarak Shah,et al.  A holistic approach to aesthetic enhancement of photographs , 2011, TOMCCAP.

[44]  Roberto Valenzuela Picture Perfect Posing: Practicing the Art of Posing for Photographers and Models , 2014 .

[45]  Benjamin B. Bederson,et al.  Automatic thumbnail cropping and its effectiveness , 2003, UIST '03.

[46]  Aditi Majumder,et al.  Seam carving based aesthetics enhancement for photos , 2015, Signal Process. Image Commun..

[47]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48]  Ming-Syan Chen,et al.  R2P: Recomposition and Retargeting of Photographic Images , 2015, ACM Multimedia.

[49]  Ronald Azuma,et al.  Real-time Guidance Camera Interface to Enhance Photo Aesthetic Quality , 2015, CHI.

[50]  Chong-Wah Ngo,et al.  Deep-based Ingredient Recognition for Cooking Recipe Retrieval , 2016, ACM Multimedia.

[51]  Mubarak Shah,et al.  A framework for photo-quality assessment and enhancement based on visual aesthetics , 2010, ACM Multimedia.

[52]  Alice Caplier,et al.  Photo rating of facial pictures based on image segmentation , 2014, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).