Paper Information - Creative and high-quality image composition based on a new criterion

Wavelet pyramids and feature handling were used to achieve high-quality, multi-scale compositions. A new criterion was used to ensure that the composite results are semantically valid.

Image compositing techniques are primarily used to produce realistic composite results. Existing methods such as gradient-domain compositing and alpha matting are widely used in computer vision and can typically achieve realistic results, especially seamless boundaries. However, when the candidate composite images and the target images differ noticeably in color, texture, or brightness, the composite results are unrealistic and inconsistent. In addition, traditional compositing methods focus on basic feature matching and ignore semantic rationality during composition, so quite a few of them generate composite results that are not semantically plausible.

This paper presents a new multi-scale image composition method. In the composition process, a wavelet pyramid and basic feature handling are used to achieve multi-scale compositions. More importantly, a new criterion based on the semantic rationality of images is established to ensure that the composite images are semantically valid. A large database was created to facilitate experimentation. The experiments showed that the proposed method produced better results than traditional composition methods: the composite results were not only consistent and seamless, but also semantically valid.
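The abstract does not spell out how the wavelet pyramid is built or how the coefficients are combined, so the following is only a minimal sketch of the general idea of multi-scale composition. It assumes a Haar wavelet, works on a single image row rather than a full 2-D image, and blends coefficients with a mask that is averaged down to each level. The function names (`haar_decompose`, `haar_reconstruct`, `mask_pyramid`, `blend`) are hypothetical, and the sketch does not attempt to model the paper's feature handling or its semantic criterion.

```python
def haar_decompose(signal, levels):
    """Return (coarsest approximation, details ordered finest to coarsest)."""
    approx = list(signal)
    details = []
    for _ in range(levels):
        pairs = list(zip(approx[0::2], approx[1::2]))
        details.append([(a - b) / 2 for a, b in pairs])  # high-pass part
        approx = [(a + b) / 2 for a, b in pairs]         # low-pass part
    return approx, details


def haar_reconstruct(approx, details):
    """Invert haar_decompose, rebuilding from the coarsest level upward."""
    signal = list(approx)
    for detail in reversed(details):
        rebuilt = []
        for a, d in zip(signal, detail):
            rebuilt.extend((a + d, a - d))
        signal = rebuilt
    return signal


def mask_pyramid(mask, levels):
    """Average-pool the mask so every pyramid level has matching weights."""
    pyramid = []
    current = list(mask)
    for _ in range(levels):
        current = [(a + b) / 2 for a, b in zip(current[0::2], current[1::2])]
        pyramid.append(current)
    return pyramid


def mix(source, target, weights):
    """Per-coefficient linear blend: weight 1 keeps source, 0 keeps target."""
    return [w * s + (1 - w) * t for s, t, w in zip(source, target, weights)]


def blend(source, target, mask, levels=2):
    """Blend two equally sized rows in the Haar wavelet domain (assumed scheme)."""
    if not (len(source) == len(target) == len(mask)):
        raise ValueError("source, target and mask must have the same length")
    if len(source) % (2 ** levels) != 0:
        raise ValueError("length must be divisible by 2 ** levels")
    src_approx, src_details = haar_decompose(source, levels)
    tgt_approx, tgt_details = haar_decompose(target, levels)
    weights = mask_pyramid(mask, levels)
    details = [mix(s, t, w) for s, t, w in zip(src_details, tgt_details, weights)]
    approx = mix(src_approx, tgt_approx, weights[-1])
    return haar_reconstruct(approx, details)
```

Calling `blend(source_row, target_row, mask_row)` with a mask of ones returns the source row and a mask of zeros returns the target row; intermediate mask values mix the two at every scale. Blending coarse and fine coefficients separately is one common way to keep large-scale color and brightness transitions smooth while preserving fine texture, which is the kind of inconsistency the abstract says single-scale methods struggle with.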
