Video enhancement: content classification and model selection

The purpose of video enhancement is to improve subjective picture quality. The field covers a broad range of research topics, such as removing noise, highlighting specified features, and improving the appearance or visibility of the video content. The common difficulty is how to make images or videos subjectively better. Traditional approaches iterate many times between subjective assessment experiments and algorithm redesigns, which is very time consuming. Researchers have attempted to design a video quality metric to replace subjective assessment, but so far without success. As a way to avoid heuristics in enhancement algorithm design, least mean square methods have received considerable attention. They optimize filter coefficients automatically by minimizing, over a training set, the difference between processed videos and their desired versions. However, these methods are optimal only on average, not locally. To solve this problem, one can apply the least mean square optimization separately to individual categories that are classified by local image content. The most interesting example is Kondo’s concept of local content adaptivity for image interpolation, which we found could be generalized into an ideal framework for content-adaptive video processing. We identify two parts in the concept: content classification and adaptive processing. By exploring new classifiers for the content classification and new models for the adaptive processing, we have generalized the framework to more enhancement applications. For content classification, new classifiers have been proposed to classify different image degradations such as coding artifacts and focal blur. For coding artifacts, a novel classifier has been proposed based on the combination of local structure and contrast, which does not require coding block grid detection.
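The class-wise least mean square training described above can be sketched as follows. This is a minimal one-dimensional illustration under stated assumptions, not the implementation used in the thesis: the 1-bit ADRC-style classifier, the three-tap aperture, the small ridge term, and all function names (`adrc_class`, `train_class_filters`, `apply_class_filters`, `mse`) are hypothetical choices made for brevity.

```python
import math
import random


def adrc_class(window):
    """Assumed classifier: 1 bit per sample, set if above the window mid-range."""
    lo, hi = min(window), max(window)
    mid = (lo + hi) / 2.0
    code = 0
    for value in window:
        code = (code << 1) | (1 if value > mid else 0)
    return code


def solve(matrix, rhs):
    """Solve a small linear system by Gauss-Jordan elimination with pivoting."""
    n = len(rhs)
    aug = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def train_class_filters(degraded, reference, taps=3, ridge=1e-6):
    """Accumulate normal equations per class and solve them for the coefficients."""
    half = taps // 2
    ata, atb = {}, {}
    for i in range(half, len(degraded) - half):
        window = degraded[i - half:i + half + 1]
        c = adrc_class(window)
        if c not in ata:
            # The ridge term (an assumption) keeps sparsely populated classes solvable.
            ata[c] = [[ridge if r == k else 0.0 for k in range(taps)]
                      for r in range(taps)]
            atb[c] = [0.0] * taps
        for r in range(taps):
            atb[c][r] += window[r] * reference[i]
            for k in range(taps):
                ata[c][r][k] += window[r] * window[k]
    return {c: solve(ata[c], atb[c]) for c in ata}


def apply_class_filters(degraded, filters, taps=3):
    """Filter each sample with the coefficients trained for its content class."""
    half = taps // 2
    out = list(degraded)  # border samples and unseen classes pass through
    for i in range(half, len(degraded) - half):
        window = degraded[i - half:i + half + 1]
        coeffs = filters.get(adrc_class(window))
        if coeffs is not None:
            out[i] = sum(c * v for c, v in zip(coeffs, window))
    return out


def mse(a, b):
    """Mean squared error between two equally long sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


if __name__ == "__main__":
    random.seed(0)
    reference = [math.sin(i / 7.0) for i in range(300)]
    degraded = [v + random.gauss(0.0, 0.2) for v in reference]
    filters = train_class_filters(degraded, reference)
    restored = apply_class_filters(degraded, filters)
    print("MSE before:", mse(degraded, reference))
    print("MSE after: ", mse(restored, reference))
```

Each class gets its own coefficient set, so the filter can smooth flat regions and preserve edges. A single globally trained filter cannot do both, which is the sense in which the global optimum is only optimal on average.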
For focal blur, we have proposed a novel edge-based local blur estimation method, which does not require edge orientation detection and gives more robust blur estimates. With these classifiers, the proposed framework has been extended to coding-artifact-robust enhancement and blur-dependent enhancement. As content adaptivity is extended to more image features, the number of content classes can increase significantly. We show that it is possible to reduce the number of classes without sacrificing much performance. For model selection, we have introduced several nonlinear filters into the proposed framework. We have also proposed a new type of nonlinear filter, the trained bilateral filter, which combines the advantages of the original bilateral filter and the least mean square optimization. With these nonlinear filters, the proposed framework shows better performance than with linear filters. Furthermore, we have shown a proof of concept for a trained approach to contrast enhancement through supervised learning, in which the transfer curves are optimized based on the classification of global or local image content. It shows that the desired effect can be obtained by learning from other, computationally expensive enhancement algorithms or from expert-tuned examples. Looking back, the thesis reveals a single versatile framework for video enhancement applications. It widens the application scope by including new content classifiers and new processing models, and it offers scalability through solutions that reduce the number of classes, which can greatly accelerate algorithm design.
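For orientation, the original bilateral filter that the trained variant builds on can be sketched in one dimension as below. This shows only the classic, untrained filter with Gaussian spatial and range weights. It is not the trained bilateral filter proposed in the thesis, and the function name `bilateral_1d` and the default parameter values are assumptions.

```python
import math


def bilateral_1d(signal, radius=2, sigma_s=1.5, sigma_r=0.1):
    """Classic bilateral filter on a 1-D signal (illustrative parameters)."""
    n = len(signal)
    out = []
    for i in range(n):
        weighted_sum = 0.0
        weight_total = 0.0
        for j in range(max(0, i - radius), min(n, i + radius + 1)):
            # Spatial term penalizes distance; range term penalizes
            # intensity difference, so samples across an edge barely count.
            spatial = ((j - i) ** 2) / (2.0 * sigma_s ** 2)
            tonal = ((signal[j] - signal[i]) ** 2) / (2.0 * sigma_r ** 2)
            weight = math.exp(-spatial - tonal)
            weighted_sum += weight * signal[j]
            weight_total += weight
        out.append(weighted_sum / weight_total)
    return out


if __name__ == "__main__":
    step = [0.0] * 5 + [1.0] * 5
    print(bilateral_1d(step))
```

Here the weights follow a fixed analytic form set by `sigma_s` and `sigma_r`. The abstract describes the trained variant as combining the advantages of this filter with least mean square optimization.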
