Directional multiresolution image representations

Efficient representation of visual information lies at the foundation of many image processing tasks, including compression, filtering, and feature extraction. Efficiency of a representation refers to the ability to capture significant information of an object of interest in a small description. For practical applications, this representation has to be realized by structured transforms and fast algorithms. Recently, it has become evident that commonly used separable transforms (such as wavelets) are not necessarily best suited for images. Thus, there is a strong motivation to search for more powerful schemes that can capture the intrinsic geometrical structure of pictorial information. This thesis focuses on the development of new "true" two-dimensional representations for images. The emphasis is on the discrete framework that can lead to algorithmic implementations. The first method constructs multiresolution, local and directional image expansions by using non-separable filter banks. This discrete transform is developed in connection with the continuous-space curvelet construction in harmonic analysis. As a result, the proposed transform provides an efficient representation for two-dimensional piecewise smooth signals that resemble images. The link between the developed filter banks and the continuous-space constructions is set up in a newly defined directional multiresolution analysis. The second method constructs a new family of block directional and orthonormal transforms based on the ridgelet idea, and thus offers an efficient representation for images that are smooth away from straight edges. Finally, directional multiresolution image representations are employed together with statistical modeling, leading to powerful texture models and successful image retrieval systems.

[1]  R. Duffin,et al.  A class of nonharmonic Fourier series , 1952 .

[2]  C. A. Rogers,et al.  An Introduction to the Geometry of Numbers , 1959 .

[3]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[4]  Robert S. Shankland,et al.  Handbook of Mathematical Tables , 1963 .

[5]  Hideyuki Tamura,et al.  Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[6]  Vilius Ivanauskas Conferences , 1979 .

[7]  J. Daugman Two-dimensional spectral analysis of cortical receptive field profiles , 1980, Vision Research.

[8]  Gabor T. Herman,et al.  Image reconstruction from projections : the fundamentals of computerized tomography , 1980 .

[9]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[10]  Azriel Rosenfeld,et al.  Multiresolution image processing and analysis , 1984 .

[11]  M. Vetterli Multi-dimensional sub-band coding: Some theory and algorithms , 1984 .

[12]  E. Dubois,et al.  The sampling and reconstruction of time-varying imagery with application in video systems , 1985, Proceedings of the IEEE.

[13]  E. Dubois,et al.  Digital picture processing , 1985, Proceedings of the IEEE.

[14]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[15]  L. R. Rabiner,et al.  A probabilistic distance measure for hidden Markov models , 1985, AT&T Technical Journal.

[16]  M. Unser Local linear transforms for texture measurements , 1986 .

[17]  Gregory Beylkin,et al.  Discrete radon transform , 1987, IEEE Trans. Acoust. Speech Signal Process..

[18]  Andrew B. Watson,et al.  The cortex transform: rapid computation of simulated neural images , 1987 .

[19]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[20]  Yair Shoham,et al.  Efficient bit allocation for an arbitrary set of quantizers [speech coding] , 1988, IEEE Trans. Acoust. Speech Signal Process..

[21]  Izidor Gertner A new efficient algorithm to compute the two-dimensional discrete Fourier transform , 1988, IEEE Trans. Acoust. Speech Signal Process..

[22]  Jorge L. C. Sanz,et al.  Radon and projection transform-based computer vision , 1988 .

[23]  H. Saunders,et al.  Probability, Random Variables and Stochastic Processes (2nd Edition) , 1989 .

[24]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[25]  C. W. Therrien,et al.  Decision, Estimation and Classification: An Introduction to Pattern Recognition and Related Topics , 1989 .

[26]  M. Varanasi,et al.  Parametric generalized Gaussian density estimation , 1989 .

[27]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  S. Mallat Multiresolution approximations and wavelet orthonormal bases of L^2(R) , 1989 .

[29]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.

[30]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .

[31]  N. D. Vvedenskaya and S. G. Gindikin Discrete Radon transform and image reconstruction , 1990 .

[32]  Ronald R. Coifman,et al.  Wavelet analysis and signal processing , 1990 .

[33]  Gunnar Karlsson,et al.  Theory of two-dimensional multirate filter banks , 1990, IEEE Trans. Acoust. Speech Signal Process..

[34]  Y. Meyer,et al.  Wavelets and Filter Banks , 1991 .

[35]  Jan P. Allebach,et al.  The analysis and design of multidimensional FIR perfect reconstruction filter banks for arbitrary sampling lattices , 1991 .

[36]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[37]  Roberto Hugo Bamberger,et al.  The directional filter bank: a multirate filter bank for the directional decomposition of images , 1991 .

[38]  Michel Barlaud,et al.  Image coding using wavelet transform , 1992, IEEE Trans. Image Process..

[39]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[40]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[41]  Jian Fan,et al.  Texture Classification by Wavelet Packet Signatures , 1993, MVA.

[42]  Martin Vetterli,et al.  Wavelets and filter banks: theory and design , 1992, IEEE Trans. Signal Process..

[43]  Ronald R. Coifman,et al.  Entropy-based algorithms for best basis selection , 1992, IEEE Trans. Inf. Theory.

[44]  Mark J. T. Smith,et al.  A filter bank for the directional decomposition of images: theory and design , 1992, IEEE Trans. Signal Process..

[45]  Stéphane Mallat,et al.  Singularity detection and processing with wavelets , 1992, IEEE Trans. Inf. Theory.

[46]  Edward H. Adelson,et al.  Shiftable multiscale transforms , 1992, IEEE Trans. Inf. Theory.

[47]  Jelena Kovacevic,et al.  Nonseparable multidimensional perfect reconstruction filter banks and wavelet bases for Rn , 1992, IEEE Trans. Inf. Theory.

[48]  Ronald A. DeVore,et al.  Image compression through wavelet transform coding , 1992, IEEE Trans. Inf. Theory.

[49]  Michael Unser,et al.  An improved least squares Laplacian pyramid for image compression , 1992, Signal Process..

[50]  I. Daubechies,et al.  Biorthogonal bases of compactly supported wavelets , 1992 .

[51]  Jelena Kovacevic,et al.  Filter banks and wavelets: Extensions and applications , 1992, Signal Processing.

[52]  Vijay K. Madisetti,et al.  The fast discrete Radon transform. I. Theory , 1993, IEEE Trans. Image Process..

[53]  Jean-Pierre Antoine,et al.  Image analysis with two-dimensional continuous wavelet transform , 1993, Signal Process..

[54]  Jan Flusser,et al.  Image Representation Via a Finite Radon Transform , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[55]  Martin Vetterli,et al.  Wavelets and recursive filter banks , 1993, IEEE Trans. Signal Process..

[56]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..

[57]  R. H. Bamberger,et al.  New results on two and three dimensional directional filter banks , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[58]  K Ramchandran,et al.  Best wavelet packet bases in a rate-distortion sense , 1993, IEEE Trans. Image Process..

[59]  S. Kay Fundamentals of statistical signal processing: estimation theory , 1993 .

[60]  Y. Meyer Wavelets and Operators , 1993 .

[61]  P. P. Vaidyanathan,et al.  Recent developments in multidimensional multirate systems , 1993, IEEE Trans. Circuits Syst. Video Technol..

[62]  Antonio Ortega,et al.  Bit allocation for dependent quantization with applications to multiresolution and MPEG video coders , 1994, IEEE Trans. Image Process..

[63]  D. L. Donoho,et al.  Ideal spacial adaptation via wavelet shrinkage , 1994 .

[64]  Harald Niederreiter,et al.  Introduction to finite fields and their applications: Theoretical Applications of Finite Fields , 1994 .

[65]  A. Kundu,et al.  Rotation and Gray Scale Transform Invariant Texture Identification using Wavelet Decomposition and Hidden Markov Model , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[66]  Michael Unser,et al.  A general sampling theory for nonideal acquisition devices , 1994, IEEE Trans. Signal Process..

[67]  Shih-Fu Chang,et al.  Transform features for texture classification and discrimination in large image databases , 1994, Proceedings of 1st International Conference on Image Processing.

[68]  Pietro Perona,et al.  Rotation invariant texture recognition using a steerable pyramid , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[69]  James R. Bergen,et al.  Pyramid-based texture analysis/synthesis , 1995, Proceedings., International Conference on Image Processing.

[70]  Zuowei Shen Affine systems in L 2 ( IR d ) : the analysis of the analysis operator , 1995 .

[71]  Michael Unser,et al.  Texture classification and segmentation using wavelet frames , 1995, IEEE Trans. Image Process..

[72]  Jelena Kovacevic,et al.  Wavelets and Subband Coding , 2013, Prentice Hall Signal Processing Series.

[73]  J. R. Rohlicek,et al.  Parameter estimation of dependence tree models using the EM algorithm , 1995, IEEE Signal Processing Letters.

[74]  David Salesin,et al.  Fast multiresolution image querying , 1995, SIGGRAPH.

[75]  William T. Freeman,et al.  Presented at: 2nd Annual IEEE International Conference on Image , 1995 .

[76]  P. P. Vaidyanathan,et al.  A new class of two-channel biorthogonal filter banks and wavelet bases , 1995, IEEE Trans. Signal Process..

[77]  Alberto Leon-Garcia,et al.  Estimation of shape parameter for generalized Gaussian distributions in subband decompositions of video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[78]  M. Vetterli,et al.  Nonseparable two- and three-dimensional wavelets , 1995, IEEE Trans. Signal Process..

[79]  A. Aldroubi Portraits of frames , 1995 .

[80]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[81]  A. Ravishankar Rao,et al.  Towards a texture naming system: Identifying relevant dimensions of texture , 1993, Vision Research.

[82]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[83]  Wen-Rong Wu,et al.  Rotation and gray-scale transform-invariant texture classification using spiral resampling, subband decomposition, and hidden Markov model , 1996, IEEE Trans. Image Process..

[84]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[85]  Kannan Ramchandran,et al.  Multimedia Analysis and Retrieval System (MARS) Project , 1996, Data Processing Clinic.

[86]  William A. Pearlman,et al.  A new, fast, and efficient image codec based on set partitioning in hierarchical trees , 1996, IEEE Trans. Circuits Syst. Video Technol..

[87]  Fang Liu,et al.  Periodicity, Directionality, and Randomness: Wold Features for Image Modeling and Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[88]  Song-Chun Zhu,et al.  FRAME: filters, random fields, and minimax entropy towards a unified theory for texture modeling , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[89]  Eero P. Simoncelli,et al.  A filter design technique for steerable pyramid image transforms , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[90]  C. Brislawn Classification of Nonexpansive Symmetric Extension Transforms for Multirate Filter Banks , 1996 .

[91]  Ronald R. Coifman,et al.  Brushlets: A Tool for Directional Image Analysis and Image Compression , 1997 .

[92]  A. Ron,et al.  Affine Systems inL2(Rd): The Analysis of the Analysis Operator , 1997 .

[93]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[94]  Martin Vetterli,et al.  Gröbner Bases and Multidimensional FIR Multirate Systems , 1997, Multidimens. Syst. Signal Process..

[95]  Paul A. Viola,et al.  Texture recognition using a non-parametric multi-scale statistical model , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[96]  Yoram Singer,et al.  Batch and On-Line Parameter Estimation of Gaussian Mixtures Based on the Joint Entropy , 1998, NIPS.

[97]  Eero P. Simoncelli,et al.  Texture characterization via joint statistics of wavelet coefficient magnitudes , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[98]  Martin Vetterli,et al.  Data Compression and Harmonic Analysis , 1998, IEEE Trans. Inf. Theory.

[99]  J. Benedetto,et al.  The Theory of Multiresolution Analysis Frames and Applications to Filter Banks , 1998 .

[100]  Wen-Rong Wu,et al.  Correction To "rotation And Gray-scale Transform-invariant Texture Classification Using Spiral Resampling, Subband Decomposition, And Hidden Markov Model" , 1996, IEEE Trans. Image Process..

[101]  Helmut Bölcskei,et al.  Frame-theoretic analysis of oversampled filter banks , 1998, IEEE Trans. Signal Process..

[102]  Vivek K. Goyal,et al.  Quantized Overcomplete Expansions in IRN: Analysis, Synthesis, and Algorithms , 1998, IEEE Trans. Inf. Theory.

[103]  Martin Vetterli,et al.  New methods for image retrieval , 1998 .

[104]  Thierry Pun,et al.  Assessing agreement between human and machine clusterings of image databases , 1998, Pattern Recognit..

[105]  Martin Vetterli,et al.  Oversampled filter banks , 1998, IEEE Trans. Signal Process..

[106]  R. DeVore,et al.  Nonlinear approximation , 1998, Acta Numerica.

[107]  S. Mallat A wavelet tour of signal processing , 1998 .

[108]  Ralph P. Grimaldi,et al.  Discrete and Combinatorial Mathematics: An Applied Introduction , 1998 .

[109]  Sang-Il Park,et al.  New directional filter banks and their applications in image processing , 1999 .

[110]  Richard G. Baraniuk,et al.  Image segmentation using wavelet-domain classification , 1999, Optics & Photonics.

[111]  Minh N. Do,et al.  Invariant Image Retrieval Using Wavelet Maxima Moment , 1999, VISUAL.

[112]  Pierre Moulin,et al.  Analysis of Multiresolution Image Denoising Schemes Using Generalized Gaussian and Complexity Priors , 1999, IEEE Trans. Inf. Theory.

[113]  Trygve Randen,et al.  Filtering for Texture Classification: A Comparative Study , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[114]  E. Candès,et al.  Ridgelets: a key to higher-dimensional intermittency? , 1999, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[115]  Mark J. T. Smith,et al.  A new directional filter bank for image analysis and classification , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[116]  D. Donoho Wedgelets: nearly minimax estimation of edges , 1999 .

[117]  Paul Scheunders,et al.  Statistical texture characterization from discrete wavelet representations , 1999, IEEE Trans. Image Process..

[118]  A. Cohen,et al.  Regularity of Multivariate Refinable Functions , 1999 .

[119]  B. S. Manjunath,et al.  Rotation-invariant texture classification using a complete space-frequency model , 1999, IEEE Trans. Image Process..

[120]  Pablo M. Salzberg,et al.  Tomography on the 3D-Torus and Crystals , 1999 .

[121]  E. Candès,et al.  Curvelets: A Surprisingly Effective Nonadaptive Representation for Objects with Edges , 2000 .

[122]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[123]  Minh N. Do,et al.  Integrated Browsing and Searching of Large Image Collections , 2000, VISUAL.

[124]  Minh N. Do,et al.  Texture similarity measurement using Kullback-Leibler distance on wavelet subbands , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[125]  Nuno Vasconcelos,et al.  A unifying view of image similarity , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[126]  Martin Vetterli,et al.  Rotation-invariant texture retrieval using steerable wavelet-domain hidden Markov models , 2000, SPIE Optics + Photonics.

[127]  Minh N. Do,et al.  Orthonormal finite ridgelet transform for image compression , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[128]  David L. Donoho,et al.  Curvelets, multiresolution representation, and scaling laws , 2000, SPIE Optics + Photonics.

[129]  Minh N. Do,et al.  Image denoising using orthonormal finite ridgelet transform , 2000, SPIE Optics + Photonics.

[130]  Jianying Hu,et al.  Matching and retrieval based on the vocabulary and grammar of color patterns , 2000, IEEE Trans. Image Process..

[131]  David L. Donoho,et al.  Digital curvelet transform: strategy, implementation, and experiments , 2000, SPIE Defense + Commercial Sensing.

[132]  Martin Vetterli,et al.  Footprints and edgeprints for image denoising and compression , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[133]  Minh N. Do,et al.  Frame reconstruction of the Laplacian pyramid , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[134]  C. Chui,et al.  Compactly supported tight and sibling frames with maximum vanishing moments , 2001 .

[135]  Minh N. Do,et al.  On the compression of two-dimensional piecewise smooth functions , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[136]  Vivek K Goyal,et al.  Quantized Frame Expansions with Erasures , 2001 .

[137]  A. Petukhov Explicit Construction of Framelets , 2001 .

[138]  Helmut Bölcskei,et al.  Noise reduction in oversampled filter banks using predictive quantization , 2001, IEEE Trans. Inf. Theory.

[139]  Minh N. Do,et al.  Best Adaptive Tiling in a Rate Distortion Sense , 2001 .

[140]  N. Kingsbury Complex Wavelets for Shift Invariant Analysis and Filtering of Signals , 2001 .

[141]  Minh N. Do,et al.  Pyramidal directional filter banks and curvelets , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[142]  Justin K. Romberg,et al.  Bayesian tree-structured image modeling using wavelet-domain hidden Markov models , 2001, IEEE Trans. Image Process..

[143]  I. Selesnick The Double Density DWT , 2001 .

[144]  Martin Vetterli,et al.  Wavelets, approximation, and compression , 2001, IEEE Signal Process. Mag..

[145]  Minh N. Do,et al.  Wavelet-based texture retrieval using generalized Gaussian density and Kullback-Leibler distance , 2002, IEEE Trans. Image Process..

[146]  Vivek K. Goyal,et al.  Filter bank frame expansions with erasures , 2002, IEEE Trans. Inf. Theory.

[147]  Robert Bregovic,et al.  Multirate Systems and Filter Banks , 2002 .

[148]  Michael T. Orchard,et al.  On the importance of combining wavelet-based nonlinear approximation with coding strategies , 2002, IEEE Trans. Inf. Theory.

[149]  Emmanuel J. Candès,et al.  The curvelet transform for image denoising , 2002, IEEE Trans. Image Process..

[150]  Minh N. Do,et al.  Rotation invariant texture characterization and retrieval using steerable wavelet-domain hidden Markov models , 2002, IEEE Trans. Multim..

[151]  Minh N. Do,et al.  The finite ridgelet transform for image representation , 2003, IEEE Trans. Image Process..

[152]  I. Daubechies,et al.  Framelets: MRA-based constructions of wavelet frames☆☆☆ , 2003 .

[153]  Petukhov,et al.  Constructive Approximation Symmetric Framelets , 2003 .

[154]  M. Do Fast approximation of Kullback-Leibler distance for dependence trees and hidden Markov models , 2003, IEEE Signal Processing Letters.

[155]  D. Donoho,et al.  Fast Slant Stack: a notion of Radon transform for data in a Cartesian grid which is rapidly computable, algebraically exact, geometrically faithful and invertible , 2003 .

[156]  Keith W. Hipel,et al.  Guest Editorial , 2003, IEEE Trans. Syst. Man Cybern. Part C.

[157]  Minh N. Do,et al.  Framing pyramids , 2003, IEEE Trans. Signal Process..

[158]  Fabrice Labeau,et al.  Discrete Time Signal Processing , 2004 .

[159]  Alex Pentland,et al.  Photobook: Content-based manipulation of image databases , 1996, International Journal of Computer Vision.

[160]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[161]  Ivan W. Selesnick,et al.  Gröbner bases and wavelet design , 2004, J. Symb. Comput..

[162]  Song-Chun Zhu,et al.  Filters, Random Fields and Maximum Entropy (FRAME): Towards a Unified Theory for Texture Modeling , 1998, International Journal of Computer Vision.