Lossless wideband audio compression: prediction and transform

Lossless Wideband Audio Compression: Prediction and Transform This thesis studies lossless audio compression. In the domain of lossless compression, research takes place on two broad development sections, signal modeling and coding algorithm. The former is concerned with the understanding of the source signal, while coding is the more tightly specified task of efficiently representing a single symbol as a code. The focus of this thesis is the evaluation and the development of signal modeling techniques for lossless compression. Related with the modeling method used to decorrelate a signal, the data compression schemes are generally divided in two categories, predictive modeling and transform-based modeling. In the thesis, all two categories are investigated in depth and handled from the lossless viewpoint. The first contribution of the thesis is an exploration of the general audio compression systems including the lossy compression system. In predictive modeling, the structures of various linear prediction filters are introduced by presenting the fundamental autoregressive modeling. The prediction filters including the approaches to the nonstationary signal modeling and to the adaptive linear prediction filters are explored and evaluated by testing within a prototypical lossless audio compression system. For transform modeling, two well-known subband transform coding methods, Laplacian pyramid and subband coding scheme, are first described, and then the design methods of perfect reconstruction multirate filter banks are studied. Concerning with the modulated lapped orthogonal transform, the efficiency of linear prediction from subband and from fullband is formally compared and empirically examined. Wavelet transform is in depth studied from the various viewpoints in order to find the theoretical relationship between the wavelet and the multirate filter banks. Theoretical and practical aspects of reversible transforms are discussed by introducing the S-transform, S+P transform, and RTS transform. The lifting method is examined as a means to realize the biorthogonal wavelets. Integer lifting scheme with rounding-off method is investigated to construct reversible version of wavelet transforms and its performance is validated by applying to lossless audio compression. Finally, some of the more important results presented in this thesis are summarized with the suggesting directions for future research.

[1]  M. Vetterli Filter banks allowing perfect reconstruction , 1986 .

[2]  Meir Feder,et al.  A universal finite memory source , 1995, IEEE Trans. Inf. Theory.

[3]  B. Ninness,et al.  A unifying construction of orthonormal bases for system identification , 1997, IEEE Trans. Autom. Control..

[4]  Y. Meyer Principe d'incertitude, bases hilbertiennes et algèbres d'opérateurs , 1986 .

[5]  G. Battle A block spin construction of ondelettes. Part I: Lemarié functions , 1987 .

[6]  V. E. Benes,et al.  Statistical Theory of Communication , 1960 .

[7]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[8]  James D. Johnston,et al.  A filter family designed for use in quadrature mirror filter banks , 1980, ICASSP.

[9]  L. Griffiths A continuously-adaptive filter implemented as a lattice structure , 1977 .

[10]  Jorma Rissanen,et al.  A Predictive Least-Squares Principle , 1986 .

[11]  E. Robinson,et al.  A historical perspective of spectrum estimation , 1982, Proceedings of the IEEE.

[12]  R. Young,et al.  An introduction to nonharmonic Fourier series , 1980 .

[13]  Shidong Li General theory of discrete Gabor expansion , 1994, Optics & Photonics.

[14]  Jari P. Kaipio,et al.  Deterministic regression smoothness priors TVAR modelling , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[15]  Barry Truax,et al.  Discovering Inner Complexity: Time Shifting and Transposition with a Real-Time Granulation Technique , 1994 .

[16]  R. Coifman A real variable characterization of $H^{p}$ , 1974 .

[17]  Norbert Wiener,et al.  Extrapolation, Interpolation, and Smoothing of Stationary Time Series , 1964 .

[18]  Jaakko Astola,et al.  Adaptive context based sequential prediction for lossless audio compression , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[19]  Julius O. Smith,et al.  Bark and ERB bilinear transforms , 1999, IEEE Trans. Speech Audio Process..

[20]  Nasir D. Memon,et al.  Context-based, adaptive, lossless image coding , 1997, IEEE Trans. Commun..

[21]  L. Cohen Generalized Phase-Space Distribution Functions , 1966 .

[22]  J. Venbrux,et al.  A very high speed lossless compression/decompression chip set , 1991, [1991] Proceedings. Data Compression Conference.

[23]  S. Mallat A wavelet tour of signal processing , 1998 .

[24]  N. Levinson The Wiener (Root Mean Square) Error Criterion in Filter Design and Prediction , 1946 .

[25]  G. Strang,et al.  Fourier Analysis of the Finite Element Method in Ritz-Galerkin Theory , 1969 .

[26]  H. Feichtinger,et al.  Irregular sampling theorems and series expansions of band-limited functions , 1992 .

[27]  T. Barnwell,et al.  A procedure for designing exact reconstruction filter banks for tree-structured subband coders , 1984, ICASSP.

[28]  A. Grossmann,et al.  Transforms associated to square integrable group representations. I. General results , 1985 .

[29]  Robert F. Rice Some practical universal noiseless coding techniques, part 3, module PSl14,K+ , 1991 .

[30]  Martin Vetterli,et al.  Wavelets and recursive filter banks , 1993, IEEE Trans. Signal Process..

[31]  S. Thomas Alexander,et al.  Adaptive Signal Processing , 1986, Texts and Monographs in Computer Science.

[32]  A. Grossmann,et al.  DECOMPOSITION OF FUNCTIONS INTO WAVELETS OF CONSTANT SHAPE, AND RELATED TRANSFORMS , 1985 .

[33]  Jorma Rissanen,et al.  The Minimum Description Length Principle in Coding and Modeling , 1998, IEEE Trans. Inf. Theory.

[34]  Ali Tabatabai,et al.  Sub-band coding of digital images using symmetric short kernel filters and arithmetic coding techniques , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[35]  Thierry BLUzAbstract SIMPLE REGULARITY CRITERIA FOR SUBDIVISION SCHEMES , 1997 .

[36]  Henrique S. Malvar Lapped transforms for efficient transform/subband coding , 1990, IEEE Trans. Acoust. Speech Signal Process..

[37]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[38]  William A. Pearlman,et al.  An image multiresolution representation for lossless and lossy compression , 1996, IEEE Trans. Image Process..

[39]  B. Jawerth,et al.  A discrete transform and decompositions of distribution spaces , 1990 .

[40]  Jerry D. Gibson,et al.  Sequentially Adaptive Prediction and Coding of Speech Signals , 1974, IEEE Trans. Commun..

[41]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[42]  H. Akaike A new look at the statistical model identification , 1974 .

[43]  Plessis Robinson,et al.  TDM-FDM Transmultiplexer: Digital Polyphase and FFT , 1974 .

[44]  Stephen Todd,et al.  Parameter Reduction and Context Selection for Compression of Gray-Scale Images , 1985, IBM J. Res. Dev..

[45]  Martin Vetterli,et al.  Oversampled filter banks , 1998, IEEE Trans. Signal Process..

[46]  L. H. Anauer,et al.  Speech Analysis and Synthesis by Linear Prediction of the Speech Wave , 2000 .

[47]  R. Crochiere,et al.  Quadrature mirror filter design in the time domain , 1984 .

[48]  A. Haar Zur Theorie der orthogonalen Funktionensysteme , 1910 .

[49]  Stéphane Mallat,et al.  Singularity detection and processing with wavelets , 1992, IEEE Trans. Inf. Theory.

[50]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[51]  I. Daubechies,et al.  Biorthogonal bases of compactly supported wavelets , 1992 .

[52]  Peter Schröder,et al.  Spherical wavelets: efficiently representing functions on the sphere , 1995, SIGGRAPH.

[53]  Stéphane Mallat,et al.  Multifrequency channel decompositions of images and wavelet models , 1989, IEEE Trans. Acoust. Speech Signal Process..

[54]  Julius O. Smith,et al.  Spectral modeling synthesis: A sound analysis/synthesis based on a deterministic plus stochastic decomposition , 1990 .

[55]  James L. Flanagan,et al.  Digital coding of speech in sub-bands , 1976, Bell Syst. Tech. J..

[56]  J.G. Daugman,et al.  Entropy reduction and decorrelation in visual coding by oriented neural receptive fields , 1989, IEEE Transactions on Biomedical Engineering.

[57]  D. Gabor Acoustical Quanta and the Theory of Hearing , 1947, Nature.

[58]  P. Vaidyanathan Quadrature mirror filter banks, M-band extensions and perfect-reconstruction techniques , 1987, IEEE ASSP Magazine.

[59]  Thomas Kailath,et al.  A view of three decades of linear filtering theory , 1974, IEEE Trans. Inf. Theory.

[60]  Unto K. Laine,et al.  Warped linear prediction (WLP) in speech and audio processing , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[61]  Samuel D. Stearns Arithmetic coding in lossless waveform compression , 1995, IEEE Trans. Signal Process..

[62]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[63]  Martin Vetterli,et al.  Splitting a signal into subsampled channels allowing perfect reconstruction , 1985 .

[64]  I. Daubechies,et al.  Two-scale difference equations II. local regularity, infinite products of matrices and fractals , 1992 .

[65]  D. Slepian Prolate spheroidal wave functions, fourier analysis, and uncertainty — V: the discrete case , 1978, The Bell System Technical Journal.

[66]  M. G. Kendall,et al.  A Study in the Analysis of Stationary Time-Series. , 1955 .

[67]  Bernd Girod,et al.  Subband Image Coding , 1996 .

[68]  J. Doob The Elementary Gaussian Processes , 1944 .

[69]  Richard Kronland-Martinet,et al.  Detection of abrupt changes in sound signals with the help of wavelet transforms , 1987 .

[70]  Jerry D. Gibson,et al.  A comparison of backward adaptive prediction algorithms in low delay speech coders , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[71]  N. Wiener The Wiener RMS (Root Mean Square) Error Criterion in Filter Design and Prediction , 1949 .

[72]  Michel Barlaud,et al.  Image coding using wavelet transform , 1992, IEEE Trans. Image Process..

[73]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[74]  Fred Mintzer,et al.  Filters for distortion-free two-band multirate filter banks , 1985, IEEE Trans. Acoust. Speech Signal Process..

[75]  David C. van Voorhis,et al.  Optimal source codes for geometrically distributed integer alphabets (Corresp.) , 1975, IEEE Trans. Inf. Theory.

[76]  Ian H. Witten,et al.  Arithmetic coding for data compression , 1987, CACM.

[77]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[78]  Benjamin Belzer,et al.  Wavelet filter evaluation for image compression , 1995, IEEE Trans. Image Process..

[79]  P. Tchamitchian Biorthogonalité et Théorie des Opérateurs , 1987 .

[80]  D. B. Preston Spectral Analysis and Time Series , 1983 .

[81]  John W. Woods,et al.  Subband coding of images , 1986, IEEE Trans. Acoust. Speech Signal Process..

[82]  I. Daubechies,et al.  Two-scale difference equations I: existence and global regularity of solutions , 1991 .

[83]  S. Golomb Run-length encodings. , 1966 .

[84]  Jaakko Astola,et al.  Adaptive L-predictors based on finite state machine context selection , 1997, Proceedings of International Conference on Image Processing.

[85]  Don Speck,et al.  LOW-COMPLEXITY SUBBAND CODING FOR IMAGE COMPRESSION(Ph.D. Dissertation Proposal) , 1994 .

[86]  Thomas P. Barnwell,et al.  Recursive autocorrelation computation for LPC analysis , 1977 .

[87]  Wim Sweldens,et al.  The lifting scheme: a construction of second generation wavelets , 1998 .

[88]  Truong Q. Nguyen,et al.  The GenLOT: generalized linear-phase lapped orthogonal transform , 1996, IEEE Trans. Signal Process..

[89]  C. Burrus,et al.  Introduction to Wavelets and Wavelet Transforms: A Primer , 1997 .

[90]  Robert Bregovic,et al.  Multirate Systems and Filter Banks , 2002 .

[91]  Romain Murenzi,et al.  Wavelet Transform of Fractal Aggregates , 1989 .

[92]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.

[93]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[94]  Shie Qian,et al.  Discrete Gabor transform , 1993, IEEE Trans. Signal Process..

[95]  P. Urcun,et al.  A MUSICAM source codec for digital audio broadcasting and storage , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[96]  William J. Williams,et al.  Improved time-frequency representation of multicomponent signals using exponential kernels , 1989, IEEE Trans. Acoust. Speech Signal Process..

[97]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[98]  B. Atal,et al.  Speech analysis and synthesis by linear prediction of the speech wave. , 1971, The Journal of the Acoustical Society of America.

[99]  Jaakko Astola,et al.  Adaptive Boolean predictive modeling with application to lossless image coding , 1997, Optics & Photonics.

[100]  S. Mallat Multiresolution approximations and wavelet orthonormal bases of L^2(R) , 1989 .

[101]  Cornelis P. Janse,et al.  Time-Frequency Distributions of Loudspeakers: The Application of the Wigner Distribution , 1983 .

[102]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[103]  Xiaolin Wu,et al.  Lossless compression of continuous-tone images via context selection, quantization, and modeling , 1997, IEEE Trans. Image Process..

[104]  Martin Vetterli,et al.  Wavelets and filter banks: theory and design , 1992, IEEE Trans. Signal Process..

[105]  Robert F. Rice,et al.  Some practical universal noiseless coding techniques , 1979 .

[106]  J. Robert Stuart,et al.  Lossless Compression Using IIR Prediction Filters , 1997 .

[107]  Unto K. Laine,et al.  WLPAC-A Perceptual Audio Codec in a Nutshell , 1997 .

[108]  J. Lina,et al.  Complex Daubechies Wavelets , 1995 .

[109]  G. Saridis Parameter estimation: Principles and problems , 1983, Proceedings of the IEEE.

[110]  P. P. Vaidyanathan,et al.  Theory and design of M-channel maximally decimated quadrature mirror filters with arbitrary M, having the perfect-reconstruction property , 1987, IEEE Trans. Acoust. Speech Signal Process..

[111]  I. Daubechies,et al.  Factoring wavelet transforms into lifting steps , 1998 .

[112]  Ahmad Zandi,et al.  CREW: Compression with Reversible Embedded Wavelets , 1995, Proceedings DCC '95 Data Compression Conference.

[113]  JORMA RISSANEN,et al.  A universal data compression system , 1983, IEEE Trans. Inf. Theory.

[114]  I. Daubechies,et al.  Wavelet Transforms That Map Integers to Integers , 1998 .

[115]  R. F. Rice,et al.  Some practical universal noiseless coding techniques, part 2 , 1983 .

[116]  John Makhoul,et al.  Adaptive noise spectral shaping and entropy coding in predictive coding of speech , 1979 .

[117]  M. F.,et al.  Bibliography , 1985, Experimental Gerontology.

[118]  G. Yule On a Method of Investigating Periodicities in Disturbed Series, with Special Reference to Wolfer's Sunspot Numbers , 1927 .

[119]  P. G. Lemari'e,et al.  Ondelettes `a localisation exponentielle , 1988 .

[120]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[121]  K. H. Barratt Digital Coding of Waveforms , 1985 .

[122]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[123]  Amara Lynn Graps,et al.  An introduction to wavelets , 1995 .

[124]  N. Jayant Adaptive quantization with a one-word memory , 1973 .

[125]  Ali N. Akansu,et al.  On Lapped Orthogonal Transforms , 1992 .

[126]  Guillermo Sapiro,et al.  LOCO-I: a low complexity, context-based, lossless image compression algorithm , 1996, Proceedings of Data Compression Conference - DCC '96.

[127]  Y. Meyer,et al.  De la recherche pétrolière à la géométrie des espaces de Banach en passant par les paraproduits , 1986 .

[128]  R. W. Schafer,et al.  Lossless compression of digital audio , 2001, IEEE Signal Process. Mag..

[129]  Hans G. Feichtinger,et al.  Theory and practice of irregular sampling , 2021, Wavelets.

[130]  Ryozo Kishimoto,et al.  A study on perfect reconstructive subband coding , 1991, IEEE Trans. Circuits Syst. Video Technol..

[131]  H. Strube Linear prediction on a warped frequency scale , 1980 .

[132]  Gerhard Stoll,et al.  ISO-MPEG-1 Audio: A Generic Standard for Coding of High-: Quality Digital Audio , 1994 .

[133]  A. Grossmann,et al.  DECOMPOSITION OF HARDY FUNCTIONS INTO SQUARE INTEGRABLE WAVELETS OF CONSTANT SHAPE , 1984 .

[134]  R. Balian Un principe d'incertitude fort en théorie du signal ou en mécanique quantique , 1981 .

[135]  I. Daubechies,et al.  PAINLESS NONORTHOGONAL EXPANSIONS , 1986 .

[136]  Jorma Rissanen,et al.  Applications of universal context modeling to lossless compression of gray-scale images , 1995, Conference Record of The Twenty-Ninth Asilomar Conference on Signals, Systems and Computers.

[137]  Christopher Heil,et al.  Wavelets and frames , 1990 .

[138]  Glen G. Langdon,et al.  Arithmetic Coding , 1979 .

[139]  A.R.D. Thornton,et al.  Foundations of Modern Auditory Theory , 1970 .

[140]  T. Robinson Simple Lossless and Near-lossless Waveform Compression , 1994 .

[141]  Curtis Roads,et al.  Automated Granular Synthesis of Sound , 1978 .

[142]  Dennis M. Healy,et al.  A parametric class of discrete Gabor expansions , 1996, IEEE Trans. Signal Process..

[143]  David G. Messerschmitt,et al.  A class of generalized lattice filters , 1980 .

[144]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .

[145]  Benjamin Belzer,et al.  Filter evaluation and selection in wavelet image compression , 1994, Proceedings of IEEE Data Compression Conference (DCC'94).

[146]  Gilbert Strang,et al.  Wavelets and Dilation Equations: A Brief Introduction , 1989, SIAM Rev..

[147]  P.G. Howard,et al.  Fast and efficient lossless image compression , 1993, [Proceedings] DCC `93: Data Compression Conference.

[148]  Edward H. Adelson,et al.  Orthogonal Pyramid Transforms For Image Coding. , 1987, Other Conferences.

[149]  Günther Theile,et al.  MUSICAM-Surround: A Universal Multichannel Coding System Compatible with ISO 11172-3 , 1992 .

[150]  A. Robert Calderbank,et al.  Lossless image compression using integer to integer wavelet transforms , 1997, Proceedings of International Conference on Image Processing.

[151]  Mati Wax Order selection for AR models by predictive least-squares , 1986, 1986 25th IEEE Conference on Decision and Control.

[152]  Martin Vetterli,et al.  Optimal time segmentation for signal modeling and compression , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[153]  Refractor Vision , 2000, The Lancet.

[154]  W. Sweldens The Lifting Scheme: A Custom - Design Construction of Biorthogonal Wavelets "Industrial Mathematics , 1996 .

[155]  William A. Pearlman,et al.  Reversible image compression via multiresolution representation and predictive coding , 1993, Other Conferences.

[156]  R. Duffin,et al.  A class of nonharmonic Fourier series , 1952 .

[157]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[158]  Peter Whittle,et al.  A Study in the Analysis of Stationary Time-Series. , 1954 .

[159]  R. Murenzi Wavelet Transforms Associated to the n-Dimensional Euclidean Group with Dilations: Signal in More Than One Dimension , 1990 .

[160]  Joseph Rothweiler,et al.  Polyphase quadrature filters-A new subband coding technique , 1983, ICASSP.

[161]  Eric Moulines,et al.  Non-parametric techniques for pitch-scale and time-scale modification of speech , 1995, Speech Commun..

[162]  M. Vetterli,et al.  Perfect reconstruction FIR filter banks: lapped transforms, pseudo QMFs and paraunitary matrices , 1988, 1988., IEEE International Symposium on Circuits and Systems.

[163]  D. Esteban,et al.  Application of quadrature mirror filters to split band voice coding schemes , 1977 .

[164]  Gilbert T. Walker,et al.  On Periodicity in Series of Related Terms , 1931 .

[165]  I. Daubechies Orthonormal bases of compactly supported wavelets II: variations on a theme , 1993 .

[166]  Thomas W. Parks,et al.  GENERATION AND COMBINATION OF GRAINS FOR MUSIC SYNTHESIS. , 1988 .

[167]  Helly Aufgaben und Lehrsätze aus der Analysis , 1928 .

[168]  J. Benedetto Irregular sampling and frames , 1993 .

[169]  B. Johnson,et al.  A chip set for lossless image compression , 1991 .

[170]  Xavier Serra,et al.  A sound analysis/synthesis system based on a deterministic plus stochastic decomposition , 1990 .

[171]  Simon Haykin,et al.  Advances in spectrum analysis and array processing , 1991 .

[172]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..