Characterization of Analytic Wavelet Transforms and a New Phaseless Reconstruction Algorithm

We obtain a characterization of all wavelets leading to analytic wavelet transforms (WT). The characterization is obtained as a byproduct of the theoretical foundations of a new method for wavelet phase reconstruction from magnitude-only coefficients. The cornerstone of our analysis is an expression of the partial derivatives of the continuous WT, which results in phase–magnitude relationships similar to the short-time Fourier transform setting and valid for the generalized family of Cauchy wavelets. We show that the existence of such relations is equivalent to analyticity of the WT up to a multiplicative weight and a scaling of the mother wavelet. The implementation of the new phaseless reconstruction method is considered in detail and compared to previous methods. It is shown that the proposed method provides significant performance gains and a great flexibility regarding accuracy versus complexity. In addition, we discuss the relation between scalogram reassignment operators and the wavelet transform phase gradient and present an observation on the phase around zeros of the WT.

[1]  Tuomas Virtanen,et al.  Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[3]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[4]  Patrick Flandrin,et al.  Improving the readability of time-frequency and time-scale representations by the reassignment method , 1995, IEEE Trans. Signal Process..

[5]  Irène Waldspurger,et al.  Phase Retrieval for Wavelet Transforms , 2015, IEEE Transactions on Information Theory.

[6]  Robert D. Nowak,et al.  Wavelet-based Rician noise removal for magnetic resonance imaging , 1999, IEEE Trans. Image Process..

[7]  Thomas Strohmer,et al.  Numerical algorithms for discrete Gabor expansions , 1998 .

[8]  I. Daubechies,et al.  Synchrosqueezed wavelet transforms: An empirical mode decomposition-like tool , 2011 .

[9]  R. Balan,et al.  On signal reconstruction without phase , 2006 .

[10]  Zdeněk Průša,et al.  REAL-TIME SPECTROGRAM INVERSION USING PHASE GRADIENT HEAP INTEGRATION , 2016 .

[11]  Sylvain Meignen,et al.  Time-Frequency Reassignment and Synchrosqueezing: An Overview , 2013, IEEE Signal Processing Magazine.

[12]  Ingrid Daubechies,et al.  Reconstructing Real-Valued Functions from Unsigned Coefficients with Respect to Wavelet and Other Frames , 2016, 1601.07579.

[13]  Nicki Holighaus,et al.  Phase vocoder done right , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[14]  Sofia C. Olhede,et al.  Generalized Morse Wavelets as a Superfamily of Analytic Wavelets , 2012, IEEE Transactions on Signal Processing.

[15]  D. Nelson,et al.  Cross-spectral methods for processing speech. , 2001, The Journal of the Acoustical Society of America.

[16]  R. Gerchberg A practical algorithm for the determination of phase from image and diffraction plane pictures , 1972 .

[17]  Eric Moulines,et al.  Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones , 1989, Speech Commun..

[18]  Nicki Holighaus,et al.  Non-iterative filter bank phase (re)construction , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[19]  Pavel Rajmic,et al.  Discrete Wavelet Transforms in the Large Time-Frequency Analysis Toolbox for MATLAB/GNU Octave , 2016, ACM Trans. Math. Softw..

[20]  Michael R. Portnoff Magnitude-phase relationships for short-time Fourier transforms based on Gaussian analysis windows , 1979, ICASSP.

[21]  Jean Laroche,et al.  Improved phase vocoder time-scale modification of audio , 1999, IEEE Trans. Speech Audio Process..

[22]  Patrick Flandrin,et al.  Making reassignment adjustable: The Levenberg-Marquardt approach , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[23]  Richard Kronland-Martinet,et al.  Asymptotic wavelet and Gabor analysis: Extraction of instantaneous frequencies , 1992, IEEE Trans. Inf. Theory.

[24]  Karlheinz Gröchenig,et al.  Acceleration of the frame algorithm , 1993, IEEE Trans. Signal Process..

[25]  S. Geneva,et al.  Sound Quality Assessment Material: Recordings for Subjective Tests , 1988 .

[26]  Olivier Rioul,et al.  Fast algorithms for discrete and continuous wavelet transforms , 1992, IEEE Trans. Inf. Theory.

[27]  Michael Unser,et al.  Fast implementation of the continuous wavelet transform with integer scales , 1994, IEEE Trans. Signal Process..

[28]  Patrick Flandrin,et al.  Separability, positivity, and minimum uncertainty in time-frequency energy distributions , 1998 .

[29]  Peter L. Søndergaard,et al.  The pole behavior of the phase derivative of the short-time Fourier transform , 2011, 1103.0409.

[30]  Martin Vetterli,et al.  Oversampled filter banks , 1998, IEEE Trans. Signal Process..

[31]  Lalit M. Patnaik,et al.  Classification of magnetic resonance brain images using wavelets as input to support vector machine and neural network , 2006, Biomed. Signal Process. Control..

[32]  Nicki Holighaus,et al.  The Large Time-Frequency Analysis Toolbox 2.0 , 2013, CMMR.

[33]  Michael I. Jordan,et al.  Learning Spectral Clustering, With Application To Speech Separation , 2006, J. Mach. Learn. Res..

[34]  Mark J. Shensa,et al.  The discrete wavelet transform: wedding the a trous and Mallat algorithms , 1992, IEEE Trans. Signal Process..

[35]  Philipp Grohs,et al.  Phase Retrieval In The General Setting Of Continuous Frames For Banach Spaces , 2016, SIAM J. Math. Anal..

[36]  Douglas J. Nelson,et al.  Instantaneous Higher Order Phase Derivatives , 2002, Digit. Signal Process..

[37]  Bruno Torrésani,et al.  Variations on Hough-wavelet transforms for time-frequency chirp detection , 2003, SPIE Optics + Photonics.

[38]  Emmanuel J. Candès,et al.  PhaseLift: Exact and Stable Signal Recovery from Magnitude Measurements via Convex Programming , 2011, ArXiv.

[39]  Luis Romeral,et al.  Fault Detection in Induction Machines Using Power Spectral Density in Wavelet Decomposition , 2008, IEEE Transactions on Industrial Electronics.

[40]  Samy Bengio,et al.  Tacotron: Towards End-to-End Speech Synthesis , 2017, INTERSPEECH.

[41]  Patrick Flandrin,et al.  On Phase-Magnitude Relationships in the Short-Time Fourier Transform , 2012, IEEE Signal Processing Letters.

[42]  O. Christensen An introduction to frames and Riesz bases , 2002 .

[43]  Nathanael Perraudin,et al.  Adversarial Generation of Time-Frequency Features with application in audio synthesis , 2019, ICML.

[44]  Laurent Daudet,et al.  Time-frequency and time-scale vector fields for deforming time-frequency and time-scale representations , 1999, Optics & Photonics.

[45]  Zdenek Prusa,et al.  A Noniterative Method for Reconstruction of Phase From STFT Magnitude , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[46]  Rujie Yin,et al.  Stable Phase Retrieval in Infinite Dimensions , 2016, Foundations of Computational Mathematics.

[47]  Peter L. Søndergaard,et al.  A fast Griffin-Lim algorithm , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[48]  Anssi Klapuri,et al.  A Matlab Toolbox for Efficient Perfect Reconstruction Time-Frequency Transforms with Log-Frequency Resolution , 2014, Semantic Audio.

[49]  Pavel Rajmic,et al.  Towards High Quality Real-Time Signal Reconstruction from STFT Magnitude , 2017 .

[50]  Joan Bruna,et al.  Source separation with scattering Non-Negative Matrix Factorization , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[51]  Nicolas Sturmel,et al.  SIGNAL RECONSTRUCTION FROM STFT MAGNITUDE : A STATE OF THE ART , 2011 .

[52]  J. H. Thomas,et al.  Damage characterization of polymer-based composite materials: Multivariable analysis and wavelet transform for clustering acoustic emission data , 2008 .

[53]  Stéphane Mallat,et al.  A Wavelet Tour of Signal Processing - The Sparse Way, 3rd Edition , 2008 .

[54]  Dustin G. Mixon,et al.  Saving phase: Injectivity and stability for phase retrieval , 2013, 1302.4618.

[55]  Thibaud Necciari,et al.  Audlet Filter Banks: A Versatile Analysis/Synthesis Framework Using Auditory Frequency Scales , 2018 .

[56]  I. Daubechies,et al.  Time-frequency localisation operators-a geometric phase space approach: II. The use of dilations , 1988 .

[57]  Yonina C. Eldar,et al.  Phase Retrieval with Application to Optical Imaging: A contemporary overview , 2015, IEEE Signal Processing Magazine.

[58]  Thomas Grill,et al.  A Framework for Invertible, Real-Time Constant-Q Transforms , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[59]  Dustin G. Mixon,et al.  Finite Frames and Filter Banks , 2013 .

[60]  Stéphane Mallat,et al.  Phase Retrieval for the Cauchy Wavelet Transform , 2014, ArXiv.

[61]  Helmut Bölcskei,et al.  Frame-theoretic analysis of oversampled filter banks , 1998, IEEE Trans. Signal Process..

[62]  Michel Barlaud,et al.  Image coding using wavelet transform , 1992, IEEE Trans. Image Process..

[63]  Gerard Ascensi,et al.  Model Space Results for the Gabor and Wavelet Transforms , 2008, IEEE Transactions on Information Theory.

[64]  Eric Moulines,et al.  Non-parametric techniques for pitch-scale and time-scale modification of speech , 1995, Speech Commun..

[65]  Sofia C. Olhede,et al.  On the Analytic Wavelet Transform , 2007, IEEE Transactions on Information Theory.

[66]  Jae S. Lim,et al.  Signal estimation from modified short-time Fourier transform , 1983, ICASSP.

[67]  Martin Vetterli,et al.  Adaptive wavelet thresholding for image denoising and compression , 2000, IEEE Trans. Image Process..

[68]  Nicki Holighaus,et al.  Theory, implementation and applications of nonstationary Gabor frames , 2011, J. Comput. Appl. Math..

[69]  L. D. Abreu Superframes and Polyanalytic Wavelets , 2017 .

[70]  Shrikanth Narayanan,et al.  Environmental Sound Recognition With Time–Frequency Audio Features , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[71]  Yonina C. Eldar,et al.  GESPAR: Efficient Phase Retrieval of Sparse Signals , 2013, IEEE Transactions on Signal Processing.

[72]  J R Fienup,et al.  Phase retrieval algorithms: a comparison. , 1982, Applied optics.