Estimating a Signal from a Magnitude Spectrogram via Convex Optimization

The problem of recovering a signal from the magnitude of its short-time Fourier transform (STFT) is a longstanding one in audio signal processing. Existing approaches rely on heuristics that often perform poorly because of the nonconvexity of the problem. We introduce a formulation of the problem that lends itself to a tractable convex program. We observe that our method yields better reconstructions than the standard Griffin-Lim algorithm. We provide an algorithm and discuss practical implementation details, including how the method can be scaled up to larger examples.

[1]  Jean Laroche,et al.  About this phasiness business , 1997, ICMC.

[2]  Brendan J. Frey,et al.  Probabilistic Inference of Speech Signals from Phaseless Spectrograms , 2003, NIPS.

[3]  Richard F. Lyon,et al.  Auditory model inversion for sound separation , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  R. Gerchberg A practical algorithm for the determination of phase from image and diffraction plane pictures , 1972 .

[5]  Bhiksha Raj,et al.  Missing Data Imputation for Time-Frequency Representations of Audio Signals , 2011, J. Signal Process. Syst..

[6]  P. Smaragdis,et al.  Non-negative matrix factorization for polyphonic music transcription , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[7]  Jae S. Lim,et al.  Signal estimation from modified short-time Fourier transform , 1983, ICASSP.

[8]  Nancy Bertin,et al.  Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis , 2009, Neural Computation.

[9]  Michael I. Jordan,et al.  A Direct Formulation for Sparse Pca Using Semidefinite Programming , 2004, SIAM Rev..

[10]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[11]  A. Oppenheim,et al.  Signal reconstruction from phase or magnitude , 1980 .

[12]  Anton van den Hengel,et al.  Semidefinite Programming , 2014, Computer Vision, A Reference Guide.

[13]  Jae S. Lim,et al.  Signal reconstruction from the short-time Fourier transform magnitude , 1982, ICASSP.

[14]  Tony Ezzat,et al.  An incremental algorithm for signal reconstruction from short-time fourier transform magnitude , 2006, INTERSPEECH.