Paper Information - Estimating a Signal from a Magnitude Spectrogram via Convex Optimization

The problem of recovering a signal from the magnitude of its short-time Fourier transform (STFT) is a longstanding one in audio signal processing. Existing approaches rely on heuristics that often perform poorly because of the nonconvexity of the problem. We introduce a formulation of the problem that lends itself to a tractable convex program. We observe that our method yields better reconstructions than the standard Griffin-Lim algorithm. We provide an algorithm and discuss practical implementation details, including how the method can be scaled up to larger examples.
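For context, the Griffin-Lim baseline that the abstract compares against can be sketched as below. This is not the convex program proposed in the paper. It is a minimal illustration of the standard heuristic, which alternates between imposing the known magnitude and re-estimating the phase from a resynthesized signal. The function names (`stft`, `istft`, `griffin_lim`), the Hann window, the hop size, and the two-tone test signal are all assumptions chosen for illustration and do not come from the source.

```python
import numpy as np

def stft(x, win, hop):
    """Windowed frames of x, transformed with a real FFT (one row per frame)."""
    n = len(win)
    frames = [x[i:i + n] * win for i in range(0, len(x) - n + 1, hop)]
    return np.fft.rfft(np.array(frames), axis=1)

def istft(S, win, hop, length):
    """Least-squares inverse: weighted overlap-add divided by the summed squared window."""
    n = len(win)
    frames = np.fft.irfft(S, n=n, axis=1)
    x = np.zeros(length)
    norm = np.zeros(length)
    for k, frame in enumerate(frames):
        i = k * hop
        x[i:i + n] += frame * win
        norm[i:i + n] += win ** 2
    # Guard against division by zero where the window vanishes.
    return x / np.maximum(norm, 1e-12)

def griffin_lim(mag, win, hop, length, n_iter=50, seed=0):
    """Estimate a signal whose STFT magnitude approximates mag (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    # Start from a random phase guess (an assumption; other initializations exist).
    phase = np.exp(2j * np.pi * rng.random(mag.shape))
    errors = []
    for _ in range(n_iter):
        x = istft(mag * phase, win, hop, length)   # resynthesize a signal
        S = stft(x, win, hop)                      # re-analyze it
        errors.append(np.linalg.norm(np.abs(S) - mag) / np.linalg.norm(mag))
        phase = np.exp(1j * np.angle(S))           # keep phase, discard magnitude
    return x, errors

# Hypothetical test signal: two sinusoids, 4096 samples.
n, hop, length = 256, 64, 4096
win = 0.5 - 0.5 * np.cos(2 * np.pi * np.arange(n) / n)  # periodic Hann window
t = np.arange(length)
x = np.sin(2 * np.pi * 0.05 * t) + 0.5 * np.sin(2 * np.pi * 0.12 * t)

mag = np.abs(stft(x, win, hop))          # only the magnitude is kept
x_hat, errors = griffin_lim(mag, win, hop, length)
print(f"relative magnitude error: first {errors[0]:.4f}, last {errors[-1]:.4f}")
```

The relative magnitude error is tracked per iteration so that the result can be compared with other methods. Because the underlying problem is nonconvex, the iteration can stall at a poor local solution, which is the limitation the abstract describes.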

Julius O. Smith | Dennis L. Sun
