An efficient inverse short-time Fourier transform algorithm for improved signal reconstruction by time-frequency synthesis: Optimality and computational issues

This paper presents an improved signal reconstruction method based on a new inverse short-time Fourier transform (ISTFT) estimator. The main challenge addressed in this study is to design a more computationally efficient algorithm called exact formal approach (EFA) which overcomes the drawbacks of the popular overlap and add (OLA) and least squares (LS) methods in several cases of practical interest. The proposed EFA algorithm is based on a vector formulation and the exploitation of properties of the matrix formed by signal samples and frames corresponding to signals segments with overlap. A detailed comparative study shows the advantages of the EFA compared to the OLA and LS methods. Several experiments illustrate the performance and properties of the different estimators. The criteria of comparison are based on synthesis quality and denoising efficiency. The results indicate that, (1) from a computational point of view, the proposed algorithm EFA outperforms other popular ISTFT algorithms including OLA and LS and (2) that the EFA and LS have similar results in terms of synthesis quality and both outperform the algorithm currently most used for ISTFT, the OLA. The proposed EFA estimator can then improve ISTFT based applications involving signal enhancement and denoising.

[1]  Boualem Boashash,et al.  A review of time-frequency matched filter design with application to seizure detection in multichannel newborn EEG , 2014, Digit. Signal Process..

[2]  Karim Abed-Meraim,et al.  Blind Separation of Underdetermined Convolutive Mixtures Using Their Time–Frequency Representation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Boualem Boashash,et al.  Fast and memory-efficient algorithms for computing quadratic time–frequency distributions , 2013 .

[4]  Philipos C. Loizou,et al.  A noise-estimation algorithm for highly non-stationary environments , 2006, Speech Commun..

[5]  Hervé Jégou,et al.  Deleted DOI: Error Recovery Properties and Soft Decoding of Quasi-Arithmetic Codes , 2008 .

[6]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[7]  Irena Orovic,et al.  Compressive Sensing Based Separation of Nonstationary and Stationary Signals Overlapping in Time-Frequency , 2013, IEEE Transactions on Signal Processing.

[8]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[9]  Pascal Scalart,et al.  Improved Signal-to-Noise Ratio Estimation for Speech Enhancement , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  Srdjan Stankovic,et al.  About time-variant filtering of speech signals with time-frequency distributions for hands-free telephone systems , 2000, Signal Process..

[11]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[12]  Boualem Boashash,et al.  An Improved Design of High-Resolution Quadratic Time–Frequency Distributions for the Analysis of Nonstationary Multicomponent Signals Using Directional Compact Kernels , 2017, IEEE Transactions on Signal Processing.

[13]  Bin Yang A study of inverse short-time fourier transform , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[15]  Jonathan Le Roux,et al.  Consistent Wiener Filtering for Audio Source Separation , 2013, IEEE Signal Processing Letters.

[16]  Mohamed Djendi,et al.  Reducing over- and under-estimation of the a priori SNR in speech enhancement techniques , 2014, Digit. Signal Process..

[17]  Yannick Deville,et al.  A time-frequency blind signal separation method applicable to underdetermined mixtures of dependent sources , 2005, Signal Process..

[18]  A.V. Oppenheim,et al.  Enhancement and bandwidth compression of noisy speech , 1979, Proceedings of the IEEE.

[19]  Boualem Boashash,et al.  Time-Frequency Signal Analysis and Processing: A Comprehensive Reference , 2015 .

[20]  Boualem Boashash,et al.  Passive detection of accelerometer-recorded fetal movements using a time-frequency signal processing approach , 2014, Digit. Signal Process..

[21]  Boualem Boashash,et al.  Automatic signal abnormality detection using time-frequency features and machine learning: A newborn EEG seizure case study , 2016, Knowl. Based Syst..

[23]  J.B. Allen,et al.  A unified approach to short-time Fourier analysis and synthesis , 1977, Proceedings of the IEEE.

[24]  Irena Orovic,et al.  Robust Speech Watermarking Procedure in the Time-Frequency Domain , 2008, EURASIP J. Adv. Signal Process..

[25]  Pascal Scalart,et al.  Speech enhancement based on a priori signal to noise estimation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[26]  Peter Vary,et al.  Speech Enhancement by MAP Spectral Amplitude Estimation Using a Super-Gaussian Speech Model , 2005, EURASIP J. Adv. Signal Process..

[27]  Abdeldjalil Aïssa-El-Bey,et al.  Robust Estimation of Non-Stationary Noise Power Spectrum for Speech Enhancement , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[28]  John M. O'Toole,et al.  Time-Frequency Processing of Nonstationary Signals: Advanced TFD Design to Aid Diagnosis with Highlights from Medical Applications , 2013, IEEE Signal Processing Magazine.