Missing Data Imputation for Time-Frequency Representations of Audio Signals

With the recent attention towards audio processing in the time-frequency domain we increasingly encounter the problem of missing data within that representation. In this paper we present an approach that allows us to recover missing values in the time-frequency domain of audio signals. The presented approach is able to deal with real-world polyphonic signals by operating seamlessly even in the presence of complex acoustic mixtures. We demonstrate that this approach outperforms generic missing data approaches, and we present a variety of situations that highlight its utility.

[1]  Trevor Hastie,et al.  Imputing Missing Data for Gene Expression Arrays , 2001 .

[2]  Richard M. Stern,et al.  Reconstruction of missing features for robust speech recognition , 2004, Speech Commun..

[3]  J. Ross Quinlan,et al.  Unknown Attribute Values in Induction , 1989, ML.

[4]  Thomas Hofmann,et al.  Learning the Similarity of Documents: An Information-Geometric Approach to Document Retrieval and Categorization , 1999, NIPS.

[5]  Jae S. Lim,et al.  Signal reconstruction from the short-time Fourier transform magnitude , 1982, ICASSP.

[6]  Tony Ezzat,et al.  An incremental algorithm for signal reconstruction from short-time fourier transform magnitude , 2006, INTERSPEECH.

[7]  Matthew Brand,et al.  Incremental Singular Value Decomposition of Uncertain Data with Missing Values , 2002, ECCV.

[8]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[9]  Bhiksha Raj,et al.  Sparse Overcomplete Latent Variable Decomposition of Counts Data , 2007, NIPS.

[10]  Michael I. Jordan,et al.  Learning from Incomplete Data , 1994 .

[11]  Richard M. Stern,et al.  Reconstruction of incomplete spectrograms for robust speech recognition , 2000 .

[12]  Jae Lim,et al.  Signal reconstruction from short-time Fourier transform magnitude , 1983 .

[13]  Hirokazu Kameoka,et al.  Computational auditory induction by missing-data non-negative matrix factorization , 2008, SAPA@INTERSPEECH.

[14]  Sam T. Roweis,et al.  One Microphone Source Separation , 2000, NIPS.

[15]  Michael I. Jordan,et al.  Unsupervised Learning from Dyadic Data , 1998 .

[16]  Jae Lim,et al.  Signal estimation from modified short-time Fourier transform , 1984 .

[17]  James Stuart Tanton,et al.  Encyclopedia of Mathematics , 2005 .

[18]  Daniel P. W. Ellis,et al.  Detailed graphical models for source separation and missing data interpolation in audio , 2004 .