Parallel and Distributed Audio Concealment using Nonlocal Sparse Representations

We present a new class of parallel and distributed audio concealment (PDAC) algorithms which recover lost audio packets at the receiver to fight against channel impairment. The main contribution of this work is the proposal of using nonlocal sparse representations to characterize the prior constraint of undamaged audio. When combined with observation constraint, we obtain an alternating projection based audio concealment algorithm which recovers missing data in a parallel and distributed fashion. We also present two extensions of PDAC for more challenging situations: expectation-maximization PDAC (EM-PDAC) to handle consecutive packet loss and filter-bank PDAC (FB-PDAC) to repair complex music signals. Excellent preliminary experimental results are reported for a wide range of audio materials and loss conditions.

[1]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[2]  Onur G. Guleryuz,et al.  Nonlinear approximation based image recovery using adaptive sparse reconstructions and iterated denoising-part II: adaptive algorithms , 2006, IEEE Transactions on Image Processing.

[3]  B. Girod,et al.  A new technique for audio packet loss concealment , 1996, Proceedings of GLOBECOM'96. 1996 IEEE Global Telecommunications Conference.

[4]  Bor-Sen Chen,et al.  Model-based multirate representation of speech signals and its application to recovery of missing speech packets , 1997, IEEE Trans. Speech Audio Process..

[5]  V. Hardman,et al.  A survey of packet loss recovery techniques for streaming audio , 1998, IEEE Network.

[6]  Xiao Su,et al.  A survey of error-concealment schemes for real-time audio and video transmissions over the Internet , 2000, Proceedings International Symposium on Multimedia Software Engineering.

[7]  Henning Sanneck,et al.  Concealment of lost speech packets using adaptive packetization , 1998, Proceedings. IEEE International Conference on Multimedia Computing and Systems (Cat. No.98TB100241).

[8]  John J. Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities , 1999 .

[9]  I. Good,et al.  Ergodic theory and information , 1966 .

[10]  P. Vaidyanathan Multirate Systems And Filter Banks , 1992 .

[11]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[12]  Kathryn Momtahan,et al.  Linear prediction based packet loss concealment algorithm for PCM coded speech , 2001, IEEE Trans. Speech Audio Process..

[13]  Anil K. Jain,et al.  Feature Selection: Evaluation, Application, and Small Sample Performance , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .