The Use of Deep Learning in Speech Enhancement

Deep learning is an emerging area in current scenario. Mostly, Convolutional Neural Network (CNN) and Deep Belief Network (DBN) are used as the model in deep learning. It is termed as Deep Neural Network (DNN). The use of DNN is widely spread in many applications, exclusively for detection and classification purpose. In this paper, authors have used the same network for signal enhancement purpose. Speech is considered for the input signal with noise. The model of DNN is used with two layers. It has been compared with the ADALINE model to prove its efficacy.

[1]  Jessica J. M. Monaghan,et al.  Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users , 2017, Hearing Research.

[2]  Mihir Narayan Mohanty,et al.  Performance Analysis of Adaptive Algorithms for Speech Enhancement Applications , 2016 .

[3]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[4]  Ignacio Rojas,et al.  Neural networks: An overview of early research, current frameworks and new challenges , 2016, Neurocomputing.

[5]  Yu Tsao,et al.  Audio-visual speech enhancement using deep neural networks , 2016, 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).

[6]  Liang Dong,et al.  ILMSAF based speech enhancement with DNN and noise classification , 2016, Speech Commun..

[7]  Jun Du,et al.  An Experimental Study on Speech Enhancement Based on Deep Neural Networks , 2014, IEEE Signal Processing Letters.

[8]  Jiri Malek,et al.  Single channel speech enhancement using convolutional neural network , 2017, 2017 IEEE International Workshop of Electronics, Control, Measurement, Signals and their Application to Mechatronics (ECMSM).

[9]  A. Sreenivasa Murthy,et al.  Comparison of Speech Enhancement Algorithms , 2016 .