Speaker Identification from Mixture of Speech and Non-speech Audio Signal

Separating a speaker from a mixture of multiple sounds is a challenging problem in speech processing, and it has driven the development of a subfield known as speaker identification. This work presents a new approach to the problem based on acoustic features of the audio signal. A filtering algorithm first separates the mixture of speech and non-speech audio; the speech component is then recognized by extracting salient acoustic features. As part of its contribution, the work introduces a new feature named del-MFCC. The computed features are used to identify speakers with several popular classifiers. The performance of the presented methodology is compared with existing related methods to demonstrate the usefulness of the proposed approach.
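The abstract does not define del-MFCC. As a minimal sketch, assuming the name refers to a delta (frame-to-frame rate of change) of MFCC vectors, the widely used regression-based delta can be computed as below. The function name `delta_features`, the `width` parameter, and the toy input are hypothetical and chosen only for illustration; the authors' actual feature may be defined differently.

```python
from typing import List


def delta_features(frames: List[List[float]], width: int = 2) -> List[List[float]]:
    """Regression-based delta of per-frame coefficient vectors.

    ASSUMPTION: this is the conventional delta formula, not necessarily
    the del-MFCC feature proposed in the paper.
    d_t = sum_{n=1..width} n * (c_{t+n} - c_{t-n}) / (2 * sum_{n=1..width} n^2)
    Frames beyond either edge are replaced by the nearest valid frame.
    """
    if width < 1:
        raise ValueError("width must be >= 1")
    if not frames:
        return []

    n_frames = len(frames)
    n_coeffs = len(frames[0])
    denom = 2 * sum(n * n for n in range(1, width + 1))

    deltas = []
    for t in range(n_frames):
        row = []
        for k in range(n_coeffs):
            acc = 0.0
            for n in range(1, width + 1):
                ahead = frames[min(t + n, n_frames - 1)][k]   # clamp at last frame
                behind = frames[max(t - n, 0)][k]             # clamp at first frame
                acc += n * (ahead - behind)
            row.append(acc / denom)
        deltas.append(row)
    return deltas


# Hypothetical toy input: 5 frames of 2 coefficients each
# (real MFCC frames would come from a feature extractor).
mfcc = [[0.0, 1.0], [1.0, 1.0], [2.0, 1.0], [3.0, 1.0], [4.0, 1.0]]
deltas = delta_features(mfcc, width=2)
```

In this toy input the first coefficient rises by one per frame and the second is constant, so the delta at the middle frame is 1.0 for the first coefficient and 0.0 for the second. In a pipeline like the one described, such delta vectors would typically be appended to the static MFCCs before classification.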
