Iterative Clustering Approach for Text Independent Speaker Identification using Multiple Features

The main objective of this paper is to explore the effectiveness of features for identifying speakers. We propose features such as line spectral frequency (LSF), differential line spectral frequency (DLSF), mel frequency cepstral coefficients (MFCC), discrete cosine transform cepstrum (DCTC), perceptual linear predictive cepstrum (PLP) and mel frequency perceptual linear predictive cepstrum (MF-PLP). These features are captured and training models are developed by K-means clustering procedure. A speaker identification system is evaluated on noise added test speeches and the experimental results reveal the performance of the proposed algorithm in identifying speakers based on minimum distance between test features and clusters and also highlight the best choice of feature set among all the proposed features for 50 speakers chosen randomly from ldquoTIMITrdquo database. In this work, F-ratio is computed as a theoretical measure to validate the experimental results.

[1]  Rangarao Muralishankar,et al.  Pseudo Complex Cepstrum Using Discrete Cosine Transform , 2005, Int. J. Speech Technol..

[2]  A. Revathi,et al.  A noise reduction technique of speech signal using ICA and spectral analysis , 2007 .

[3]  Hynek Hermansky,et al.  The challenge of inverse-E: the RASTA-PLP method , 1991, [1991] Conference Record of the Twenty-Fifth Asilomar Conference on Signals, Systems & Computers.

[4]  S. Arivazhagan,et al.  Fingerprint Verification Using Gabor Co-occurrence Features , 2007, International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007).

[5]  Hynek Hermansky,et al.  RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..

[6]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[7]  牧野 正三 Perceptually based processing in automatic speech recognition , 1986 .

[8]  Hugo Cordeiro,et al.  Speaker Characterization with MLSFs , 2006, 2006 IEEE Odyssey - The Speaker and Language Recognition Workshop.

[9]  Y. Venkataramani,et al.  Effectiveness of LP Derived Features and DCTC in Twins Identification - Iterative Speaker Clustering Approach , 2007, International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007).

[10]  Hong-Goo Kang,et al.  Speaker recognition based on transformed line spectral frequencies , 2004, Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, 2004. ISPACS 2004..