Use of perceptual features in iterative clustering based twins identification system

The main objective of this paper is to explore the effectiveness of perceptual features in identifying twins by including the interference from pseudo random noise and background conversation on test speech. An algorithm is developed for identifying twins by extracting the proposed features on speech segments of 16 msecs duration. In this algorithm, these features are captured and quantized into M = L/10 clusters representing L feature vectors of training speech. Twins are identified based on the minimum average distance between speaker models developed on clean speech and noisy test speech vectors. These perceptual features are analyzed in this work and the experimental results reveal the comparative performance of the proposed features under various SNR conditions for the speech database containing speakers in the same age group. The noteworthy feature in this work is the theoretical validation of experimental results and performance evaluation based on the reduction in training and test data.

[1]  Hynek Hermansky,et al.  The challenge of inverse-E: the RASTA-PLP method , 1991, [1991] Conference Record of the Twenty-Fifth Asilomar Conference on Signals, Systems & Computers.

[2]  S. Arivazhagan,et al.  Fingerprint Verification Using Gabor Co-occurrence Features , 2007, International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007).

[3]  Y. Venkataramani,et al.  Effectiveness of LP Derived Features and DCTC in Twins Identification - Iterative Speaker Clustering Approach , 2007, International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007).

[4]  T.K. Basu,et al.  Detection of bilingual twins by Teager energy based features , 2004, 2004 International Conference on Signal Processing and Communications, 2004. SPCOM '04..

[5]  Hemant A. Patil,et al.  Effectiveness of LP Based Features for Identification of Professional Mimics in Indian Languages , 2006 .

[6]  Hynek Hermansky,et al.  RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..

[7]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[8]  A. Revathi,et al.  A noise reduction technique of speech signal using ICA and spectral analysis , 2007 .

[9]  T.K. Basu,et al.  Teager energy mel cepstrum for identification of twins in Marathi , 2004, Proceedings of the IEEE INDICON 2004. First India Annual Conference, 2004..

[10]  Y. Venkataramani,et al.  Iterative Clustering Approach for Text Independent Speaker Identification using Multiple Features , 2008, 2008 2nd International Conference on Signal Processing and Communication Systems.

[11]  Yu-Hung Kao Robustness study of free-text speaker identification and verification , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.