论文信息 - Control System with Speech Recognition Using MFCC and Euclidian Distance Algorithm

Control System with Speech Recognition Using MFCC and Euclidian Distance Algorithm

In this paper we describe the implementation of control system with speech recognition. To implement this, we used the MFCC and Euclidian distance algorithm. Using COLEA tool we give the input acoustic wave as a speech signal. In this paper, the simulation of simple digital hearing aid was developed using MATLAB programming language. Speaker recognition systems contain two main modules: Speaker Identification and Speaker Verification. With the help of MFCC we extract the information from the recognized speech signal. MFCC, the main advantage is that it uses Mel frequency scaling which is very approximate to the human auditory system. We also used VQLBG algorithm (as proposed by Y. Linde, A. Buzo & R. Gray) to generate the codebook and after that using the Euclidian distance algorithm we compare the codebook with stored data base. The primary objective of this paper is to compare and summarize some of the well known methods used for speech recognition.

Bhagwan S. Sharma | Hiren Parmar

[1] S.H.S. Salleh,et al. Implementation of speaker identification system by means of personal computer , 2000, 2000 TENCON Proceedings. Intelligent Systems and Technologies for the New Millennium (Cat. No.00CH37119).

[2] M.G. Bellanger,et al. Digital processing of speech signals , 1980, Proceedings of the IEEE.

[3] Tomi Kinnunen,et al. Real-time speaker identification and verification , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[4] R. P. Ramachandran,et al. Robust speaker recognition: a feature-based approach , 1996, IEEE Signal Processing Magazine.

[5] Luc Vincent,et al. Exact Euclidean distance function by chain propagations , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6] Boling Xu,et al. Binary quantization of feature vectors for robust text-independent speaker identification , 1999, IEEE Trans. Speech Audio Process..

[7] Richard H. Wilson,et al. Speech Recognition Performance of Patients with Sensorineural Hearing Loss Under Unaided and Aided Conditions Using Linear and Compression Hearing Aids , 2002, Ear and hearing.

[8] R. Gray,et al. Vector quantization , 1984, IEEE ASSP Magazine.

[9] Thomas Quatieri,et al. Discrete-Time Speech Signal Processing: Principles and Practice , 2001 .

[10] Yoshua Bengio,et al. Discriminative feature and model design for automatic speech recognition , 1997, EUROSPEECH.

[11] Sadaoki Furui. Speaker-dependent-feature extraction, recognition and processing techniques , 1991, Speech Commun..

[12] Allen Gersho,et al. Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[13] Brian C. J. Moore,et al. The effect on speech intelligibility of varying compression time constants in a digital hearing aid , 2004, International journal of audiology.

[14] Biing-Hwang Juang,et al. A vector quantization approach to speaker recognition , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.