Paper information - Voice Recognition System for Home Security Keys with Mel-Frequency Cepstral Coefficient Method and Backpropagation Artificial Neural Network


In this paper, we present the design of a home door lock control system that is activated by automatic speaker recognition, a form of biometrics. Access to a house or building by means of conventional keys, PINs, or smartcards does not reliably improve security, because such credentials cannot confirm that the person presenting them is the real owner. To overcome this problem, the speaker's voice is used as the key to the house door. Speaker recognition is the process of automatically recognizing who is speaking based on the acoustic characteristics of the input speech. This technique allows the speaker's voice to be used to verify identity and control access to the home. It is proposed mainly because a voice cannot be stolen, copied, forgotten, lost, or accurately guessed. The proposed system uses Mel-frequency cepstral coefficients (MFCC) for feature extraction and a backpropagation artificial neural network for voice recognition. The results show that the success rate in distinguishing the homeowner reaches 97% under optimal conditions, namely a quiet environment (34 dB) with the voice captured at a distance of about 10 cm.
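
The pipeline named in the abstract (MFCC feature extraction followed by a neural network trained with backpropagation) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The abstract gives no frame sizes, coefficient counts, or network layout, so every parameter below (16 kHz sampling, 25 ms frames, 26 mel filters, 13 coefficients, one hidden layer of 8 units) is an assumption. Synthetic harmonic tones stand in for recorded speech, and the helper names `mel_filterbank`, `mfcc`, `synth_voice`, `train_mlp`, and `make_set` are hypothetical.

```python
import numpy as np

def mel_filterbank(sr, n_fft, n_mels):
    """Triangular filters spaced evenly on the mel scale."""
    hz_to_mel = lambda f: 2595.0 * np.log10(1.0 + f / 700.0)
    mel_to_hz = lambda m: 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(1, n_mels + 1):
        left, center, right = bins[m - 1], bins[m], bins[m + 1]
        for k in range(left, center):          # rising edge of the triangle
            fb[m - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):         # falling edge of the triangle
            fb[m - 1, k] = (right - k) / max(right - center, 1)
    return fb

def mfcc(signal, sr=16000, n_fft=512, win=400, hop=160, n_mels=26, n_ceps=13):
    """Return an (n_frames, n_ceps) matrix of MFCCs (all sizes are assumed)."""
    sig = np.append(signal[0], signal[1:] - 0.97 * signal[:-1])   # pre-emphasis
    n_frames = 1 + (len(sig) - win) // hop
    idx = np.arange(win)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = sig[idx] * np.hamming(win)                           # windowed frames
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft       # power spectrum
    log_mel = np.log(power @ mel_filterbank(sr, n_fft, n_mels).T + 1e-10)
    n = np.arange(n_mels)
    dct = np.cos(np.pi / n_mels * (n[None, :] + 0.5) * np.arange(n_ceps)[:, None])
    return log_mel @ dct.T                                        # DCT-II of log-mel energies

def synth_voice(f0, rng, sr=16000, dur=0.5):
    """Synthetic stand-in for speech: five harmonics of f0 plus light noise."""
    t = np.arange(int(sr * dur)) / sr
    tone = sum(np.sin(2 * np.pi * f0 * h * t) / h for h in range(1, 6))
    return tone + 0.05 * rng.standard_normal(t.size)

def train_mlp(X, y, hidden=8, lr=0.5, epochs=500, seed=0):
    """One-hidden-layer sigmoid network trained by batch backpropagation."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(0, 0.5, (X.shape[1], hidden)); b1 = np.zeros(hidden)
    W2 = rng.normal(0, 0.5, (hidden, 1)); b2 = np.zeros(1)
    sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))
    for _ in range(epochs):
        h = sigmoid(X @ W1 + b1)                 # forward pass
        p = sigmoid(h @ W2 + b2)[:, 0]
        d_out = (p - y)[:, None] / len(y)        # cross-entropy gradient at output
        d_hid = (d_out @ W2.T) * h * (1 - h)     # error propagated back to hidden layer
        W2 -= lr * h.T @ d_out; b2 -= lr * d_out.sum(0)
        W1 -= lr * X.T @ d_hid; b1 -= lr * d_hid.sum(0)
    return lambda Xn: sigmoid(sigmoid(Xn @ W1 + b1) @ W2 + b2)[:, 0]

rng = np.random.default_rng(0)

def make_set(n):
    # Label 1 = "owner" (f0 near 120 Hz), label 0 = "other" (f0 near 220 Hz).
    y = np.tile([1.0, 0.0], n // 2)
    X = np.array([mfcc(synth_voice((120 if lab else 220) + rng.uniform(-10, 10), rng)).mean(axis=0)
                  for lab in y])
    return X, y

X_train, y_train = make_set(40)
X_test, y_test = make_set(20)
mu, sd = X_train.mean(0), X_train.std(0) + 1e-8      # standardize with training statistics
predict = train_mlp((X_train - mu) / sd, y_train)
accuracy = np.mean((predict((X_test - mu) / sd) > 0.5) == y_test)
print(f"held-out accuracy on synthetic data: {accuracy:.2f}")
```

Averaging the MFCC frames into a single vector per utterance is a simplification chosen to keep the example short. The accuracy printed here describes only the synthetic tones and says nothing about the 97% figure reported for real recordings.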
