Paper Information - A Mel-Filterbank and MFCC-based Neural Network Approach to Train the Houston Toad Call Detection System Design

Speaker recognition, or voice detection, is an active area of signal processing that covers both human and animal vocalizations. This paper proposes a naive approach to building a predictive model that detects the Houston Toad mating call signature in an audio file, a task that can be described as toad voice activity detection. To that end, several representative toad call frames with distinctive characteristics were examined in audio files. The audio file is bandpass filtered, split into frames, and each frame is multiplied by a Hamming window. Next, Mel-filterbank energies and Mel-Frequency Cepstral Coefficients (MFCC) are used for feature extraction, while a Support Vector Machine (SVM) and a Multi-layer Perceptron (MLP) neural network are used as classifiers to determine which fits best. The experimental results show that the MLP neural network achieves higher accuracy than the SVM, indicating that it has the greater potential for this classification task.
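
The feature extraction stage described in the abstract (framing, Hamming windowing, Mel-filterbank energies, MFCC) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the 16 kHz sample rate, the 25 ms frame and 10 ms hop, the 26 filters, the 13 coefficients, and the 300 to 8000 Hz band are all assumptions chosen for illustration, and the bandpass filter and the SVM/MLP classifiers are left out.

```python
import numpy as np

def hz_to_mel(hz):
    # Common mel-scale formula (one of several conventions in use).
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def frame_signal(signal, frame_len, hop):
    """Split a 1-D signal into overlapping frames and apply a Hamming window."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return signal[idx] * np.hamming(frame_len)

def mel_filterbank(n_filters, n_fft, sample_rate, f_low, f_high):
    """Triangular filters evenly spaced on the mel scale.

    Returns an array of shape (n_filters, n_fft // 2 + 1).
    """
    mel_points = np.linspace(hz_to_mel(f_low), hz_to_mel(f_high), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for m in range(1, n_filters + 1):
        left, center, right = bins[m - 1], bins[m], bins[m + 1]
        for k in range(left, center):      # rising slope
            fbank[m - 1, k] = (k - left) / (center - left)
        for k in range(center, right):     # falling slope
            fbank[m - 1, k] = (right - k) / (right - center)
    return fbank

def dct_ii(x, n_coeffs):
    """Unnormalized DCT-II along the last axis, keeping the first n_coeffs terms."""
    n = x.shape[-1]
    k = np.arange(n_coeffs)[:, None]
    basis = np.cos(np.pi * k * (2 * np.arange(n)[None, :] + 1) / (2 * n))
    return x @ basis.T

def extract_features(signal, sample_rate, frame_len=400, hop=160, n_fft=512,
                     n_filters=26, n_coeffs=13, f_low=300.0, f_high=8000.0):
    """Return (log mel-filterbank energies, MFCCs), one row per frame.

    All default parameter values are illustrative assumptions.
    """
    frames = frame_signal(signal, frame_len, hop)
    power = np.abs(np.fft.rfft(frames, n=n_fft)) ** 2 / n_fft
    fbank = mel_filterbank(n_filters, n_fft, sample_rate, f_low, f_high)
    log_mel = np.log(power @ fbank.T + 1e-10)  # small offset avoids log(0)
    return log_mel, dct_ii(log_mel, n_coeffs)

# Usage on a synthetic one-second tone standing in for a recording.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000.0 * t)
log_mel, coeffs = extract_features(tone, sample_rate)
print(log_mel.shape, coeffs.shape)
```

Either feature matrix could then be passed, frame by frame, to a classifier such as an SVM or an MLP; which features and hyperparameters the paper actually used is not stated in the abstract.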

Damian Valles | Abdullah Al Bashit
