Speech analysis in search of speakers with MFCC, PLP, Jitter and Shimmer

This paper describes the implementation of a recognition system based on Hidden Markov Models (HMM). For recognition, we use two parameterization techniques that model the mechanisms of the human ear, Mel-frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP), applied to the TIMIT database. We also use two indices, jitter and shimmer, because they carry precise information about a speaker's voice.
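As an illustration of the two voice indices, the sketch below computes the commonly used "local" variants of jitter and shimmer: the mean absolute difference between consecutive glottal cycles, divided by the mean value. This is an assumption about the definition, since the abstract does not state which variant is used. The function names and the sample period and amplitude values are hypothetical, and the per-cycle periods and peak amplitudes are assumed to have already been extracted from the signal.

```python
def local_perturbation(values):
    """Mean absolute difference between consecutive values, divided by their mean."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return mean_diff / mean_value


def local_jitter(periods):
    """Cycle-to-cycle variation of the pitch period (assumed 'local' definition)."""
    return local_perturbation(periods)


def local_shimmer(amplitudes):
    """Cycle-to-cycle variation of the peak amplitude (assumed 'local' definition)."""
    return local_perturbation(amplitudes)


if __name__ == "__main__":
    # Hypothetical per-cycle measurements: periods in seconds, amplitudes in arbitrary units.
    periods = [0.0100, 0.0102, 0.0099, 0.0101]
    amplitudes = [0.80, 0.82, 0.79, 0.81]
    print(f"jitter  = {100 * local_jitter(periods):.2f} %")
    print(f"shimmer = {100 * local_shimmer(amplitudes):.2f} %")
```

Both indices are dimensionless ratios, so they could be appended to an MFCC or PLP feature vector after suitable scaling; whether and how the original system combines them is not specified in the abstract.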
