Glottal Closure Instant Detection from Speech Signal Using Voting Classifier and Recursive Feature Elimination

In our previous work, we introduced a classification-based method for the automatic detection of glottal closure instants (GCIs) from the speech signal and we showed it was able to perform very well on several test datasets. In this paper, we investigate whether adding more features (voiced/unvoiced, harmonic/noise, spectral etc.) and/or using an ensemble of classifiers such as a voting classifier can further improve GCI detection performance. We show that using additional features leads to a better detection accuracy; best results were obtained when recursive feature elimination was applied on the whole feature set. In addition, a voting classifier is shown to outperform other classifiers and other existing GCI detection algorithms on publicly available databases.

[1]  Thomas Drugman Residual Excitation Skewness for Automatic Speech Polarity Detection , 2013, IEEE Signal Processing Letters.

[2]  Patrick A. Naylor,et al.  Detection of Glottal Closure Instants From Speech Signals: A Quantitative Review , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Abeer Alwan,et al.  Glottal source processing: From analysis to applications , 2014, Comput. Speech Lang..

[4]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[5]  Daniel Tihelka,et al.  Classification-Based Detection of Glottal Closure Instants from Speech Signals , 2017, INTERSPEECH.

[6]  J. Matoušek,et al.  On the detection of pitch marks using a robust multi-phase algorithm , 2011, Speech Commun..

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Thierry Dutoit,et al.  Glottal closure and opening instant detection from speech signals , 2019, INTERSPEECH.

[9]  Daniel Tihelka,et al.  A robust multi-phase pitch-mark detection algorithm , 2007, INTERSPEECH.

[10]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[11]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[12]  Christophe d'Alessandro,et al.  Robust glottal closure detection using the wavelet transform , 1999, EUROSPEECH.

[13]  Prasanta Kumar Ghosh,et al.  An error correction scheme for GCI detection algorithms using pitch smoothness criterion , 2015, INTERSPEECH.

[14]  Ronald A. Cole,et al.  Pitch detection with a neural-net classifier , 1991, IEEE Trans. Signal Process..

[15]  Daniel Tihelka,et al.  Pitch Marks at Peaks or Valleys? , 2007, TSD.

[16]  A. G. Ramakrishnan,et al.  Epoch Extraction Based on Integrated Linear Prediction Residual Using Plosion Index , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  Ian S. Howard,et al.  Real-time portable multi-layer perceptron voice fundamental-period extractor for hearing aids and cochlear implants , 1990, Speech Commun..

[18]  Mike Brookes,et al.  Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  Jindrich Matousek,et al.  Automatic pitch-synchronous phonetic segmentation , 2008, INTERSPEECH.

[20]  Patrick A. Naylor,et al.  Estimation of Glottal Closing and Opening Instants in Voiced Speech Using the YAGA Algorithm , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  John Kane,et al.  COVAREP — A collaborative voice analysis repository for speech technologies , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Mark Huckvale,et al.  Speech fundamental period estimation using a trainable pattern classifier , 1988 .

[23]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[24]  Khalid Daoudi,et al.  Detection of Glottal Closure Instants Based on the Microcanonical Multiscale Formalism , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[25]  Alan W. Black,et al.  The CMU Arctic speech databases , 2004, SSW.

[26]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[27]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .