Feature enhancement for classifier optimization and dimensionality reduction

Voice is important for professionals like speakers, teachers, actors, singers and it is the important tool for communication. Laryngeal pathologies induce perturbations in the speech signal. Speech signal is discriminated as pathological or healthy based on roughness - breathiness - hoarseness (RBH) in the quality of signal. In recent years pattern recognition along with various signal processing techniques has emerged as an effective non invasive tool for diagnosis of pathological condition. Signal processing techniques tend to generate large number of features representing the signal. Automatic feature reduction techniques are vital in identifying the relevant features and eliminating the redundant ones. We extract features from speech signal using the acoustic analysis. Features are enhanced by alleviating gender bias. Periodic variations in the signal are captured using statistical techniques. We investigate intelligent system to generate reduced feature subset with improvement in diagnostic performance.

[1]  Hong-Goo Kang,et al.  An Investigation of Vocal Tract Characteristics for Acoustic Discrimination of Pathological Voices , 2013, BioMed research international.

[2]  J. I. Godino-Llorente,et al.  Pathological likelihood index as a measurement of the degree of voice normality and perceived hoarseness. , 2010, Journal of voice : official journal of the Voice Foundation.

[3]  Thierry Dutoit,et al.  Assessment of audio features for automatic cough detection , 2011, 2011 19th European Signal Processing Conference.

[4]  Yannis Stylianou,et al.  On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices , 2011, Logopedics, phoniatrics, vocology.

[5]  Hugo Leonardo Rufiner,et al.  Visualization of normal and pathological speech data , 2007, MAVEBA.

[6]  José Fonseca,et al.  LPC Spectrum First Peak Analysis for Voice Pathology Detection , 2013 .

[7]  Dirk Michaelis,et al.  Acoustic "breathiness measures" in the description of pathologic voices , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[8]  Yannis Stylianou,et al.  Dysphonia detection based on modulation spectral features and cepstral coefficients , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Maciej Kłaczyński,et al.  A Vibroacoustic Model of Selected Human Larynx Diseases , 2007, International journal of occupational safety and ergonomics : JOSE.

[10]  Raymond D. Kent,et al.  Acoustic studies of dysarthric speech: methods, progress, and potential. , 1999, Journal of communication disorders.

[11]  B. Yegnanarayana,et al.  Analysis of breathy voice based on excitation characteristics of speech production , 2012, 2012 International Conference on Signal Processing and Communications (SPCOM).

[12]  Antanas Verikas,et al.  Fusion of voice signal information for detection of mild laryngeal pathology , 2014, Appl. Soft Comput..

[13]  Ashok Ghatol,et al.  Feature selection for medical diagnosis : Evaluation for cardiovascular diseases , 2013, Expert Syst. Appl..

[14]  Sazali Yaacob,et al.  Comparison of speech parameterization techniques for the classification of speech disfluencies , 2013 .

[15]  Germán Castellanos-Domínguez,et al.  An improved method for voice pathology detection by means of a HMM-based feature space transformation , 2010, Pattern Recognit..

[16]  Yannis Stylianou,et al.  Normalized modulation spectral features for cross-database voice pathology detection , 2009, INTERSPEECH.

[17]  Elmar Nöth,et al.  Automatic Rating of Hoarseness by Text-based Cepstral and Prosodic Evaluation , 2012, TSD.

[18]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality. , 1994, Journal of speech and hearing research.

[19]  Marcelo de Oliveira Rosa,et al.  Adaptive estimation of residue signal for voice pathology diagnosis , 2000, IEEE Trans. Biomed. Eng..

[20]  Yannis Stylianou,et al.  Voice Pathology Detection Based eon Short-Term Jitter Estimations in Running Speech , 2009, Folia Phoniatrica et Logopaedica.

[21]  Ashok A. Ghatol,et al.  Dimensionality Reduction Techniques for Improved Diagnosis of Heart Disease , 2013 .

[22]  Sazali Yaacob,et al.  Classification of Speech Dysfluencies Using LPC Based Parameterization Techniques , 2012, Journal of Medical Systems.

[23]  Adas Gelzinis,et al.  On Feature Extraction for Voice Pathology Detection from Speech Signals , 2011 .

[24]  Ronald J. Baken,et al.  Clinical measurement of speech and voice , 1987 .

[25]  Mohammad Pooyan,et al.  An optimum algorithm in pathological voice quality assessment using wavelet-packet-based features, linear discriminant analysis and support vector machine , 2012, Biomed. Signal Process. Control..

[26]  César David Paredes Crovato,et al.  The Use of Wavelet Packet Transform and Artificial Neural Networks in Analysis and Classification of Dysphonic Voices , 2007, IEEE Transactions on Biomedical Engineering.

[27]  Hugo Leonardo Rufiner,et al.  Dimensionality reduction for visualization of normal and pathological speech data , 2009, Biomed. Signal Process. Control..

[28]  Wlodzislaw Duch,et al.  Feature Selection for High-Dimensional Data: A Kolmogorov-Smirnov Correlation-Based Filter , 2005, CORES.